Protein

Protein accession
A0A222Z8D1 [UniProt]
Representative
CHtt
Source
UniProt (cluster: phalp2_36386)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAAPAANIELFLPSKAPITQGLGAEQKQGGLHAGTDFAYMANGQVYPDIYAAAEGRVIWAGDSRNLGWPNVLYLNIDFDRTDNIDSSAGNYTIIEHYDANGNKLCLTGYGHQANIFVKVGDWVRAGQIIGEVGDTGFNFGKHLHFDFVLYPYDVDDYPYYGRVDPTPYFVNQFKIEEDDMYTQTDRERDNLVADRMGYLWKHYGPGQKGYKDDGEYAALLRDTKRIADVAAVNASNAHKVTQDVLFSVTPGISEQRPAGATILSILSIVAAAQNKTVEQILEGIPEAQLSGAEFVLVPKDSLPAQPAQ
Physico-chemical properties
protein length: 308 AA
molecular weight: 33833,3 Da
isoelectric point: 4,73
hydropathy: -0,31
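The page does not say how the hydropathy value is derived. As a minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), the calculation could look like the following. The scale choice and the function name `gravy` are assumptions for illustration, not something the record confirms.

```python
# Kyte-Doolittle hydropathy index per amino acid.
# Assumption: the page's "hydropathy" uses this scale; the record does not say so.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First 15 residues of A0A222Z8D1, used only as a short demo input.
    prefix = "MAAPAANIELFLPSK"
    print(len(prefix), round(gravy(prefix), 2))
```

A negative mean, such as the -0,31 reported here, indicates an overall hydrophilic sequence on this scale.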
Representative Protein Details
Accession
CHtt
Protein name
CHtt
Sequence length
323 AA
Molecular weight
35238,88140 Da
Isoelectric point
4,75557
Sequence
MAAQITLYRPSDAPISQGLGAAQKQGGLHAGTDFYYSYGGKIYDKAYAMASGKVIWASDSRGLPWPNILYLNIDFDRTDKLDSSAGNYVIIEHYDAFGNPIALSGYGHLAEIYVNVGDWVTGRQHIAKVGDTGFNYGKHLHVDFVLYPYDVDDAPYYGRVDPTPYFIDFEEDDMYTDADRERDIQAAKDAKAAKEAIIFGGTSMKYGASLQNITDDVPRRVAEYQVLRGGKKLTWIQDNADGTSAAIRTEAKVDALLAIVTKLTAASGTPVTVEQLVEAIDAAVDRTLGELTGTATVVIGRAPETHVQALEAHEILDAEEVKQ
Other Proteins in cluster: phalp2_36386
Total (incl. this protein): 15; Avg length: 309,7; Avg pI: 4,76

Protein ID Length (AA) pI
CHtt 323 4,75557
7jQkA 309 4,90210
7zT7x 308 4,72977
A0A140G6T8 309 4,90210
A0A515MLD7 308 4,72977
A0A222Z634 308 4,72977
A0A222Z6L7 308 4,72977
A0A222Z6W7 308 4,72977
A0A222Z772 308 4,72977
A0A222Z9E7 308 4,72977
A0A514A3S6 308 4,72977
A0A5P8D7G7 308 4,72977
A0A9X9P5Q4 308 4,72977
A0AA49E4X7 316 4,76870
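The table lists 14 IDs while the total is given as 15 including this protein. As a rough check of the summary line, the sketch below recomputes the averages, assuming the 15th member is A0A222Z8D1 with the length and (rounded) pI from its physico-chemical block. The helper names `parse_decimal` and `cluster_summary` are illustrative.

```python
def parse_decimal(text: str) -> float:
    """Parse a number written with a decimal comma, e.g. '4,75557'."""
    return float(text.replace(",", "."))


# Rows copied from the cluster table: protein ID, length (AA), pI.
TABLE = """\
CHtt 323 4,75557
7jQkA 309 4,90210
7zT7x 308 4,72977
A0A140G6T8 309 4,90210
A0A515MLD7 308 4,72977
A0A222Z634 308 4,72977
A0A222Z6L7 308 4,72977
A0A222Z6W7 308 4,72977
A0A222Z772 308 4,72977
A0A222Z9E7 308 4,72977
A0A514A3S6 308 4,72977
A0A5P8D7G7 308 4,72977
A0A9X9P5Q4 308 4,72977
A0AA49E4X7 316 4,76870
"""

# Assumption: this entry is the member not shown in the table.
THIS_PROTEIN = ("A0A222Z8D1", 308, parse_decimal("4,73"))


def cluster_summary(table: str, extra: tuple) -> tuple:
    """Return (member count, mean length, mean pI) for the table plus one extra row."""
    rows = []
    for line in table.strip().splitlines():
        protein_id, length, pi = line.split()
        rows.append((protein_id, int(length), parse_decimal(pi)))
    rows.append(extra)
    count = len(rows)
    mean_length = sum(row[1] for row in rows) / count
    mean_pi = sum(row[2] for row in rows) / count
    return count, mean_length, mean_pi


total, avg_len, avg_pi = cluster_summary(TABLE, THIS_PROTEIN)
print(total, round(avg_len, 1), round(avg_pi, 2))  # prints: 15 309.7 4.76
```

Under that assumption the recomputed values agree with the reported total, average length, and average pI.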
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13280 (3inuw) 7 27,1% 210 3.392E-08
2 phalp2_32733 (8iF6d) 6 25,8% 244 1.874E-07
3 phalp2_33279 (5HnbH) 5 26,6% 255 3.309E-07

Domains

Domains [InterPro]
[Domain architecture diagram. Tracks: PET_M23, Unannotated, Disordered region. Axis: residues 1 to 323 of the representative.]
Representative sequence (used for alignment): CHtt (323 AA)
Member sequence: A0A222Z8D1 (308 AA)
Domain positions follow the representative sequence; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Arthrobacter phage Correa [NCBI] 2024275 Mudcatvirus > Mudcatvirus correa
Host No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MF189171 [NCBI]
CDS location
range 1506 -> 2432
strand +
CDS
ATGGCGGCTCCAGCTGCAAATATCGAGCTATTCCTGCCTTCGAAAGCTCCTATTACTCAGGGGCTTGGTGCAGAACAAAAACAAGGCGGACTTCATGCTGGTACAGACTTCGCATATATGGCTAATGGTCAAGTCTATCCGGACATCTATGCCGCAGCTGAAGGACGAGTTATTTGGGCCGGAGATTCTCGTAATCTTGGTTGGCCTAATGTCCTTTACCTGAACATTGACTTTGATCGCACAGACAACATTGATAGTTCTGCGGGCAACTACACTATTATCGAACATTATGATGCTAATGGCAATAAGCTTTGTTTGACTGGTTACGGTCATCAAGCAAACATCTTTGTTAAAGTTGGCGATTGGGTTCGAGCAGGACAGATCATCGGTGAAGTTGGCGATACCGGATTTAACTTCGGTAAACATCTTCACTTCGACTTTGTGTTGTATCCGTACGATGTGGACGATTATCCATACTATGGTCGTGTTGATCCGACTCCGTACTTTGTAAACCAATTCAAGATTGAGGAAGACGACATGTACACTCAAACTGATCGTGAACGCGATAATCTCGTAGCGGATCGCATGGGCTATCTCTGGAAGCATTATGGCCCCGGACAAAAGGGCTATAAGGATGATGGCGAATACGCAGCACTCCTTCGAGATACAAAGCGAATTGCTGATGTAGCAGCAGTTAACGCTAGCAATGCACATAAGGTAACTCAAGATGTACTCTTCTCAGTTACTCCTGGAATCTCCGAGCAACGTCCTGCTGGCGCCACTATTCTTTCGATTCTGTCTATTGTGGCAGCAGCTCAGAACAAGACGGTTGAGCAGATTCTCGAGGGAATCCCTGAAGCACAGCTTTCTGGAGCTGAGTTTGTCTTGGTCCCCAAGGATTCTCTTCCTGCTCAGCCAGCACAATAA
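The CDS coordinates and the protein length can be cross-checked with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and uses the standard codon assignments, which the bacterial and phage code shares for amino acids. The names `cds_length` and `translate` are illustrative helpers, not part of any database API.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


nucleotides = cds_length(1506, 2432)   # 927 nt
codons = nucleotides // 3              # 309 codons, including the stop codon
residues = codons - 1                  # 308 residues
print(nucleotides, codons, residues)   # prints: 927 309 308

# First seven codons of the CDS above.
print(translate("ATGGCGGCTCCAGCTGCAAAT"))  # prints: MAAPAAN
```

The 927 nt range corresponds to 308 residues plus a stop codon, which matches the reported protein length, and the first codons translate to the start of the listed protein sequence.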

Gene Ontology

GO term Description Category Evidence (source)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000b94c020_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
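The confidence bands above can be expressed as a small lookup. This is a minimal sketch: the legend uses strict inequalities and does not say where a score of exactly 90, 70, or 50 belongs, so assigning those boundary values to the lower band is an assumption, as is the function name `plddt_band`.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence label used in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band,
    since the legend leaves them undefined.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 60.0, 30.0):
        print(value, plddt_band(value))
```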

The structures below correspond to the cluster representative (CHtt) rather than this protein.
PDB ID
CHtt
Method AlphaFoldv2
Resolution 78.81
Chain position -