Protein

Protein accession
M1I7M3 [UniProt]
Representative
50ZYP
Source
UniProt (cluster: phalp2_28844)
Protein name
Peptidase M15
Lysin probability
100%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MQLSKHFKLEEFTKSMTATRKGIDNTPGAGDIKNLENVCYEILEPVRAKFEKPITVTSGYRSEALCEAIGSKKTSQHAKGQAVDFEIAGVPNIQVAYWLQNNVDFDQLILEFYNPDDPAAGWVHVSYNESGSNRKQVLTYDGKKYDNGLPDMKWKDGKVEG
Physico‐chemical
properties
protein length:161 AA
molecular weight:18065,1 Da
isoelectric point:5,65
hydropathy:-0,64
Representative Protein Details
Accession
50ZYP
Protein name
50ZYP
Sequence length
246 AA
Molecular weight
27599,87910 Da
Isoelectric point
6,52525
Sequence
MPHQRQIALARAAGLMTMIFHFENTGKTIGQVAYEDDLQTQPLYHDGSKRPAWDELSCHAKWSWHNNPTARSTENHVSIHHNMLKNNGGTMTDLNQKLSEHFTLRELIKSPEAARLGIDNTPTPEVIENLTIICKRILEPVRQKFGVPFTPNSGYRSPALNAAIGGAKGSQHMTGEAVDIEIPGVSNYDLAWWIGKQLLFDQVILEHYEPGDPHSGWVHVSYSRTGENRTHCLTFDGKIYKLGLIA
Other Proteins in cluster: phalp2_28844
Total (incl. this protein): 4 Avg length: 180,5 Avg pI: 6,73

Protein ID Length (AA) pI
50ZYP 246 6,52525
A0A6J5LBE7 154 9,04383
A0A9E7ICG6 161 5,71388
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_2615
6LPFo
3 45,8% 155 9.257E-46
2 phalp2_39724
8AJC5
1 50,0% 152 8.269E-45
3 phalp2_38153
6WaM5
59 48,4% 161 3.141E-42
4 phalp2_30101
38zFk
347 48,1% 158 1.965E-38
5 phalp2_5035
1nwkL
67 41,8% 177 6.062E-32
6 phalp2_850
86GPz
3 34,2% 216 8.687E-30
7 phalp2_855
876CY
2 33,8% 210 8.177E-23
8 phalp2_29306
zsqs
20 33,6% 193 1.108E-20

Domains

Domains [InterPro]
Disordered region
Unannotated
PET_M15
Representative sequence (used for alignment): 50ZYP (246 AA)
Member sequence: M1I7M3 (161 AA)
1 246 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08291

Taxonomy

  Name Taxonomy ID Lineage
Phage Pelagibacter phage HTVC010P
[NCBI]
1283077 No lineage information
Host Candidatus Pelagibacter ubique HTCC1062
[NCBI]
335992 Proteobacteria > Alphaproteobacteria > Pelagibacterales > Pelagibacteraceae > Candidatus Pelagibacter >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KC465898 [NCBI]
CDS location
range 24036 -> 24521
strand +
CDS
ATGCAACTATCTAAACATTTTAAACTAGAAGAATTTACTAAATCAATGACAGCAACTCGTAAGGGTATTGATAATACACCTGGAGCTGGTGATATTAAAAACCTTGAGAATGTCTGTTATGAAATACTAGAACCAGTAAGAGCCAAGTTTGAGAAACCTATAACTGTTACATCAGGATATAGATCAGAAGCATTATGTGAAGCTATTGGTTCAAAGAAAACTTCACAACACGCAAAAGGTCAGGCGGTAGACTTTGAGATTGCAGGTGTACCTAACATTCAAGTAGCTTACTGGCTACAAAACAATGTAGACTTTGATCAATTAATATTAGAGTTTTATAATCCAGATGATCCTGCTGCTGGTTGGGTTCATGTATCTTACAACGAATCAGGATCAAATAGAAAACAAGTCTTAACTTATGATGGTAAGAAGTATGACAATGGTCTGCCAGATATGAAGTGGAAAGATGGAAAGGTAGAAGGATAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0002b29fac_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (50ZYP) rather than this protein.
PDB ID
50ZYP
Method AlphaFoldv2
Resolution 83.20
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50