Protein

Protein accession
P15132 [UniProt]
Representative
83ZZs
Source
UniProt (cluster: phalp2_11784)
Protein name
Morphogenesis protein 1
Lysin probability
99%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MVYVSNKYLTMSEMKVNAQYILNYLSSNGWTKQAICGMLGNMQSESTINPGLWQNLDEGNTSLGFGLVQWTPASNYINWANSQGLPYKDMDSELKRIIWEVNNNAQWINLRDMTFKEYIKSTKTPRELAMIFLASYERPANPNQPERGDQAEYWYKNLSGGGGGGLQLAQFPMDIINISQGENGSFSHKGTLCIDFVGKTEKYPYYAPCDCTCVWRGDASAYLAWTSDKEVMCADGSVRYITWVNVHESPLPFDVGKKLKKGDLMGHTGIGGNVTGDHWHFNVIDGKEYQGWTKKPDSCLAGTELHIYDVFAVNNVEIINGNGYDWKTSDWQDGDGGDGDDDNDNNKTKDLITLLLSDALHGWKA
Physico‐chemical
properties
protein length:365 AA
molecular weight:40924,3 Da
isoelectric point:4,93
hydropathy:-0,54
Representative Protein Details
Accession
83ZZs
Protein name
83ZZs
Sequence length
259 AA
Molecular weight
29173,29190 Da
Isoelectric point
8,47806
Sequence
MANYDNTRYTYAQVLAGLGYYMKDDQLRAATEVKIMQTKLNKVNYNCGTPDGKFGNNTDTAVRAFQRAKGLTVDGKAGKNTLKALDTATASGGDSTGILYCSNRYLTLEQMKVNAQYILDYLRDRKWTKNAVCGMLGNMQTESTINPGIWQSLKEDNLSGGFGLVQWTPASKYIDWANTQGLEVANMDSQLQRILYELEYGKQYYATNAYKLSFSAFSQSTESAYYLGCAFLHNYERPKNSFQDTTRGGQATYWYENLT
Other Proteins in cluster: phalp2_11784
Total (incl. this protein): 17 Avg length: 351,2 Avg pI: 5,50

Protein ID Length (AA) pI
83ZZs 259 8,47806
OBT2 287 8,71511
A0A222YYN6 364 5,13020
A0A6M9Z7E5 352 5,20744
A0A142IG94 363 5,18897
P07538 365 5,13020
Q37894 365 5,68028
A0A7T8EP20 365 5,13020
A0A889IM74 365 5,06307
A0A889INF0 365 5,05460
A0A976N0B7 365 4,78194
A0A9E7N068 331 4,68157
A0AAE9FJX8 365 5,13679
A0AAE9FLE8 365 5,05460
A0AAE9JU96 365 5,05460
A0AAE9JVZ7 365 5,05460
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1644
8rpOB
718 54,6% 161 4.235E-66
2 phalp2_26077
7r1w8
2 39,8% 168 1.423E-32
3 phalp2_9119
5OLtN
133 36,3% 179 6.055E-29

Domains

Domains [InterPro]
Representative sequence (used for alignment): 83ZZs (259 AA)
Member sequence: P15132 (365 AA)
1 259 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage phi29 (Bacteriophage phi-29)
[NCBI]
2884424 Salasmaviridae > Salasvirus > Salasvirus phi29
Host Bacillus subtilis
[NCBI]
1423 Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus > Bacillus subtilis group

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
M14782 [NCBI]
CDS location
range 8804 -> 9901
strand +
CDS
ATGGTGTATGTATCTAATAAATATCTGACAATGTCTGAGATGAAGGTCAACGCTCAGTACATTCTAAACTACCTCAGTAGTAACGGTTGGACAAAGCAAGCCATTTGCGGGATGCTCGGTAATATGCAAAGTGAGAGCACCATAAATCCGGGATTATGGCAGAATCTTGACGAGGGTAACACCTCGTTAGGATTTGGGCTTGTGCAATGGACACCCGCTAGTAATTATATCAATTGGGCTAACAGTCAAGGCTTACCATATAAAGACATGGATAGTGAATTAAAGCGAATTATATGGGAAGTAAACAATAATGCTCAATGGATAAACTTACGTGACATGACCTTTAAAGAGTACATTAAGAGCACGAAGACGCCTAGAGAACTAGCGATGATATTTCTAGCATCTTATGAGCGACCTGCTAACCCCAACCAACCCGAACGGGGCGACCAAGCCGAATACTGGTATAAAAACCTCAGCGGCGGCGGAGGCGGTGGACTACAATTAGCACAATTCCCAATGGATATTATCAATATATCGCAAGGGGAAAACGGTAGTTTCTCGCATAAGGGAACACTTTGTATAGACTTTGTGGGCAAAACTGAAAAGTACCCGTATTATGCTCCGTGTGATTGCACATGTGTATGGCGGGGTGATGCGAGTGCATATCTAGCGTGGACTTCTGATAAAGAGGTTATGTGTGCGGACGGAAGTGTTCGCTATATTACATGGGTAAACGTTCATGAAAGTCCTTTGCCCTTTGATGTTGGTAAGAAGCTCAAAAAAGGCGATCTTATGGGACACACAGGTATCGGAGGAAACGTAACAGGCGACCATTGGCACTTCAATGTTATTGACGGTAAGGAGTACCAAGGATGGACAAAGAAACCTGATTCGTGTTTAGCAGGGACAGAGTTACACATATATGATGTTTTCGCTGTCAACAACGTGGAGATAATCAACGGAAACGGCTACGACTGGAAAACTAGTGATTGGCAAGACGGAGACGGTGGAGATGGCGACGACGACAACGATAACAATAAAACAAAAGATTTAATAACCCTTTTACTATCTGACGCCCTCCATGGTTGGAAAGCATAG

CDS Source ID
CDS Source
X04962 [NCBI]
CDS location
range 1 -> 385
strand +
CDS
TCGCTATATTACATGGGTAAACGTTCATGAAAGTCCTTTGCCCTTTGATGTTGGTAAGAAGCTCAAAAAAGGCGATCTGATGGGACACACAGGTATCGGAGGAAACGTAACAGGCGACCATTGGCACTTCAATGTTATTGACGGTAAGGAGTACCAAGGATGGACAAAGAAACCTGATTCGTGTTTAGCAGGGACAGAGTTACACATATATGATGTTTTCGCTGTCAACAACGTGGAGATAATCAACGGAAACGGCTACGACTGGAAAACTAGTGATTGGCAAGACGGAGACGGTGGAGATGGCGACGACGACAACGATAACAATAAAACAAAAGATTTAATAACCCTTTTACTATCTGACGCCCTCCATGGTTGGAAAGCATAG

CDS Source ID
CDS Source
EU771092 [NCBI]
CDS location
range 13947 -> 15041
strand +
CDS
ATGGTGTATGTATCTAATAAATATCTGACAATGTCTGAGATGAAGGTCAACGCTCAGTACATTCTAAACTACCTCAGTAGTAACGGTTGGACAAAGCAAGCCATTTGCGGGATGCTCGGTAATATGCAAAGTGAGAGCACCATAAATCCGGGATTATGGCAGAATCTTGACGAGGGTAACACCTCGTTAGGATTTGGGCTTGTGCAATGGACACCCGCTAGTAATTATATCAATTGGGCTAACAGTCAAGGACTACCATATAAGAACATGGATAGTGAATTAAAGCGAATTATATGGGAAGTAAACAATAATGCTCAATGGATAAACTTACGTGACATGACCTTTAAAGAGTACATTAAGAGCACGAAGACGCCTAGAGAACTAGCGATGATATTTCTAGCATCTTATGAGCGACCTGCTAACCCCAACCAACCCGAACGGGGCGACCAAGCCGAATACTGGTATAAAAACCTCAGCGGCGGCGGAGGCGGTGGACTACAATTAGCACAATTCCCAATGGATATTATCAATATATCGCAAGGGGAAAACGGTAGTTTCTCGCATAAGGGAACACTTTGTATAGACTTTGTGGGCAAAACTGAAAAGTACCCGTATTATGCTCCGTGTGATTGCACATGTGTATGGCGGGGTGATGCGAGTGCATATCTAGCGTGGACTTCTGATAAAGAGGTTATGTGTGCGGACGGAAGTGTTCGCTATATTACATGGGTAAACGTTCATGAAAGTCCTTTGCCCTTTGATGTTGGTAAGAAGCTCAAAAAAGGCGATCTGATGGGACACACAGGTATCGGAGGAAACGTAACAGGCGACCATTGGCACTTCAATGTTATTGACGGTAAGGAGTACCAAGGATGGACAAAGAAACCTGATTCGTGTTTAGCAGGGACAGAGTTACACATATATGATGTTTTCGCTGTCAACAACGTGGAGATAATCAACGGAAACGGCTACGACTGGAAAACTAGTGATTGGCAAGACGGAGACGGTGGAGATGGCGACGACAACGATAACAATAAAACAAAAGATTTAATAACCCTTTTACTATCTGACGCCCTCCATGGTTGGAAAGCATAG

Gene Ontology

Description Category Evidence (source)
GO:0006508 proteolysis biological process None (UniProt)
GO:0008237 metallopeptidase activity molecular function None (UniProt)
GO:0016798 hydrolase activity, acting on glycosyl bonds molecular function None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0046872 metal ion binding molecular function None (UniProt)
GO:0071555 cell wall organization biological process None (UniProt)
GO:0098004 virus tail fiber assembly biological process None (UniProt)
GO:0098023 virus tail, tip cellular component None (UniProt)
GO:0098932 symbiont entry into host cell via disruption of host cell wall peptidoglycan biological process None (UniProt)
GO:0098994 symbiont entry into host cell via disruption of host cell envelope biological process None (UniProt)
GO:0099002 symbiont genome ejection through host cell envelope, short tail mechanism biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.- None None N/A UniProt
3.4.-.- None None N/A UniProt

Tertiary structure

PDB ID
3CSQ
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
3CSR
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
3CSZ
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
3CT0
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
3CT1
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

Showing first 5 of 8 structures.


The structures below correspond to the cluster representative (83ZZs) rather than this protein.
PDB ID
83ZZs
Method AlphaFoldv2
Resolution 92.86
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50