Protein

Protein accession
Q37894 [UniProt]
Representative
83ZZs
Source
UniProt (cluster: phalp2_11784)
Protein name
Morphogenesis protein 1
Lysin probability
96%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MFYSKNAYLSMKEMTVNAQYILNYLLPRGWTKNAICGMLGNMQTESTINPGIWQNLDEGNTSLGFGLVQWTPATKYLNWADRNGLKRDHMDSQLKRILWEVDNNEQWINLRNMTFKEFTKSTKSANELAMIFLASYERPANPNQPERGTQAEYWFKTLTGKGSTGIQLAQFPMDIINITQGENGSFSHKGTLCIDFVGKHEKYPYYAPCDCTCVWRGDESAYLAWTSDKEVMCADGTVRYITWVCVHDENLLYNVGKKLKKGELMGHSGKGGRATGDHLHLNVIEGNKYQGWVKKPDSALAGTELHIYDVFAVNGVEIVNGLGYDWKTSDWVDGSDENNGDDKDKDKDETKNIVNLLLCGALNGW
Physico‐chemical
properties
protein length:365 AA
molecular weight:41283,1 Da
isoelectric point:5,68
hydropathy:-0,55
Representative Protein Details
Accession
83ZZs
Protein name
83ZZs
Sequence length
259 AA
Molecular weight
29173,29190 Da
Isoelectric point
8,47806
Sequence
MANYDNTRYTYAQVLAGLGYYMKDDQLRAATEVKIMQTKLNKVNYNCGTPDGKFGNNTDTAVRAFQRAKGLTVDGKAGKNTLKALDTATASGGDSTGILYCSNRYLTLEQMKVNAQYILDYLRDRKWTKNAVCGMLGNMQTESTINPGIWQSLKEDNLSGGFGLVQWTPASKYIDWANTQGLEVANMDSQLQRILYELEYGKQYYATNAYKLSFSAFSQSTESAYYLGCAFLHNYERPKNSFQDTTRGGQATYWYENLT
Other Proteins in cluster: phalp2_11784
Total (incl. this protein): 17 Avg length: 351,2 Avg pI: 5,50

Protein ID Length (AA) pI
83ZZs 259 8,47806
OBT2 287 8,71511
A0A222YYN6 364 5,13020
A0A6M9Z7E5 352 5,20744
A0A142IG94 363 5,18897
P07538 365 5,13020
A0A7T8EP20 365 5,13020
A0A889IM74 365 5,06307
A0A889INF0 365 5,05460
P15132 365 4,92660
A0A976N0B7 365 4,78194
A0A9E7N068 331 4,68157
A0AAE9FJX8 365 5,13679
A0AAE9FLE8 365 5,05460
A0AAE9JU96 365 5,05460
A0AAE9JVZ7 365 5,05460
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1644
8rpOB
718 54,6% 161 4.235E-66
2 phalp2_26077
7r1w8
2 39,8% 168 1.423E-32
3 phalp2_9119
5OLtN
133 36,3% 179 6.055E-29

Domains

Domains [InterPro]
Representative sequence (used for alignment): 83ZZs (259 AA)
Member sequence: Q37894 (365 AA)
1 259 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage B103 (Bacteriophage B103)
[NCBI]
2994042 Salasmaviridae > Beecentumtrevirus > Beecentumtrevirus B103
Host Bacillus subtilis
[NCBI]
1423 Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus > Bacillus subtilis group

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
X99260 [NCBI]
CDS location
range 13639 -> 14736
strand +
CDS
ATGTTTTACAGTAAAAATGCCTATCTATCAATGAAAGAAATGACAGTCAATGCACAGTATATATTAAATTATTTACTACCGAGAGGTTGGACGAAAAACGCTATTTGCGGAATGCTTGGTAACATGCAAACGGAAAGCACAATCAACCCTGGAATCTGGCAGAATCTTGATGAGGGTAACACCTCATTGGGATTCGGTCTTGTTCAGTGGACACCTGCAACAAAATATTTAAATTGGGCAGATAGAAACGGTCTTAAACGAGATCATATGGATAGCCAGTTGAAGCGGATACTCTGGGAAGTTGACAACAATGAGCAATGGATAAACCTCAGGAATATGACGTTTAAGGAATTTACAAAAAGCACGAAGTCTGCAAACGAGTTAGCCATGATTTTCCTAGCCTCATATGAGCGTCCTGCAAATCCAAATCAGCCAGAGCGGGGGACGCAAGCTGAGTATTGGTTTAAGACGTTAACTGGAAAAGGTTCTACTGGCATTCAACTCGCTCAATTCCCAATGGACATTATTAATATCACACAGGGTGAAAATGGAAGTTTCTCACATAAAGGAACTCTGTGCATTGACTTTGTGGGTAAACATGAAAAATACCCGTATTATGCTCCATGCGACTGCACGTGTGTCTGGAGAGGTGACGAAAGTGCCTACCTTGCGTGGACTTCTGATAAAGAAGTGATGTGTGCAGACGGCACTGTTAGATATATTACGTGGGTGTGTGTTCATGACGAAAACTTATTGTACAATGTTGGAAAGAAGCTTAAAAAGGGCGAATTAATGGGACACAGTGGTAAAGGCGGACGTGCCACAGGTGATCATCTACATTTAAACGTTATTGAAGGTAATAAATATCAAGGGTGGGTTAAGAAGCCCGATTCGGCATTGGCGGGAACAGAGCTACACATTTATGATGTGTTTGCTGTGAATGGTGTTGAAATTGTAAACGGTCTGGGTTATGACTGGAAAACAAGTGACTGGGTTGACGGCTCAGATGAGAATAATGGTGACGATAAAGATAAAGATAAAGACGAAACCAAAAATATTGTGAACCTGTTACTATGTGGAGCACTTAACGGTTGGTAA

Gene Ontology

Description Category Evidence (source)
GO:0006508 proteolysis biological process None (UniProt)
GO:0008237 metallopeptidase activity molecular function None (UniProt)
GO:0016798 hydrolase activity, acting on glycosyl bonds molecular function None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044423 virion component cellular component None (UniProt)
GO:0046872 metal ion binding molecular function None (UniProt)
GO:0071555 cell wall organization biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.- None None N/A UniProt
3.4.-.- None None N/A UniProt

Tertiary structure

PDB ID
Q37894
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi00001384d4_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (83ZZs) rather than this protein.
PDB ID
83ZZs
Method AlphaFoldv2
Resolution 92.86
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50