Protein

Protein accession
A0A142IG94 [UniProt]
Representative
83ZZs
Source
UniProt (cluster: phalp2_11784)
Protein name
Morphogenesis protein
Lysin probability
99%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MTYSKNAYLTLEEMTVNAEYILSYLLSRGWTKNAICGMLGNMQSESTINPGIWQNLDEGNTSLGFGLVQWTPATKYLNWADRNGLKRDDITSQLKRILWEVENNEQWINVRNMTFKEFTKSTKSAYDLAMIFIASYERPANPNQPERGTQAEYWFKTLTGKGSSGIQLAQFPLDIINITQGENGSYSHKGTLCIDFVGRHEKYPYYAPCDCTCVWRGDESAYLAWTSDKEVMCADGVIRYITWVCVHEQPLMYNVGKKLKKGELMGHTGIGGNVTGDHVHLNVIEGNKYQGWVKKPDSALAGTELHLYDVFAVNGVEIVNGLGYDWKTSDWVDGSDGNNGDDTEKDETKNIVNLLLCGALNGW
Physico‐chemical
properties
protein length:363 AA
molecular weight:40845,5 Da
isoelectric point:5,19
hydropathy:-0,44
Representative Protein Details
Accession
83ZZs
Protein name
83ZZs
Sequence length
259 AA
Molecular weight
29173,29190 Da
Isoelectric point
8,47806
Sequence
MANYDNTRYTYAQVLAGLGYYMKDDQLRAATEVKIMQTKLNKVNYNCGTPDGKFGNNTDTAVRAFQRAKGLTVDGKAGKNTLKALDTATASGGDSTGILYCSNRYLTLEQMKVNAQYILDYLRDRKWTKNAVCGMLGNMQTESTINPGIWQSLKEDNLSGGFGLVQWTPASKYIDWANTQGLEVANMDSQLQRILYELEYGKQYYATNAYKLSFSAFSQSTESAYYLGCAFLHNYERPKNSFQDTTRGGQATYWYENLT
Other Proteins in cluster: phalp2_11784
Total (incl. this protein): 17 Avg length: 351,2 Avg pI: 5,50

Protein ID Length (AA) pI
83ZZs 259 8,47806
OBT2 287 8,71511
A0A222YYN6 364 5,13020
A0A6M9Z7E5 352 5,20744
P07538 365 5,13020
Q37894 365 5,68028
A0A7T8EP20 365 5,13020
A0A889IM74 365 5,06307
A0A889INF0 365 5,05460
P15132 365 4,92660
A0A976N0B7 365 4,78194
A0A9E7N068 331 4,68157
A0AAE9FJX8 365 5,13679
A0AAE9FLE8 365 5,05460
A0AAE9JU96 365 5,05460
A0AAE9JVZ7 365 5,05460
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1644
8rpOB
718 54,6% 161 4.235E-66
2 phalp2_26077
7r1w8
2 39,8% 168 1.423E-32
3 phalp2_9119
5OLtN
133 36,3% 179 6.055E-29

Domains

Domains [InterPro]
Representative sequence (used for alignment): 83ZZs (259 AA)
Member sequence: A0A142IG94 (363 AA)
1 259 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage vB_BsuP-Goe1
[NCBI]
1807511 Salasmaviridae > Beecentumtrevirus > Beecentumtrevirus Goe1
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KU831549 [NCBI]
CDS location
range 13554 -> 14645
strand +
CDS
ATGACTTATAGCAAAAACGCCTATCTGACACTGGAAGAAATGACAGTCAACGCTGAGTATATATTAAGTTATTTACTGTCGAGAGGTTGGACGAAAAACGCTATTTGCGGGATGCTTGGAAACATGCAATCAGAAAGTACAATCAACCCTGGAATATGGCAGAATCTTGATGAGGGTAACACCTCATTGGGATTCGGTCTTGTCCAGTGGACACCCGCAACAAAATATTTGAATTGGGCGGATCGTAACGGACTTAAAAGAGATGATATAACTAGCCAATTGAAACGAATACTCTGGGAAGTTGAAAACAATGAGCAGTGGATAAATGTCAGGAATATGACATTTAAGGAATTCACAAAAAGCACAAAATCTGCTTATGATCTAGCAATGATTTTCATCGCTTCATATGAACGACCTGCAAATCCTAATCAGCCTGAGAGGGGAACGCAAGCTGAATACTGGTTTAAAACATTAACTGGAAAAGGGTCTAGCGGTATTCAGCTTGCACAATTCCCACTTGACATTATCAATATTACACAAGGTGAAAATGGGAGTTACTCACATAAGGGAACGCTATGTATTGACTTTGTAGGTAGACATGAGAAATACCCATATTATGCTCCTTGTGACTGCACGTGTGTGTGGAGAGGTGACGAAAGCGCCTATCTTGCGTGGACTTCTGATAAGGAAGTTATGTGTGCAGACGGTGTTATTAGATATATTACATGGGTATGTGTTCACGAACAGCCATTAATGTACAATGTTGGAAAGAAACTTAAAAAGGGTGAATTAATGGGTCACACCGGTATCGGTGGAAATGTAACAGGTGATCATGTACATTTAAACGTTATTGAGGGTAATAAATATCAAGGATGGGTAAAGAAACCTGATTCAGCATTAGCAGGAACAGAATTACATCTTTATGATGTGTTTGCTGTAAATGGTGTTGAAATTGTTAATGGTCTGGGATACGACTGGAAAACAAGTGACTGGGTTGACGGCTCAGACGGGAATAATGGTGACGATACAGAGAAAGACGAAACCAAAAATATTGTGAACCTGTTACTATGTGGGGCGCTTAACGGATGGTAA

Gene Ontology

Description Category Evidence (source)
GO:0003824 catalytic activity molecular function None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (83ZZs) rather than this protein.
PDB ID
83ZZs
Method AlphaFoldv2
Resolution 92.86
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50