Protein

Protein accession
5yszY [EnVhog]
Representative
4kzMA
Source
EnVhog (cluster: phalp2_13392)
Protein name
5yszY
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSKFLKRPFVAFAPFFVLGLFVLVACATPQESPANVVEYATTTTPTTVISTTTTSSTTTSTTSTIVIPVPTTMAAAPEPIVEEPVAPPADLVGIEKMICDTFAGDCDKALSVVYCESRFNPSTVGSAGERGLFQIHPVHIPYLQERGLTWDAMFDPQANIAYAYDLYARAGWGPWTCA
Physico‐chemical
properties
protein length:178 AA
molecular weight:19134,7 Da
isoelectric point:4,63
hydropathy:0,23
Representative Protein Details
Accession
4kzMA
Protein name
4kzMA
Sequence length
138 AA
Molecular weight
15601,97550 Da
Isoelectric point
9,39080
Sequence
MTIFILLIAVSLFIGSKVAPYAPRLYETGTYTVAPKLTPAPVPVNKIIQAIAVEFEPEGKKVVLDAIRISFCESGWRPEALNTNKSGSTDHSIFQVNSFWKKVFGDGFTSDWRENIRIAHKIWLRNRSFGPWVCMDKL
Other Proteins in cluster: phalp2_13392
Total (incl. this protein): 9 Avg length: 163,8 Avg pI: 7,45

Protein ID Length (AA) pI
4kzMA 138 9,39080
2PS0E 153 5,69489
2Q0Df 186 9,44676
4HjZQ 167 8,68010
5F4ZX 166 9,59800
5cbQH 166 4,76944
5ff7o 140 7,79521
8dLOH 180 7,06448
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_31657
4GV6b
7 34,3% 134 8.181E-28
2 phalp2_16
4MrYc
22 39,2% 107 1.301E-22
3 phalp2_39375
4Fqro
3 26,8% 119 1.642E-18
4 phalp2_18772
164cC
5 27,8% 122 3.092E-15
5 phalp2_4553
3ZnDV
172 29,1% 96 5.713E-12
6 phalp2_23120
4Du7v
14 20,2% 138 1.068E-11
7 phalp2_16990
7ZmEr
12 27,9% 136 1.068E-11
8 phalp2_14343
4cr5d
12 24,2% 136 6.966E-11
9 phalp2_14996
ECkt
4 30,6% 88 1.778E-10
10 phalp2_34794
3yTQQ
1 26,6% 124 4.019E-09

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): 4kzMA (138 AA)
Member sequence: 5yszY (178 AA)
1 138 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4kzMA) rather than this protein.
PDB ID
4kzMA
Method AlphaFoldv2
Resolution 84.08
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50