Protein

Protein accession
20eQ9 [EnVhog]
Representative
U0z9
Source
EnVhog (cluster: phalp2_26276)
Protein name
20eQ9
Lysin probability
99%
PhaLP type
endolysin
Probability: 78% (predicted by ML model)
Protein sequence
MAFRPILVPLAVLMWAATAIIHGAHATEHPAPTAQLHHHHHDRSCHAKAHRACKGKKHVKHAIGHHGHWVITDEDDLPDGPMSNVLDVAAQYQGLDEHKDGKQLTQLFDTELDMSINPRKTAWCAAFANAVLVQAGYDYSGGLESASFVRYGKPVKEPARGDIVVLHGGRRSPTHVGFLVGTARVNGQLFYTVLGGNQSNRVQISYFPASKVIAIRRVG
Physico‐chemical
properties
protein length:219 AA
molecular weight:23931,0 Da
isoelectric point:9,17
hydropathy:-0,25
Representative Protein Details
Accession
U0z9
Protein name
U0z9
Sequence length
222 AA
Molecular weight
23980,22410 Da
Isoelectric point
10,01763
Sequence
MAYRSVSVPLAVLIWAATAIIQVAHATEHHGAKIAPLHHHHHHARSCAAKSRKACAKHHHTRHAVGHHGNWLISSESDLPEGPMAVVITAASAYEGLNQNRDAAALSKLFNEQLDLRINPQHTAWCAAFANAILVQTGHAFSGSIETMSFMRYGQPVKQPAQGDIVVLKGVSRRSLTHVGFLVGSAMVNGQLYYKVLGGNQSNSVRVSMFAASKVIAIRRAT
Other Proteins in cluster: phalp2_26276
Total (incl. this protein): 7 Avg length: 218,3 Avg pI: 9,38

Protein ID Length (AA) pI
U0z9 222 10,01763
2bEa1 218 9,29965
2bFef 219 9,40209
31yun 212 8,69435
47G4X 219 9,32279
mV01 219 9,77452
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36789
6KVwf
51 39,4% 137 4.260E-28
2 phalp2_37918
4UqBP
157 33,3% 147 2.343E-17
3 phalp2_39271
47VoK
404 28,1% 149 1.266E-06

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): U0z9 (222 AA)
Member sequence: 20eQ9 (219 AA)
1 222 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
20eQ9
Method AlphaFoldv2
Resolution 78.11
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (U0z9) rather than this protein.
PDB ID
U0z9
Method AlphaFoldv2
Resolution 79.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50