Protein

Protein accession
5EAHl [EnVhog]
Representative
4QQBX
Source
EnVhog (cluster: phalp2_23194)
Protein name
5EAHl
Lysin probability
89%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIRTLATIALAIATANPRLPAAATRSHAAVLRAHVSGTGLDPLDVVAIIWEESGWRSGAISPDGEDVGLGQIRVRHLPGCRRDPDPVRNPGPDCRAARASLLDPANNMRLTVQALTRWRSICRKKVGRTDLASFLAGYGGLNSPAHNHWCGWKKSRGKWVRQPHPGVRRALEYRRRLRA
Physico‐chemical
properties
protein length:179 AA
molecular weight:19682,4 Da
isoelectric point:11,43
hydropathy:-0,40
Representative Protein Details
Accession
4QQBX
Protein name
4QQBX
Sequence length
183 AA
Molecular weight
20756,17800 Da
Isoelectric point
11,42562
Sequence
MPAAQLASYARAVVKHADRVSYDPLLAVAIAHHESGWRPSAVSRDGEDIGLGQIRARFVGACRKDPLPVKAPGKACRAVRARLKVGAYNIKLIFAYLKAWRKLCRAKTGSGKVRRVLMGYGGLSRPRHNQWCGARKKAGKWTDQKAPRAVRRIYNRWHDLRKGWAKCRQRCTAYKLARRRLTR
Other Proteins in cluster: phalp2_23194
Total (incl. this protein): 13 Avg length: 200,2 Avg pI: 11,05

Protein ID Length (AA) pI
4QQBX 183 11,42562
2jyLB 213 11,38513
35kKb 184 10,99613
37XGu 204 10,22264
3j7Vy 214 10,60094
4GE9s 206 9,86310
4JfFu 195 11,85150
5dHbs 205 11,62650
7GOJR 231 11,50969
7oYK4 206 11,99997
e8dF 192 10,42185
jjzf 190 10,35899
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19496
3eE8N
3 30,6% 160 3.882E-31
2 phalp2_27571
5BdJX
4 30,0% 123 8.402E-14
3 phalp2_10704
36twR
16 28,7% 139 2.120E-13
4 phalp2_5017
1iiXN
3 21,6% 134 1.818E-05

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4QQBX (183 AA)
Member sequence: 5EAHl (179 AA)
1 183 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4QQBX) rather than this protein.
PDB ID
4QQBX
Method AlphaFoldv2
Resolution 94.76
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50