Protein

Protein accession
1lEg0 [EnVhog]
Representative
5HOT
Source
EnVhog (cluster: phalp2_39688)
Protein name
1lEg0
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKTNHKGIEDFIYSHRKKQEIMIQKKKTRRKTMLFFLFLPIFSIFIFSSAVFPPSNKNVSRSPALTISLQDAYYLTLVPEEYRDYVYENCVKRNIPITYYYRMIWNESRWRWWAVGKNYYKGKIVSYDRGLGQINSKYEKDLALLFFIPSYEGEVFDVFNWQHNLQVSMNYIEDLYETFGNWPEAFMAYNCGYNAVKNNQIPERTYEYVKKIIY
Physico‐chemical
properties
protein length:214 AA
molecular weight:25925,5 Da
isoelectric point:9,24
hydropathy:-0,46
Representative Protein Details
Accession
5HOT
Protein name
5HOT
Sequence length
179 AA
Molecular weight
20572,47170 Da
Isoelectric point
9,08445
Sequence
MMKKKIGTSLILFFIMVSTEASSIDFKSNTFIYEMPQRYKRVYAPFIYQDFVLEVADVVGVDPDLLCAVIKVESNWVTFAIGNNGNSVDRGLGQINSKYESWYTEKFGIEDYRWDDGKKNILLTAKILKWSGVGFSCITHGVAAYNCGRSRVVKNTIPESTKKYVEKVMAYYNYYKNHK
Other Proteins in cluster: phalp2_39688
Total (incl. this protein): 2 Avg length: 196,5 Avg pI: 9,16

Protein ID Length (AA) pI
5HOT 179 9,08445
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20578
4B877
3 27,6% 130 1.599E-14
2 phalp2_25316
8ft2I
8 35,4% 124 5.512E-14
3 phalp2_28314
15Ckr
15 30,1% 126 7.509E-14
4 phalp2_9430
lDQe
5 25,7% 132 1.897E-13
5 phalp2_4655
4Aahq
2 32,5% 126 1.208E-12
6 phalp2_25511
3gJbi
26 29,6% 152 1.644E-12
7 phalp2_16340
5sNZI
2 29,9% 127 7.669E-12
8 phalp2_24875
7cHt
1 27,8% 183 1.929E-11
9 phalp2_29294
tD4R
30 31,0% 145 2.597E-09
10 phalp2_1437
1BM9o
11 28,0% 114 2.193E-08

Domains

Domains
Disordered region
SLT
Representative sequence (used for alignment): 5HOT (179 AA)
Member sequence: 1lEg0 (214 AA)
1 179 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01464

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5HOT) rather than this protein.
PDB ID
5HOT
Method AlphaFoldv2
Resolution 84.98
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50