Protein

Protein accession
ZEpI [EnVhog]
Representative
1lmMl
Source
EnVhog (cluster: phalp2_38789)
Protein name
ZEpI
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VLQYPGAIQAPGPADRQGYGAYPSPQNKEGFVLHSMEGSYSSAYGEMFNPTRQASWHFSVNKDGRVYQHYDPRAVAWHCGGPGDSNPLSAAIGNVALIGIEHEGRVGEPLTTAQFNASVAVQRWCYSVLPTIKQPALRSSHWEHGWISGTSCPSGRIPWTMVLAATSTTPVIPEEDNYMRVFKTIDAGRTSNYYSVGAGYKRHINDVMEVGMMLRAAKQTAAENCYQAEMDVIEDLESVIKRSAPAPTLTATQLTQIADAVSSKVGDSIAVKVANQLAARLAA
Physico‐chemical
properties
protein length:283 AA
molecular weight:30619,1 Da
isoelectric point:6,47
hydropathy:-0,28
Representative Protein Details
Accession
1lmMl
Protein name
1lmMl
Sequence length
216 AA
Molecular weight
24265,15270 Da
Isoelectric point
7,12956
Sequence
MELWYPGARKWLGPSNKKRYGTNGVKGIVNHSAVGYSAGLHHELVRADRGAAWHFSVMQDGTVEQHYPLNSVLWHAADAFGNMNYIGIEHEGGFNPANEPLTVKQREASVKLCQWIAKQAGFSLSRVDKKTLWEHNEIADKATQCPSGRIPWQYYVDEVTTKVPRLPTIEEFGDVVASIYYYSHLNTPMKATLIPISIEGRVTRYELVITGGDFST
Other Proteins in cluster: phalp2_38789
Total (incl. this protein): 2 Avg length: 249,5 Avg pI: 6,80

Protein ID Length (AA) pI
1lmMl 216 7,12956
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3554
4DvvK
9 43,8% 162 2.936E-43
2 phalp2_13494
4PpCO
19 43,1% 167 3.562E-39
3 phalp2_34349
3YRKh
5 40,0% 135 5.013E-31
4 phalp2_12419
gjha
11 33,7% 157 2.626E-21
5 phalp2_33973
40pAF
5 32,3% 173 7.851E-20
6 phalp2_7742
6WdTo
48 33,3% 132 9.246E-19
7 phalp2_14594
5ioYm
94 27,1% 188 2.724E-17
8 phalp2_4479
3gnNS
19 32,0% 181 2.334E-16
9 phalp2_8458
15NMW
2 28,5% 147 1.465E-15
10 phalp2_9423
jh6D
5 29,3% 177 6.757E-15

Domains

Domains
Representative sequence (used for alignment): 1lmMl (216 AA)
Member sequence: ZEpI (283 AA)
1 216 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
ZEpI
Method AlphaFoldv2
Resolution 74.21
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1lmMl) rather than this protein.
PDB ID
1lmMl
Method AlphaFoldv2
Resolution 87.46
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50