Protein

Protein accession
5zkiz [EnVhog]
Representative
1gp6M
Source
EnVhog (cluster: phalp2_1376)
Protein name
5zkiz
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MENEKVMKYKYRTQRDNKHQPYATCNVTSLAMGLSCVGIEVDEDRLYEQANAPEIKVWALKNVGAWTADYARANHLNQVWAVLEKLADDAIGHKDGATFKDNWLTMVQLIDHLAAGHPVLVGGLFTHGGHIICLVGYNGRGFICADPWGDWSEGYKNKNGENVLYFYDRIREVMAGKDGHFRALVLKKV
Physico‐chemical properties
protein length: 189 AA
molecular weight: 21312,0 Da
isoelectric point: 6,70
hydropathy: -0,34
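The length and hydropathy values above can be approximated directly from the protein sequence. The following is a minimal sketch, assuming that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy per residue); the record does not say which scale was used, so the result need not match the listed value exactly. The names `KYTE_DOOLITTLE`, `gravy`, and `parse_decimal` are hypothetical helpers introduced for illustration.

```python
# Kyte-Doolittle hydropathy index per amino acid (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 5zkiz, copied from the record above.
SEQUENCE = (
    "MENEKVMKYKYRTQRDNKHQPYATCNVTSLAMGLSCVGIEVDEDRLYEQANAPEIKVWAL"
    "KNVGAWTADYARANHLNQVWAVLEKLADDAIGHKDGATFKDNWLTMVQLIDHLAAGHPVL"
    "VGGLFTHGGHIICLVGYNGRGFICADPWGDWSEGYKNKNGENVLYFYDRIREVMAGKDGH"
    "FRALVLKKV"
)


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '6,70' to a float."""
    return float(text.replace(",", "."))


print(len(SEQUENCE), "AA")
# Assumption: the record's 'hydropathy' is a GRAVY-style average.
print(f"GRAVY: {gravy(SEQUENCE):.2f}")
print(parse_decimal("21312,0"), "Da")
```

Molecular weight and isoelectric point depend on the residue masses and pKa set chosen, so they are left out of this sketch.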
Representative Protein Details
Accession
1gp6M
Protein name
1gp6M
Sequence length
188 AA
Molecular weight
20942,54010 Da
Isoelectric point
5,56405
Sequence
MTNEKILPHTYHSQRDNADDPSTPQVEAYVTCNVTSLSMALSCLGIEIPPMELFKRANSPEYVAYAQQIGVAGFIKDKKLAQVWAILEKLANEHCHAQFKTDWLTLQAITDQIDAGNPIVVGGLFTHGGHIICIVGYNSLGFICTDPWGDWAHGYVNRNGDNVLYEYDKIRQVLSGNGSEFWSLVLKK
Other Proteins in cluster: phalp2_1376
Total (incl. this protein): 3 | Avg length: 200,3 | Avg pI: 6,15

Protein ID | Length (AA) | pI
1gp6M | 188 | 5,56405
7Kyfn | 224 | 6,18962
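The cluster summary can be checked against the three member records. This is a small sketch, assuming the averages are plain arithmetic means over all members including 5zkiz (length 189 AA, pI 6,70, taken from the physico-chemical properties of this protein). `MEMBERS` and `cluster_averages` are hypothetical names.

```python
# (length in AA, isoelectric point) for each cluster member, from the record.
MEMBERS = {
    "5zkiz": (189, 6.70),
    "1gp6M": (188, 5.56405),
    "7Kyfn": (224, 6.18962),
}


def cluster_averages(members):
    """Return (mean length, mean pI) over all members of a cluster."""
    lengths = [length for length, _ in members.values()]
    pis = [pi for _, pi in members.values()]
    return sum(lengths) / len(lengths), sum(pis) / len(pis)


avg_length, avg_pi = cluster_averages(MEMBERS)
print(f"Avg length: {avg_length:.1f}  Avg pI: {avg_pi:.2f}")
# prints: Avg length: 200.3  Avg pI: 6.15
```

Under that assumption the computed means agree with the listed 200,3 and 6,15.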
Similar Clusters (pHMM search)
# | Cluster | # Members | Identity (%) | Alignment Length | E-value
1 | phalp2_5390 (2yClh) | 4 | 36,6% | 180 | 1.053E-26
2 | phalp2_16156 (4uGeP) | 55 | 32,9% | 179 | 9.700E-21
3 | phalp2_34142 (2jEnG) | 37 | 27,4% | 211 | 1.595E-19
4 | phalp2_84 (4WuCJ) | 39 | 30,3% | 188 | 4.053E-19
5 | phalp2_34437 (4wMwB) | 66 | 30,8% | 204 | 1.470E-16
6 | phalp2_18460 (6EWgS) | 1 | 26,5% | 196 | 5.204E-14
7 | phalp2_29834 (276so) | 38 | 32,0% | 184 | 9.663E-12
8 | phalp2_10798 (3VYNI) | 16 | 26,0% | 184 | 3.288E-11
9 | phalp2_18755 (ZDbB) | 27 | 26,9% | 182 | 4.464E-11
10 | phalp2_315 (6Ri0c) | 3 | 25,3% | 205 | 8.227E-11
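For downstream filtering, the similar-cluster hits can be loaded into typed records. The sketch below is illustrative only: the field name `accession` is an assumption for the identifier printed beneath each cluster name, the 30% identity cutoff is an arbitrary example threshold, and `ClusterHit` and `parse_percent` are hypothetical names.

```python
from dataclasses import dataclass

# Rows copied from the table above: cluster, identifier shown under the
# cluster name, member count, identity, alignment length, E-value.
RAW_ROWS = [
    ("phalp2_5390", "2yClh", "4", "36,6%", "180", "1.053E-26"),
    ("phalp2_16156", "4uGeP", "55", "32,9%", "179", "9.700E-21"),
    ("phalp2_34142", "2jEnG", "37", "27,4%", "211", "1.595E-19"),
    ("phalp2_84", "4WuCJ", "39", "30,3%", "188", "4.053E-19"),
    ("phalp2_34437", "4wMwB", "66", "30,8%", "204", "1.470E-16"),
    ("phalp2_18460", "6EWgS", "1", "26,5%", "196", "5.204E-14"),
    ("phalp2_29834", "276so", "38", "32,0%", "184", "9.663E-12"),
    ("phalp2_10798", "3VYNI", "16", "26,0%", "184", "3.288E-11"),
    ("phalp2_18755", "ZDbB", "27", "26,9%", "182", "4.464E-11"),
    ("phalp2_315", "6Ri0c", "3", "25,3%", "205", "8.227E-11"),
]


@dataclass
class ClusterHit:
    cluster: str
    accession: str  # assumed meaning of the second identifier
    members: int
    identity_pct: float
    alignment_length: int
    e_value: float


def parse_percent(text: str) -> float:
    """Convert a decimal-comma percentage such as '36,6%' to a float."""
    return float(text.rstrip("%").replace(",", "."))


hits = [
    ClusterHit(c, acc, int(n), parse_percent(ident), int(aln), float(ev))
    for c, acc, n, ident, aln, ev in RAW_ROWS
]

# Example filter: keep hits at or above a hypothetical 30% identity cutoff.
strong = [h for h in hits if h.identity_pct >= 30.0]
print([h.cluster for h in strong])
```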

Domains

Representative sequence (used for alignment): 1gp6M (188 AA)
Member sequence: 5zkiz (189 AA)
[Domain diagram; axis: residues 1 to 188 of the representative]
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD (enzymatically active domain), CBD (cell wall binding domain), Linker, Disordered, Unannotated
Pfam accessions: PF13529

Taxonomy

Phage
Name: Unknown from Metagenome [NCBI]
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1gp6M) rather than this protein.
PDB ID: 1gp6M
Method: AlphaFoldv2
Resolution: 90.45
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
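The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. It assumes that the value 90.45 listed under "Resolution" is a mean pLDDT score for the AlphaFold model, which the record does not state, and it assigns the exact boundary values 90, 70, and 50 to the lower band, which the legend leaves unspecified. `plddt_band` is a hypothetical helper.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend."""
    if not 0.0 <= score <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Assumption: 90.45 (shown as 'Resolution') is the model's mean pLDDT.
print(plddt_band(90.45))  # prints: Very high
```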