Protein

Protein accession
4LI8Z [EnVhog]
Representative
8mbtZ
Source
EnVhog (cluster: phalp2_40144)
Protein name
4LI8Z
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
LSIKDYWWSKAIEASQMIGWFPTVILAQWQLETGNFTSSNLQRNNNIAGQTWYKGLPETMKGTARPPNEGGFYIRYDDPVDGYVEFIQQNGRYRGVKDMPDENAQIDAIAAAGWAIDPNYADKLKAILAANTAAGYVLEVEPVLDKGVATTIINTWIKPEWETEFKNGNTKQCDYLHFLAQSLRNAAGITEAE
Physico-chemical properties
Protein length: 193 AA
Molecular weight: 21642,0 Da
Isoelectric point: 4,81
Hydropathy: -0,43
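The record does not say how the hydropathy value is computed. A common choice is the GRAVY score, the mean Kyte-Doolittle hydropathy index over all residues. The sketch below assumes that definition; the function name `gravy` and the toy peptide are illustrative and not part of the record.

```python
# Kyte-Doolittle hydropathy index per residue (standard published scale).
# Assumption: the page's "hydropathy" is a GRAVY-style mean over this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


# Toy peptide (hypothetical, not from the record): one alanine, one arginine.
toy_peptide = "AR"
print(len(toy_peptide), round(gravy(toy_peptide), 2))
```

Passing the protein sequence listed above to `gravy` gives a value that can be compared with the reported hydropathy. Agreement would support the GRAVY assumption but would not confirm how the database computes the field.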
Representative Protein Details
Accession
8mbtZ
Protein name
8mbtZ
Sequence length
189 AA
Molecular weight
20054,06460 Da
Isoelectric point
8,77687
Sequence
MQLSSTVGQTEMSMRDYASYASQKTGWDAGLIYAQWVLETGNFTSSVFRKDNNLAGIKWVSSRNNPGATGPGSEANDGGSYAHYPNLAAGVQGYINFISANPRYANVKTGKTTAEQAQFLKNDGWATDPDYVSKVVSIAGGNNNVNIAQLGDTSNNTGIATSSIGQKASIIVKSPLFWVVLLAGLVFKA
Other Proteins in cluster: phalp2_40144
Total (incl. this protein): 2; Avg length: 191,0 AA; Avg pI: 6,79

Protein ID Length (AA) pI
8mbtZ 189 8,77687
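The cluster averages can be checked from the two member records. This minimal sketch assumes the averages are plain arithmetic means and that values use a decimal comma, as they do on this page. The helper `parse_decimal_comma` is a hypothetical name introduced for illustration.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '8,77687' to a float."""
    return float(text.strip().replace(",", "."))


# (protein ID, length in AA, pI as printed on the page)
members = [
    ("4LI8Z", 193, "4,81"),
    ("8mbtZ", 189, "8,77687"),
]

lengths = [length for _, length, _ in members]
pis = [parse_decimal_comma(pi) for _, _, pi in members]

# Assumption: the page reports unweighted arithmetic means.
avg_length = sum(lengths) / len(lengths)
avg_pi = sum(pis) / len(pis)

print(f"{avg_length:.1f} {avg_pi:.2f}")  # prints "191.0 6.79"
```

Both results match the reported cluster summary (191,0 and 6,79).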
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5987 (6F4h5) 1 38,9% 136 1.135E-35
2 phalp2_8561 (1Ibzf) 44 36,4% 148 1.347E-13
3 phalp2_5346 (2cQBB) 648 35,1% 128 6.265E-13
4 phalp2_20226 (8kJAz) 18 35,0% 134 5.368E-12
5 phalp2_32846 (31z3H) 398 31,5% 130 9.652E-10
6 phalp2_32724 (8pNqF) 4 34,9% 126 2.403E-09
7 phalp2_30397 (4MmCW) 1 28,1% 135 8.089E-09
8 phalp2_24545 (7HQNr) 274 31,5% 133 1.095E-08
9 phalp2_23271 (4Y2T6) 95 27,4% 131 1.483E-08
10 phalp2_35649 (1pmVz) 1 30,2% 129 9.094E-08

Domains

GLUCO
Disordered region
Representative sequence (used for alignment): 8mbtZ (189 AA)
Member sequence: 4LI8Z (193 AA)
Axis: positions 1 to 189 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4LI8Z
Method AlphaFoldv2
Resolution 92.83
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
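The confidence bands above can be applied to a score programmatically. The sketch below makes two assumptions that the record does not confirm: that the value listed under "Resolution" for an AlphaFold model is a mean pLDDT, and that a score exactly on a boundary (90, 70 or 50) falls into the lower band, since the legend uses strict inequalities. The function name `plddt_band` is illustrative.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be within 0..100, got {score}")
    # Assumption: boundary values (exactly 90, 70, 50) go to the lower band.
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Values copied from the two structure entries on this page; treating them
# as mean pLDDT scores is an assumption.
for model_id, score in [("4LI8Z", 92.83), ("8mbtZ", 74.78)]:
    print(model_id, plddt_band(score))
```

Under these assumptions, the 4LI8Z model falls in the "Very high" band and the representative 8mbtZ model in the "High" band.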

The structures below correspond to the cluster representative (8mbtZ) rather than this protein.
PDB ID
8mbtZ
Method AlphaFoldv2
Resolution 74.78
Chain position -