Protein

Protein accession
1079F [EnVhog]
Representative
5EIlb
Source
EnVhog (cluster: phalp2_13652)
Protein name
1079F
Lysin probability
99%
PhaLP type
endolysin
Probability: 94% (predicted by ML model)
Protein sequence
MFLLYLHLLLFNINPIHIGQTYYVLKQLQPDLKVCQAIELATSIESLCKKYNLPEDEFLSILFQESSLKTDPKSCLKRPQNCTHDFGIGQVNYIIWGKKINLEPIKALKNISYSVEISAKIINHYKKKYSQKDKDWIYRYHSKTPSLKKAYKERIEAIHAKIKKYKRGYLDGRRTKNKVIGTHFNYCQKQEEISSRLP
Physico‐chemical
properties
protein length:198 AA
molecular weight:23344,0 Da
isoelectric point:9,60
hydropathy:-0,55
Representative Protein Details
Accession
5EIlb
Protein name
5EIlb
Sequence length
198 AA
Molecular weight
22471,35920 Da
Isoelectric point
6,95825
Sequence
MTSTLLALCLVGQFEIKDVEQANKVINALQPELTDEQAMDLSWAVVVNAPKAGIPWTKFLAVLFQESSLWLDPKGCMEGKKCHDYGVGQVNWRTWGEELNLDRYKLLTDYAYSVKVSAEVFAHYHAKYSKKDPKNWWGFYHSKTPSLKRAYQARVRGVHAKIVGELEVEPNESRVFDISESDRCVFGPGEKTSLRSAP
Other Proteins in cluster: phalp2_13652
Total (incl. this protein): 4 Avg length: 204,5 Avg pI: 8,79

Protein ID Length (AA) pI
5EIlb 198 6,95825
3WXtp 206 8,83605
6NtEU 216 9,74938
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9786
8askd
4 33,0% 121 1.259E-22
2 phalp2_28385
1pszz
2 31,0% 129 1.395E-18
3 phalp2_4558
45Gxl
4 24,4% 143 3.530E-18
4 phalp2_26826
3T8ym
5 26,5% 128 5.794E-13
5 phalp2_31545
4c1Of
8 24,8% 141 2.653E-07
6 phalp2_7844
8zPN
9 26,2% 145 2.653E-07
7 phalp2_11964
2UWwJ
5 23,5% 153 1.183E-06
8 phalp2_23147
4HkdD
8 28,9% 121 9.507E-06
9 phalp2_24669
5JJaQ
2 19,6% 132 1.279E-05
10 phalp2_29576
7W94G
14 22,8% 140 1.363E-04

Domains

Domains
Unannotated
Disordered region
Representative sequence (used for alignment): 5EIlb (198 AA)
Member sequence: 1079F (198 AA)
1 198 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5EIlb) rather than this protein.
PDB ID
5EIlb
Method AlphaFoldv2
Resolution 84.37
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50