Protein

Protein accession
3iI8J [EnVhog]
Representative
2cCct
Source
EnVhog (cluster: phalp2_31325)
Protein name
3iI8J
Lysin probability
92%
PhaLP type
endolysin
Probability: 94% (predicted by ML model)
Protein sequence
MKLFEFLLNLLRSLFGGKSKPNKELPPAAEEPTPTPEPEPEPEPEPAPEPAPEPEDQPEEEFDDPEYNVEEVTEDDETEREPLDEEDEPEEEVVVELELKMLPLDSWKERQQALVDLGFNPGKVDGIPGRKTSAAMRRAEKEYGLEQDGEWDFELHQAISDALKEKGKPKLKPMPIVPPPSGEYEDMLDPSEYELDDVFFASFIDLTHKSNVVKNGSRRRKGRRRWSRLTRFCWHQTAFVWRPYRESKRLGKHTGHHRMNAHMCFDRDGTILLIHNFFYYLWTANAFNPDCISIEVLGNFEGIQGSGKWYKGDKFGRARPTRMQLIRCRQFTIWMHNPELGPEDDKLPKPLLEWRLHCRKEGNPLKWVNTHRESADQRNGDCGSELWYHLCEWAYWYFSGDLTQGPKKGKGKDIPAVWRAKPPAPPLPLDTGEDEAA
Physico‐chemical
properties
protein length:437 AA
molecular weight:50659,3 Da
isoelectric point:5,09
hydropathy:-0,94
Representative Protein Details
Accession
2cCct
Protein name
2cCct
Sequence length
429 AA
Molecular weight
49278,83060 Da
Isoelectric point
5,47942
Sequence
MKQILAFLFRLFGKLFGKQVPKKELPPNSEEPDQDPPAPTEPEEPEEPDQPEEEFDDPDYDVDHVTPDDEEPRAPLDEEDEPDPKVVKDVEAAMLPLDSWKERQQALTDLGYSLGKIDGIPGRKTSAAMRRAEKDFGLEQDGEWDLELHRAIDKALKEKGKKALAPMPFPAPPGTAYENMIDPSEYELDDAFYNSFIDLTGKSNVKDSKGRRRRKGVRSFSKLVRFCWHQTAFIWRPYRVSKEQRKYTGHHKINAHMLFDTDGTILLLHNFKYYLWTANAFNPDCVSIEVMGNFEGVQGSGRWYKGDKFGRARPTREQIIRCRQFTLWLLDPEQGPADEELPKPLLEWREGCRKHGNPLKWDNTHRESTDDRNADCGSELWYHVVEWALSAKGEHLVQGPIRGKGQTVPTVWRAKPPAPPLPPDQRGKV
Other Proteins in cluster: phalp2_31325
Total (incl. this protein): 2 Avg length: 433,0 Avg pI: 5,29

Protein ID Length (AA) pI
2cCct 429 5,47942
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21712
3mZuj
73 27,9% 333 1.626E-18
2 phalp2_15318
1kA5d
1 23,0% 334 4.016E-06

Domains

Domains
Disordered region
PG_1
Unannotated
Representative sequence (used for alignment): 2cCct (429 AA)
Member sequence: 3iI8J (437 AA)
1 429 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2cCct) rather than this protein.
PDB ID
2cCct
Method AlphaFoldv2
Resolution 74.48
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50