Protein

Protein accession
4K3rn [EnVhog]
Representative
4EA8c
Source
EnVhog (cluster: phalp2_1978)
Protein name
4K3rn
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MPLQAGSPQLLLPDLSEWQPHADMAAIRSANGGAAIIRGAYGDAHPDAAFPAFRAAASGYRWLGLYQYLRSGQDVSAQARAFVAIVGQLAAHEVPILDLEEGDGDQAERAGTWLATVDAGLGLGERPLSERSWLYSGLAFLQAHGLAPLFGSRRRTWIAAYSATEPVIGHTLWQCTNGKRGSHLTSWPGTGYCDTSLYHGTISQLAATTGRDGKDDDMLSGTLLADQATPIRVGEGRGAAQLLLSVADGAPAQKLTVSVRDDGGSYTQGVDVTWQDPGVVKFRHADKVKAVSVSRPSAAVSYVLF
Physico-chemical properties
protein length: 305 AA
molecular weight: 32285,8 Da
isoelectric point: 5,88
hydropathy: -0,13
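The length and hydropathy values above can be recomputed from the protein sequence. The sketch below is a minimal illustration that assumes the hydropathy figure is a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the page does not say which scale it uses, so that choice is an assumption. The function name `gravy` and the toy input are hypothetical.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
# Assumption: the page's "hydropathy" is the GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    # Drop whitespace and normalise case so pasted sequences work as-is.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = sorted(set(seq) - set(KYTE_DOOLITTLE))
    if unknown:
        raise ValueError(f"non-standard residues: {unknown}")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    toy = "MPLQAG"  # toy input: first six residues of the sequence above
    print(len(toy), round(gravy(toy), 2))
```

Passing the full string from "Protein sequence" gives the protein length via `len` and a GRAVY value to compare against the listed hydropathy. A mismatch would suggest the page uses a different scale.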
Representative Protein Details
Accession
4EA8c
Protein name
4EA8c
Sequence length
333 AA
Molecular weight
35462,02790 Da
Isoelectric point
4,72056
Sequence
MLLPDVSEFQSGATAPNWAGIKAQNGGAAIIRVGYGNGHLDNMFVDNYTAVKANDFAFVGLYHYLRMGQDALSQAQQFCDWVGPLSALAPGSIPMLDLEEGSGDQSGRANTWLNFVDHFYGLDQLTLDQRSWLYSGDSFARTANLGPIFNSARHTWVAAYRSSEAGLLPHTLWQSTNGSLGANITNWSGCGNVDTSIAHVDLPTLASMAYQPTDGPPAKPAQILQEEENMLKETEKTTVISFGNAQYSWIAFFADPAVEGKGDPQHIRVAPWSADGGGAFAGIVADVAVGSATEKVTVNLPAHCAGVSIQRTGTQAADPTTWAPIAWNLGPVS
Other Proteins in cluster: phalp2_1978
Total (incl. this protein): 6; Avg length: 301,3 AA; Avg pI: 5,67

Protein ID Length (AA) pI
4EA8c 333 4,72056
2Vowd 289 5,12053
4C3fR 282 5,00816
4EWeL 313 6,18217
4Esp9 286 7,12382
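The cluster summary can be checked against the member table. The sketch below assumes the averages are plain arithmetic means over the five listed members plus this protein (4K3rn, 305 AA, pI 5,88), and that numbers use a decimal comma. The helper `parse_decimal_comma` is a hypothetical name.

```python
def parse_decimal_comma(text: str) -> float:
    """Parse a number written with a decimal comma, e.g. '4,72056'."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) copied from the table, plus this protein.
members = [
    ("4EA8c", "333", "4,72056"),
    ("2Vowd", "289", "5,12053"),
    ("4C3fR", "282", "5,00816"),
    ("4EWeL", "313", "6,18217"),
    ("4Esp9", "286", "7,12382"),
    ("4K3rn", "305", "5,88"),  # this protein, from its own property list
]

lengths = [int(length) for _, length, _ in members]
pis = [parse_decimal_comma(pi) for _, _, pi in members]

avg_length = sum(lengths) / len(lengths)
avg_pi = sum(pis) / len(pis)

print(f"members={len(members)} avg_length={avg_length:.1f} avg_pI={avg_pi:.2f}")
# prints: members=6 avg_length=301.3 avg_pI=5.67
```

The result matches the stated totals (6 members, average length 301,3, average pI 5,67), which is consistent with the summary counting this protein among the members.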
Similar Clusters (pHMM search)
# Cluster Protein ID # Members Identity (%) Alignment Length E-value
1 phalp2_36183 6XM6V 1 62,9% 208 2.992E-88
2 phalp2_19875 5GUq8 18 41,8% 220 1.985E-52
3 phalp2_9690 23WOG 27 42,0% 219 2.500E-49
4 phalp2_25550 3zK3V 3 37,5% 208 1.461E-45
5 phalp2_10923 4EY3t 12 33,3% 222 1.831E-34
6 phalp2_39507 5nLHO 1 32,5% 243 6.962E-31
7 phalp2_19688 4Ezja 7 27,8% 266 7.520E-28
8 phalp2_16973 870CF 1 30,1% 209 1.865E-27
9 phalp2_39912 1jXPT 300 28,0% 221 4.216E-18
10 phalp2_34559 4UNXW 287 27,2% 209 3.472E-16
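The similar-cluster hits mix two number formats: identities use a decimal comma, while E-values use a decimal point with an exponent. The sketch below is a minimal illustration of parsing both and filtering the hits. The 30% identity cut-off is an arbitrary threshold chosen for the example, not something the page defines.

```python
# (cluster, identity, alignment length, E-value) copied from the hit list.
hits = [
    ("phalp2_36183", "62,9%", 208, "2.992E-88"),
    ("phalp2_19875", "41,8%", 220, "1.985E-52"),
    ("phalp2_9690", "42,0%", 219, "2.500E-49"),
    ("phalp2_25550", "37,5%", 208, "1.461E-45"),
    ("phalp2_10923", "33,3%", 222, "1.831E-34"),
    ("phalp2_39507", "32,5%", 243, "6.962E-31"),
    ("phalp2_19688", "27,8%", 266, "7.520E-28"),
    ("phalp2_16973", "30,1%", 209, "1.865E-27"),
    ("phalp2_39912", "28,0%", 221, "4.216E-18"),
    ("phalp2_34559", "27,2%", 209, "3.472E-16"),
]


def parse_identity(text: str) -> float:
    """Convert '62,9%' (decimal comma, percent sign) to 62.9."""
    return float(text.rstrip("%").replace(",", "."))


evalues = [float(e) for _, _, _, e in hits]  # float() reads '2.992E-88' directly

# The hit list is ranked by ascending E-value, not by identity.
ranked_by_evalue = evalues == sorted(evalues)

MIN_IDENTITY = 30.0  # hypothetical cut-off, for illustration only
above_cutoff = [name for name, ident, _, _ in hits
                if parse_identity(ident) >= MIN_IDENTITY]

print(ranked_by_evalue, len(above_cutoff))
# prints: True 7
```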

Domains

Annotated domain: GH25 (Pfam accession: PF01183); the remaining regions are unannotated.
Representative sequence (used for alignment): 4EA8c (333 AA)
Member sequence: 4K3rn (305 AA)
[Domain diagram: the axis spans residues 1 to 333 of the representative. Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis. Legend: EAD, CBD, Linker, Disordered, Unannotated.]

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4EA8c) rather than this protein.
PDB ID: 4EA8c
Method: AlphaFoldv2
Resolution: 88.11
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
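The confidence legend can be expressed as a small lookup. The sketch below is a minimal illustration; the legend leaves the exact boundary values (90, 70, 50) unassigned, so placing each boundary in the lower band is an assumption, and the function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: a score exactly on a boundary (90, 70, 50) falls into
    the lower band, since the legend uses strict inequalities throughout.
    """
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90.0:
        return "Very high"
    if score > 70.0:
        return "High"
    if score > 50.0:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 60.0, 30.0):
        print(value, plddt_band(value))
```

The page lists 88.11 under "Resolution" for an AlphaFold model. If that figure is in fact a mean pLDDT, which the page does not state, `plddt_band(88.11)` would place it in the "High" band.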