Protein

Protein accession
6EChK [EnVhog]
Representative
4I8UQ
Source
EnVhog (cluster: phalp2_31888)
Protein name
6EChK
Lysin probability
97%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
LRKLNTALIGVAILALSFSAAEARPRHHHHHHHARVAKINVTEPQDLSFFGSFQSASSDVVGRARQFIGESAHQVGVRSTLWCSAFLRKVTGAQDVDDRALSWEKHQHIAPQVGAVVTMGRRGGGHVGVVSGFTAKGDPIVISGNHGHRVAESVYPRSRIRAWLSPT
Physico‐chemical
properties
protein length:167 AA
molecular weight:18114,3 Da
isoelectric point:11,05
hydropathy:-0,19
Representative Protein Details
Accession
4I8UQ
Protein name
4I8UQ
Sequence length
100 AA
Molecular weight
10659,02340 Da
Isoelectric point
11,12719
Sequence
MGETAAQVGVRRNLWCAAFMNKLLNGGTHSDLAASYSHYGRPASAGCVGCIAVTSRRGGGHVGVVTGWQRNNPIIVSGNHGRRVGEGVYPRTRIVSLRWP
Other Proteins in cluster: phalp2_31888
Total (incl. this protein): 5 Avg length: 135,2 Avg pI: 11,24

Protein ID Length (AA) pI
4I8UQ 100 11,12719
6DV5N 128 11,07923
6Jhi1 103 11,31718
7xite 178 11,61193
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8041
5Foum
5 36,1% 94 9.255E-12
2 phalp2_1584
87k6T
5 32,6% 104 1.743E-11
3 phalp2_556
H4lq
5 35,6% 73 3.008E-10
4 phalp2_21
4NKG1
6 37,3% 91 1.223E-07
5 phalp2_11009
4NiJd
22 31,2% 96 5.937E-07
6 phalp2_768
1YX1Z
1 32,3% 68 2.878E-06
7 phalp2_34973
4FMbb
48 23,9% 96 1.911E-05
8 phalp2_38004
5Bbta
1 39,7% 68 2.619E-05
9 phalp2_19771
7LzfF
669 30,8% 94 2.376E-04

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4I8UQ (100 AA)
Member sequence: 6EChK (167 AA)
1 100 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4I8UQ) rather than this protein.
PDB ID
4I8UQ
Method AlphaFoldv2
Resolution 84.24
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50