Protein

Protein accession
22clE [EnVhog]
Representative
87k6T
Source
EnVhog (cluster: phalp2_1584)
Protein name
22clE
Lysin probability
99%
PhaLP type
endolysin
Probability: 94% (predicted by ML model)
Protein sequence
MKELFDNIIKEYALHESRPEVDKIIDRFNNKFDTNLNPDGDKTAWCSIYLNMKAFDLGYEHSGSALARSWMRVGDVVTDNWRDLSNTDMGDVVILWRKQKRGTFGHVGLLVTYTDKYVYLLGANQNNEVNISKYARNRILSVNRLNKV
Physico-chemical properties
protein length: 148 AA
molecular weight: 17179.3 Da
isoelectric point: 8.63
hydropathy: -0.53
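The length and hydropathy figures can be re-derived from the sequence itself. Below is a minimal Python sketch, assuming the hydropathy value is a GRAVY score (the mean Kyte-Doolittle hydropathy index per residue). The page does not say which scale it uses, so that choice is an assumption, and the names `KYTE_DOOLITTLE`, `SEQUENCE` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy index per residue.
# ASSUMPTION: the page's "hydropathy" is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 22clE, copied from this record.
SEQUENCE = (
    "MKELFDNIIKEYALHESRPEVDKIIDRFNNKFDTNLNPDGDKTAWCSIYL"
    "NMKAFDLGYEHSGSALARSWMRVGDVVTDNWRDLSNTDMGDVVILWRKQK"
    "RGTFGHVGLLVTYTDKYVYLLGANQNNEVNISKYARNRILSVNRLNKV"
)


def gravy(seq: str) -> float:
    """Mean hydropathy per residue (grand average of hydropathy)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


length = len(SEQUENCE)
score = gravy(SEQUENCE)
print(length, round(score, 2))  # prints: 148 -0.53
```

Under that assumption the sketch agrees with the listed length (148 AA) and hydropathy (-0.53).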
Representative Protein Details
Accession
87k6T
Protein name
87k6T
Sequence length
153 AA
Molecular weight
17024.33610 Da
Isoelectric point
9.62444
Sequence
MKKLASVMMLSLAILATPASAGEWFRDNQVYTQTFDPEQLIERIRQDIGKTARELGLPRTTLWCSEYLNSVTEGGTGSAQAKSWLSFPKVKQPRPGLIVVLKRGNSPSAGHVGVVEDFDNHYVYVLGGNQGNRVSLSRYPRRMVIAYVDPLKN
Other Proteins in cluster: phalp2_1584
Total (incl. this protein): 5   Avg length: 152.2   Avg pI: 9.69

Protein ID  Length (AA)  pI
87k6T       153          9.62444
1z3Wl       148          6.90181
5FomQ       156          11.73984
5IYaF       156          11.57435
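The cluster summary can be checked against the per-protein values. Below is a minimal Python sketch, assuming the averages are plain arithmetic means over all five members, including this protein (22clE, 148 AA, pI 8.63). The variable names are illustrative.

```python
# (length in AA, isoelectric point) for each cluster member,
# copied from this record. 22clE is the protein described on this page.
members = {
    "22clE": (148, 8.63),
    "87k6T": (153, 9.62444),
    "1z3Wl": (148, 6.90181),
    "5FomQ": (156, 11.73984),
    "5IYaF": (156, 11.57435),
}

# ASSUMPTION: the summary line uses unweighted arithmetic means.
avg_length = sum(length for length, _ in members.values()) / len(members)
avg_pi = sum(pi for _, pi in members.values()) / len(members)

print(round(avg_length, 1), round(avg_pi, 2))  # prints: 152.2 9.69
```

Both results match the summary line, which supports reading the total of 5 as the four listed proteins plus this one.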
Similar Clusters (pHMM search)
#   Cluster               # Members  Identity (%)  Alignment Length  E-value
1   phalp2_31888 (4I8UQ)  5          36.2%         102               5.835E-16
2   phalp2_30760 (3FkY)   209        34.6%         127               3.389E-14
3   phalp2_38506 (1rtno)  9          31.7%         145               1.046E-12
4   phalp2_30664 (6NcXd)  1          31.1%         109               9.243E-12
5   phalp2_34569 (4WIGQ)  127        31.4%         162               2.064E-10
6   phalp2_35973 (5kZXR)  1764       30.0%         110               3.837E-10
7   phalp2_8041 (5Foum)   5          32.6%         101               5.231E-10
8   phalp2_556 (H4lq)     5          32.3%         102               2.460E-09
9   phalp2_38022 (5Ighg)  1          30.2%         119               7.344E-08
10  phalp2_39960 (1CQ4o)  6          27.3%         161               1.587E-06

Domains

Disordered region
CHAP
Representative sequence (used for alignment): 87k6T (153 AA)
Member sequence: 22clE (148 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05257

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (87k6T) rather than this protein.
PDB ID: 87k6T
Method: AlphaFoldv2
Resolution: 82.52
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
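The model confidence legend maps a pLDDT score to one of four bands. Below is a minimal Python sketch of that mapping. The legend uses strict inequalities and does not say where the exact boundary values (90, 70, 50) belong, so placing them in the lower band is an assumption. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Return the confidence band for a pLDDT score on a 0-100 scale.

    ASSUMPTION: a score exactly on a boundary (90, 70 or 50) falls
    into the lower band; the legend leaves these cases unspecified.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```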