Protein

Protein accession
20Jou [EnVhog]
Representative
4l17x
Source
EnVhog (cluster: phalp2_31586)
Protein name
20Jou
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTKTNEGLVAYCKTKLTLPTIYMLGGFGRLLTQANIDRRINQLRCPHTIQNLKTIQSGIGKYCFDCVGLIKGYLWEEKPGIVPYNIPKGSDQNVKMMYSACLQKGPLASMPDLPGLLVFTENLGHVGVYIGKDPAGKRQYIEATPAWNIWGVTQSNDEIRKWAFWGKYGYITYIEPKKEPVQSEIKVGDFVLVSGVGRGTSLGTGGFTANLKSRRMKVIKILSKAPYSYGCSFNLKAVIGETGSRYITAYFKPTSIRKG
Physico‐chemical properties
protein length: 259 AA
molecular weight: 28756,2 Da
isoelectric point: 9,65
hydropathy: -0,21
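The record does not say how the hydropathy value is calculated. A common choice for this kind of figure is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The sketch below assumes that definition; the function name `gravy` and the short example fragment are illustrative, not taken from the database.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the record's "hydropathy" is a GRAVY-style average.
    Raises ValueError on an empty sequence or a non-standard residue.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = sorted(set(seq) - set(KYTE_DOOLITTLE))
    if unknown:
        raise ValueError(f"non-standard residues: {unknown}")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the member sequence, used only as a short demo.
    fragment = "MTKTNEGLVA"
    print(len(fragment), round(gravy(fragment), 2))
```

Passing the full 259-residue sequence from the record to `gravy` gives a value that can be compared with the listed hydropathy; a mismatch would suggest the database uses a different scale or definition.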
Representative Protein Details
Accession
4l17x
Protein name
4l17x
Sequence length
311 AA
Molecular weight
34643,13680 Da
Isoelectric point
9,59452
Sequence
MVKTNLGLVEYVKSKLALNTIYMLGGFGRILTQAMIDRRLNMGCPHTIRNLATIQAGIGRYCFDCVGLIKGYLWELSPGKVDYNIPNGSDQNVGMMYNSCTQKGVLISMPEVLGLLVFTRDLGHVGVYIGRDANGKRQYIECTPAWGKWGVCQSNDAIRSWAFWGKHHLIEYIESVPVPTIPKLKFKAGQKVVLNGRVYRNSLLSGPGAIFSNRVGFIRFLVDEEIVPAPYHIDGLGWVKEESLSLAPITMMTPKIKAGDKVKISGTHYATGEKVPFWVKLRVHSEASVSGSKARLKEINSWVNLKDLKRI
Other Proteins in cluster: phalp2_31586
Total (incl. this protein): 11 Avg length: 273,1 Avg pI: 9,66

Protein ID Length (AA) pI
4l17x 311 9,59452
1Ekgv 311 9,92454
2GWsl 259 9,51458
2t4sR 311 9,59949
2tacZ 259 9,72604
4GKfS 258 9,54192
4l1Gz 259 9,55591
4l1ca 259 9,68194
4l1fB 259 9,74151
7kPAR 259 9,68936
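The summary line can be checked against the member table. The sketch below parses the rows as listed (decimal commas included), adds this protein (20Jou, 259 AA, pI 9,65 as shown in its properties), and averages length and pI. Treating "incl. this protein" as "the ten listed rows plus 20Jou" is an assumption, and the helper names are illustrative.

```python
from statistics import mean

# Rows copied from the cluster table, plus this protein (20Jou) as the
# eleventh member. Its pI is the rounded value shown on the page.
MEMBERS = """\
4l17x 311 9,59452
1Ekgv 311 9,92454
2GWsl 259 9,51458
2t4sR 311 9,59949
2tacZ 259 9,72604
4GKfS 258 9,54192
4l1Gz 259 9,55591
4l1ca 259 9,68194
4l1fB 259 9,74151
7kPAR 259 9,68936
20Jou 259 9,65
"""


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '9,59452' to a float."""
    return float(text.replace(",", "."))


def summarize(table: str) -> tuple[int, float, float]:
    """Return (member count, average length, average pI) for the table."""
    lengths, pis = [], []
    for line in table.splitlines():
        if not line.strip():
            continue
        _protein_id, length, pi = line.split()
        lengths.append(int(length))
        pis.append(parse_decimal(pi))
    return len(lengths), mean(lengths), mean(pis)


if __name__ == "__main__":
    count, avg_len, avg_pi = summarize(MEMBERS)
    print(count, round(avg_len, 1), round(avg_pi, 2))  # prints: 11 273.1 9.66
```

Under that assumption the result agrees with the reported totals of 11 members, average length 273,1 and average pI 9,66.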
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_32929 (3PO25) 1 33,3% 207 9.831E-29
2 phalp2_21023 (7YTt) 3 28,9% 314 1.009E-23
3 phalp2_13339 (3ZqUL) 1 30,0% 240 6.175E-20
4 phalp2_1806 (3apWy) 1 32,8% 216 3.214E-17
5 phalp2_23301 (5hS2g) 4 27,7% 238 3.214E-17
6 phalp2_16010 (3o4Sp) 262 27,6% 268 2.704E-15
7 phalp2_19917 (6hBsp) 1 28,4% 232 3.630E-15
8 phalp2_29540 (3v8t1) 106 30,3% 221 5.100E-14
9 phalp2_24045 (8beMs) 3 29,1% 213 1.268E-12
10 phalp2_30278 (4kbSa) 2 25,2% 317 7.393E-09

Domains

Unannotated
Representative sequence (used for alignment): 4l17x (311 AA)
Member sequence: 20Jou (259 AA)
Axis: positions 1 to 311 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome [NCBI]
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4l17x) rather than this protein.
PDB ID 4l17x
Method AlphaFold v2
Resolution 90.83
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
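The confidence bands above can be expressed as a small lookup. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, assigning the exact boundary values (90, 70, 50) to the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into the
    lower band, since the legend only states strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```

Applied per residue, the same function would reproduce the colour bands typically used when displaying an AlphaFold model.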