Protein

Protein accession
4fOJ8 [EnVhog]
Representative
1Cglk
Source
EnVhog (cluster: phalp2_33862)
Protein name
4fOJ8
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MPGKYFSDEELACKCGCGLLPKQRTIDIADKLRVLWGAGLRVTSGARCQNYNNYLRLHGIPAAQYSAHIEGLALDLRPLNGKVRELQELAIQNAERLGIRVEDPKATPQWLHFDLRPVAPGKNRVFKP
Physico‐chemical
properties
protein length:128 AA
molecular weight:14347,5 Da
isoelectric point:9,35
hydropathy:-0,39
Representative Protein Details
Accession
1Cglk
Protein name
1Cglk
Sequence length
139 AA
Molecular weight
15579,52990 Da
Isoelectric point
5,57871
Sequence
MPTPLEGFACPCCGLYRVSNRLITLDELLHIAFGDRLRRSSGTRCRAYNTSEEIKGSRTSGHLPIWGPENNESVAGDYELIDATETDLRRLMWTAIQEGALGVGYMPDDNALHVDLKPRGYGVALWIVRDGKITYFFNI
Other Proteins in cluster: phalp2_33862
Total (incl. this protein): 10 Avg length: 136,0 Avg pI: 7,95

Protein ID Length (AA) pI
1Cglk 139 5,57871
14IQI 123 8,70505
2TyJ9 123 8,85301
3utDt 174 6,81155
4JvJJ 119 7,70227
4NT9E 156 6,21804
7HHEk 101 8,76314
7HK1s 123 8,91309
XF0R 174 8,59430
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_38763
1dS3s
1408 23,3% 120 1.245E-18
2 phalp2_14040
85wjf
1353 28,9% 121 1.539E-17
3 phalp2_29133
79Li3
3905 28,4% 123 6.673E-16
4 phalp2_11062
4US9Y
716 30,8% 123 1.711E-15
5 phalp2_34760
3hjNi
68 27,1% 129 2.342E-15
6 phalp2_15919
4GXbY
30 25,2% 115 3.160E-12
7 phalp2_30902
HOls
632 22,0% 100 1.104E-11
8 phalp2_6873
2aTQG
108 25,6% 121 1.509E-11
9 phalp2_19392
2nERN
10 24,6% 134 3.853E-11
10 phalp2_26705
2Q6OF
1 27,5% 116 5.266E-11

Domains

Domains
Unannotated
Representative sequence (used for alignment): 1Cglk (139 AA)
Member sequence: 4fOJ8 (128 AA)
1 139 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1Cglk) rather than this protein.
PDB ID
1Cglk
Method AlphaFoldv2
Resolution 92.83
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50