Protein

Protein accession
4Gfxx [EnVhog]
Representative
58KdN
Source
EnVhog (cluster: phalp2_11093)
Protein name
4Gfxx
Lysin probability
88%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIELGCLALVMYMESSISSVTTQAYVASVVIERAKQDKQTICKNIHRPISYSWMFDGKNTKVDQLFLHQKLYPIALAQLKKPRLVGYYYFNECRLGKRFKTKNKMIRSDTMCFY
Physico‐chemical
properties
protein length:114 AA
molecular weight:13376,7 Da
isoelectric point:9,61
hydropathy:-0,19
Representative Protein Details
Accession
58KdN
Protein name
58KdN
Sequence length
113 AA
Molecular weight
13309,45820 Da
Isoelectric point
10,11614
Sequence
MYFEARNQPVDTMLGVGQVLIEHARPGEDLCHVIQRDPGLFTWARHGMKTPHPKRKADRDVLDKQYELARKMLFRNLRTTKLTEGYKHFNNVPLGKRFRTKVKMVKIGDLLFF
Other Proteins in cluster: phalp2_11093
Total (incl. this protein): 13 Avg length: 121,1 Avg pI: 10,06

Protein ID Length (AA) pI
58KdN 113 10,11614
2apSo 114 9,66892
315tZ 124 10,07236
3duXC 116 11,28643
4GAkT 125 10,09351
52mDT 128 9,59201
5480o 120 10,95016
56p7d 124 10,05051
58j3c 122 9,90803
5wtA9 123 9,74345
5xDgY 124 10,08648
8n3JT 127 9,65416
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_38660
87tsE
25 30,7% 114 4.217E-22
2 phalp2_8296
8wndF
3734 27,0% 133 7.221E-14
3 phalp2_19867
5DmT3
416 22,6% 115 5.444E-11
4 phalp2_37440
2KS36
1494 18,4% 125 1.401E-10
5 phalp2_25954
6zMiD
37 27,5% 116 1.271E-09
6 phalp2_2416
54zSE
377 20,3% 118 1.742E-09
7 phalp2_5410
2GVbS
418 21,3% 122 8.406E-09
8 phalp2_22991
3BMOW
57 29,7% 84 1.151E-08
9 phalp2_13037
83E6T
2 25,7% 128 2.958E-08
10 phalp2_6737
26WZo
49 26,8% 82 5.548E-08

Domains

Domains
Unannotated
Unannotated
Disordered region
Representative sequence (used for alignment): 58KdN (113 AA)
Member sequence: 4Gfxx (114 AA)
1 113 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (58KdN) rather than this protein.
PDB ID
58KdN
Method AlphaFoldv2
Resolution 43.09
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50