Protein

Protein accession
2gFc3 [EnVhog]
Representative
G3e0
Source
EnVhog (cluster: phalp2_29319)
Protein name
2gFc3
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
TVKKLQKALGVKADGDFGAGTEAALKAWQREHDCVPDGVAGPQTLGKLFS
Physico‐chemical
properties
protein length:50 AA
molecular weight:5210,9 Da
isoelectric point:7,83
hydropathy:-0,32
Representative Protein Details
Accession
G3e0
Protein name
G3e0
Sequence length
47 AA
Molecular weight
4941,59310 Da
Isoelectric point
6,16717
Sequence
VQEKLGIDPADGIYGFWTSNVVKKWQADNGLVADGVAGPKTLAKLLG
Other Proteins in cluster: phalp2_29319
Total (incl. this protein): 11 Avg length: 50,1 Avg pI: 6,96

Protein ID Length (AA) pI
G3e0 47 6,16717
227Kg 35 4,42596
2FM6g 55 5,18016
2nhCY 57 4,71465
3jxJ5 51 5,86950
4rA1K 44 9,40628
55G3w 48 9,40009
8E483 47 4,26579
8FWn3 58 9,81829
8GKcR 59 9,52258
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18966
44iTY
17 61,1% 36 7.843E-13
2 phalp2_23772
15aka
4 51,0% 47 1.079E-12
3 phalp2_28956
5LhfG
1 53,8% 39 2.449E-10
4 phalp2_1201
8ENBB
6 40,8% 49 8.788E-10
5 phalp2_24531
7CwK0
7 46,8% 47 1.665E-09
6 phalp2_28190
8EvJv
1 42,1% 38 2.292E-09
7 phalp2_1380
1h0Fx
4 40,0% 45 5.981E-09
8 phalp2_17651
6NSiH
6 50,0% 32 1.134E-08
9 phalp2_23266
4WI7r
1 55,1% 29 1.561E-08
10 phalp2_746
1OPDR
1 42,5% 40 1.561E-08

Domains

Domains
Representative sequence (used for alignment): G3e0 (47 AA)
Member sequence: 2gFc3 (50 AA)
1 47 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (G3e0) rather than this protein.
PDB ID
G3e0
Method AlphaFoldv2
Resolution 97.54
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50