Protein

Protein accession
4520s [EnVhog]
Representative
3dGxn
Source
EnVhog (cluster: phalp2_37499)
Protein name
4520s
Lysin probability
97%
PhaLP type
VAL
Probability: 89% (predicted by ML model)
Protein sequence
MDRFLRLVGRPYGFPSHPPETFDCWTLVKHVRGSMGLPCPLPFGDTEEWCVPGNLARATSAARPMWHTRPYPIEGDMAVLEPAHVGVFLANGVLHALSRNSSVVWTSLPVIRRVWPKAEWWTV
Physico‐chemical
properties
protein length:123 AA
molecular weight:13888,9 Da
isoelectric point:8,61
hydropathy:-0,07
Representative Protein Details
Accession
3dGxn
Protein name
3dGxn
Sequence length
144 AA
Molecular weight
16188,94830 Da
Isoelectric point
9,36753
Sequence
MSLRSPWRKKIAPPPPPVTVVPSAVRTLVGQPYAFPSFPPVSFDCWTLVKYVRELHSLPCPLPFNEKAPWCVPEYMPSAIKLAAPHWVIVAEPAQMSMAVMERQHVGVVVDDGVLHALARNASVVWTPMKGILRKWPNTEWWTA
Other Proteins in cluster: phalp2_37499
Total (incl. this protein): 4 Avg length: 128,5 Avg pI: 8,39

Protein ID Length (AA) pI
3dGxn 144 9,36753
5ErSi 124 9,09534
6Sab8 123 6,49240
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18081
3SQ2q
120 26,9% 115 9.510E-12
2 phalp2_8929
3gJCo
23 32,4% 111 1.299E-11
3 phalp2_14093
8iNl2
704 26,8% 119 1.573E-10
4 phalp2_6414
eXTW
41 26,9% 126 1.456E-07
5 phalp2_19944
6EYn3
118 33,5% 128 1.983E-07
6 phalp2_36398
HQJh
44 27,2% 110 5.011E-07
7 phalp2_23293
5dNMt
14 26,3% 114 3.187E-06
8 phalp2_37361
7nTsk
49 27,5% 127 5.898E-06
9 phalp2_26857
47Pe5
2 25,0% 124 1.091E-05
10 phalp2_26210
j4zT
4 26,8% 123 1.483E-05

Domains

Domains
Unannotated
Representative sequence (used for alignment): 3dGxn (144 AA)
Member sequence: 4520s (123 AA)
1 144 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3dGxn) rather than this protein.
PDB ID
3dGxn
Method AlphaFoldv2
Resolution 81.35
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50