Protein

Protein accession
2DG4n [EnVhog]
Representative
46YCk
Source
EnVhog (cluster: phalp2_39266)
Protein name
2DG4n
Lysin probability
86%
PhaLP type
VAL
Probability: 98% (predicted by ML model)
Protein sequence
MGMFDSLNEKQRATAEKVMASAEKYGVPKTLAFGMAMQESGFDQSKHSKTGPVGVMMLGRKAAKDMGVDRYNEDENIDGGMRYARQLLDKHNGDWDKTLVAYHDGPNSPYFKGGEMSPEAKTHIQKVKGYADMAGPNSTQFSTDVEDIEPLDLSGINDTTQTVDQGRDMDVTDAMSGGAGALAGAMGGADRKRTEANVRAQTANANVVRDANRTAQLQHQDQLKNLDRQDKYALELRKAAMEHAAAMQKASEQAAATAQKAAGQDPRNWIRGQFGGDIADVEGRNVLSQAEAQQAGTQGVSRVRQAQSMMPGAAPDPTTGLWLGQDVLASRPAAPVPQAPKVPNFPMPKPLPRPNAPAALPVPHIEAGSRIGSALNAGTGAIMGQQAKEAILAAQEGDYLHSGLSGLASAGAGAATYSRNPKTKAIGAGVVAAAKGADYLKDYIMNKINPQPAGQ
Physico‐chemical
properties
protein length:455 AA
molecular weight:48007,3 Da
isoelectric point:6,73
hydropathy:-0,58
Representative Protein Details
Accession
46YCk
Protein name
46YCk
Sequence length
453 AA
Molecular weight
47999,32730 Da
Isoelectric point
6,77353
Sequence
MGMFDSLNEKQRATAEKVMAAAEKYGVPKTLAFGMAMQESGFDQNKHSKTGPVGVMMLGRKAAKDMGVNRYDEDENIDGGMRYARQLLDKHNGDWDKTLVAYHDGPNSPYFKGGEMSPEAKTHIQKVKGYADMAGPTSTQFSTDVEDIEPLDLSGINDTTQTVDQGRDMDVTDAMSGSAGALAGAMGGAERKRTEANVRAQTANAHSVRDANRAAQLQHQEQLKNLDRQDKYALELRKAAMEHAAAVQKAADQTAQKAAGQDPRNWIRGQFGGDIADVEGRNVLSQAEAQQAGTQGVGRVRQAQAMMPGAAPDPTTGLWLGQDVLATRPAAPAPQAPKVPNFPMPKPLLRPTAPVPQPVPHIEAGSRIGSALNAGTGAIMGQQAKDAILAAQQNDWLRAGLDALTSGGAAAATYSHNPKTKAIGAGVAVAAKGADYLKDYIMNKINPQPPAGQ
Other Proteins in cluster: phalp2_39266
Total (incl. this protein): 3 Avg length: 650,7 Avg pI: 7,59

Protein ID Length (AA) pI
46YCk 453 6,77353
A0A6J7WNC7 1044 9,27521
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9601
1uHUx
19 27,7% 475 2.469E-36
2 phalp2_14379
4oC9w
41 27,3% 461 1.360E-27
3 phalp2_33327
6zj4x
29 24,0% 466 1.524E-22

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (46YCk) rather than this protein.
PDB ID
46YCk
Method AlphaFoldv2
Resolution 53.89
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50