Protein

Protein accession
2Dhy7 [EnVhog]
Representative
16sLM
Source
EnVhog (cluster: phalp2_30950)
Protein name
2Dhy7
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKFIWPCGSLANVLRGFWYAASFYYLGRHYAVDIAVPWWTPIKSSLAGKVAVRSYDSESGYKVYIDSAIGDGIIIRCCYRHMTGPAKVTIGSSVSQGQIIGYVGSTGNSLGPHLHFDLWTNKIIRDDSIVWKPAAGFYAVDPAIYLGQEAALTTAEVEAVIRRTTTDAAIAQAIGRIAREGLLGLDKGIVGSKKLLDIISKMQAEIRDLQT
Physico‐chemical
properties
protein length:211 AA
molecular weight:23018,3 Da
isoelectric point:8,91
hydropathy:0,13
Representative Protein Details
Accession
16sLM
Protein name
16sLM
Sequence length
199 AA
Molecular weight
22120,25450 Da
Isoelectric point
8,75025
Sequence
MIWPCGSLADVTRGFWFPSDGYYLKKHYAVDLGKRVTWGTRIASCLPGKVSYKAYDPVYSGHYIYVDTPMGDGIVIRCCYRHLLESSPLKVGAPVAQGQIIGRVGATGDAQGPHLHFDMWVNKPIRDDSIIWKARVGRYAVDPLIYLGQEVCVTKAEVEKIVRDTTLTHDARVKTIPGWIDALLANVGTLNEHTKHPPA
Other Proteins in cluster: phalp2_30950
Total (incl. this protein): 2 Avg length: 205,0 Avg pI: 8,83

Protein ID Length (AA) pI
16sLM 199 8,75025
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_39714
fccV
7 38,2% 170 3.648E-28
2 phalp2_39127
2AUvp
1 30,1% 146 4.255E-11
3 phalp2_5764
7GfkS
23 31,2% 125 1.059E-10
4 phalp2_5756
7Czmx
45 30,1% 159 4.829E-10
5 phalp2_10419
1ZZt0
3 30,2% 152 6.537E-10
6 phalp2_27905
l4Bb
39 33,5% 131 6.031E-08
7 phalp2_14848
7w0BH
74 33,8% 142 3.642E-07
8 phalp2_9303
7pYyV
24 32,6% 156 7.175E-06
9 phalp2_30590
66hsO
9 29,8% 124 3.158E-05

Domains

Domains
Representative sequence (used for alignment): 16sLM (199 AA)
Member sequence: 2Dhy7 (211 AA)
1 199 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (16sLM) rather than this protein.
PDB ID
16sLM
Method AlphaFoldv2
Resolution 78.37
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50