Protein

Protein accession
20DiK [EnVhog]
Representative
20DiK (this protein)
Source
EnVhog (cluster: phalp2_16893)
Protein name
20DiK
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VNIIKNITTVNRTVYSNRPIDYIVIHYFGALGSAASTCAYFKSVNRSASAHYFVDGDGVWQCVEDKDASWHCGDSGKGAFKNRCMNRNSIGIEVRPYKLNTATAS
Physico‐chemical
properties
protein length:105 AA
molecular weight:11585,9 Da
isoelectric point:8,99
hydropathy:-0,27
Other Proteins in cluster: phalp2_16893
Total (incl. this protein): 4 Avg length: 124,3 Avg pI: 8,57

Protein ID Length (AA) pI
20DdQ 105 9,00593
69dwP 162 7,65453
BGRv 125 8,64175
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14283
3qOze
23 47,6% 84 1.725E-23
2 phalp2_18703
nfVS
25 50,5% 93 5.606E-22
3 phalp2_30401
4NIg7
17 45,7% 94 1.614E-15
4 phalp2_38069
6uYUB
10 46,9% 83 5.236E-14
5 phalp2_6829
8nZie
4 41,3% 75 4.382E-12
6 phalp2_17574
6jj3R
3 41,9% 81 1.774E-09
7 phalp2_20835
6rXam
1 34,4% 87 1.617E-08
8 phalp2_25840
5j1TG
1 33,7% 86 5.192E-07
9 phalp2_14312
3Q436
4 41,7% 67 1.831E-06
10 phalp2_6395
6t0P
6 34,1% 85 5.827E-05

Domains

Domains
Protein sequence: 20DiK
1 105
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
20DiK
Method AlphaFoldv2
Resolution 91.76
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50