Protein

Protein accession
BGRv [EnVhog]
Representative
20DiK
Source
EnVhog (cluster: phalp2_16893)
Protein name
BGRv
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MLTVEKRIISRNFTRAGAGRKIEYIVIHYFGSLGTAAAVANYFAGADRQASAHYCLDEGNIVYQCVEDNNIAWHCGTSGGYVHPRCRNANSIGIEVRPYKLDKTTPPGGGGGGGGVCAKTDRDPP
Physico‐chemical
properties
protein length:125 AA
molecular weight:13361,9 Da
isoelectric point:8,64
hydropathy:-0,34
Representative Protein Details
Accession
20DiK
Protein name
20DiK
Sequence length
105 AA
Molecular weight
11585,87920 Da
Isoelectric point
8,98556
Sequence
VNIIKNITTVNRTVYSNRPIDYIVIHYFGALGSAASTCAYFKSVNRSASAHYFVDGDGVWQCVEDKDASWHCGDSGKGAFKNRCMNRNSIGIEVRPYKLNTATAS
Other Proteins in cluster: phalp2_16893
Total (incl. this protein): 4 Avg length: 124,3 Avg pI: 8,57

Protein ID Length (AA) pI
20DiK 105 8,98556
20DdQ 105 9,00593
69dwP 162 7,65453
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14283
3qOze
23 47,6% 84 1.725E-23
2 phalp2_18703
nfVS
25 50,5% 93 5.606E-22
3 phalp2_30401
4NIg7
17 45,7% 94 1.614E-15
4 phalp2_38069
6uYUB
10 46,9% 83 5.236E-14
5 phalp2_6829
8nZie
4 41,3% 75 4.382E-12
6 phalp2_17574
6jj3R
3 41,9% 81 1.774E-09
7 phalp2_20835
6rXam
1 34,4% 87 1.617E-08
8 phalp2_25840
5j1TG
1 33,7% 86 5.192E-07
9 phalp2_14312
3Q436
4 41,7% 67 1.831E-06
10 phalp2_6395
6t0P
6 34,1% 85 5.827E-05

Domains

Domains
Representative sequence (used for alignment): 20DiK (105 AA)
Member sequence: BGRv (125 AA)
1 105 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (20DiK) rather than this protein.
PDB ID
20DiK
Method AlphaFoldv2
Resolution 91.76
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50