Protein

Protein accession
46RgI [EnVhog]
Representative
46RgI (this protein)
Source
EnVhog (cluster: phalp2_37693)
Protein name
46RgI
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIDHQRLIRAIAAVEGQPWDSPGGGLQYTKATWYDYTRIPYHKAKDKAAATTVALRILEDAMSRLTKAGAEPTVYLLALRWRYGYAGMIQRKHTTDNDYAVRVQNLYLDPSFA
Physico‐chemical
properties
protein length:113 AA
molecular weight:12848,5 Da
isoelectric point:9,39
hydropathy:-0,42
Other Proteins in cluster: phalp2_37693
Total (incl. this protein): 40 Avg length: 133,7 Avg pI: 9,39

Protein ID Length (AA) pI
1KmQ2 136 10,26583
1h4AO 136 10,15527
1okOk 132 9,20339
1rJ26 141 9,41182
22O3m 135 9,18044
2LKrl 146 10,60539
2YC92 144 9,46314
3f1A2 146 6,05008
41Nb2 119 9,68123
49wDK 138 9,06040
4GCTr 130 9,95200
4Nup5 125 9,34116
4NwEs 125 9,34116
4V9t2 144 8,77333
4YAdS 138 9,51381
4bLwk 144 9,24955
57KOX 124 9,71547
57Rp6 127 9,02933
5a58l 144 9,32898
5aKd0 146 8,99129
5koiQ 127 9,50910
5wTA9 146 8,99129
5zpEd 136 10,15372
6xTFF 144 9,00980
6y2vc 130 9,50859
7XVve 135 9,32730
84vxk 144 9,32898
8ebDk 144 9,46314
8msaf 118 9,33678
8mw2t 114 9,64139
8nkZg 141 9,65209
8ruCL 100 9,66170
8sFWj 144 9,26000
8sG3M 144 9,32898
Ca2q 119 9,61535
ItKr 119 9,61451
SBMs 125 9,62218
aSqC 144 9,14924
hXCy 141 9,73693
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_2584
6Dn98
10 30,1% 116 7.594E-18
2 phalp2_24226
2SbQh
14 34,8% 112 1.305E-16
3 phalp2_14452
4GjIY
569 33,8% 118 2.239E-15
4 phalp2_20608
4HuY8
4 31,0% 116 3.071E-15
5 phalp2_18954
3NSCQ
6 32,4% 117 1.086E-14
6 phalp2_16115
4fBEl
12 31,8% 116 3.500E-13
7 phalp2_39860
14Mmd
7 30,5% 118 1.920E-10
8 phalp2_26418
1Jbra
6 32,1% 115 3.270E-09
9 phalp2_23156
4IKzD
8 36,9% 111 2.160E-08
10 phalp2_28028
6WGGg
2 27,5% 116 4.051E-08

Domains

Domains
Unannotated
Protein sequence: 46RgI
1 113
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
46RgI
Method AlphaFoldv2
Resolution 96.94
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50