Protein

Protein accession
32VuI [EnVhog]
Representative
2RgXy
Source
EnVhog (cluster: phalp2_32827)
Protein name
32VuI
Lysin probability
95%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MNIFRKAAPAAALLAMMIIVPAPAAQSMTPKNDSVSSMSVVTAQKTVLQLNIPNKKVIASQGHLSTNTNPRMIKKLVRRAWYNKEFRWHSTTQWNCFNQLIMHESSWDPWATNGRGWEETGGIPQAHPSYKMASEGKDYVRNVWTQVKWGLRYIYSVYKSPCHAWSRWQDRAGSGSYGWY
Physico‐chemical
properties
protein length:180 AA
molecular weight:20624,4 Da
isoelectric point:10,01
hydropathy:-0,52
Representative Protein Details
Accession
2RgXy
Protein name
2RgXy
Sequence length
155 AA
Molecular weight
17938,18020 Da
Isoelectric point
10,35080
Sequence
VVALLAALVVVQPPAAQSMTIQEAPATLVKKVRYSATPLSTNTRPSTIRSKVYRAWKNREFRWSSETQWRCFDNIIKKESAWSPWSTNGIGWEETGGIPQAHPSRKMRTAGRDYRTNAWTQVRWGLGYIYSVYGTPCRAWSKWQYRAATQRYGWY
Other Proteins in cluster: phalp2_32827
Total (incl. this protein): 5 Avg length: 167,6 Avg pI: 9,71

Protein ID Length (AA) pI
2RgXy 155 10,35080
181mT 173 7,78006
Sy9N 168 9,92434
TP1Q 162 10,48670
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28896
5ouYs
43 41,0% 100 1.202E-24
2 phalp2_30224
4aiz2
5 40,1% 117 7.922E-24
3 phalp2_34474
4HktX
2 36,8% 103 6.245E-16
4 phalp2_2932
U3n1
1044 37,7% 98 2.978E-15
5 phalp2_28855
57haC
91 32,4% 108 1.419E-14
6 phalp2_33116
4KV2m
1 32,6% 98 2.076E-12
7 phalp2_14148
2iIxu
157 28,1% 149 6.329E-11
8 phalp2_29608
7ZKXR
3 28,1% 153 2.982E-10
9 phalp2_5567
3Wp4H
119 27,1% 103 1.052E-07
10 phalp2_33229
5nuDD
24 28,1% 103 3.072E-06

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): 2RgXy (155 AA)
Member sequence: 32VuI (180 AA)
1 155 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2RgXy) rather than this protein.
PDB ID
2RgXy
Method AlphaFoldv2
Resolution 78.82
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50