Protein

Protein accession
2utEq [EnVhog]
Representative
5GIW3
Source
EnVhog (cluster: phalp2_17535)
Protein name
2utEq
Lysin probability
92%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNRHFRTLSLTLLAIALALVVFALASAITNFYLKGGSPQHPTAGPALAHVAEIPLPAPNIATGIRSAPTPLNEVRCLADAIYFEARGESVANQIAVAQVVVNRVRHSAYPSTVCGVVYQGAGGLVCQFSWACEPHFIHDRTAYARSLDLAQFVMLNFKHSNTLPDLVDGATHFHNTTVSPGWAERLRPTATIDGHLFFAKADPRRTRG
Physico‐chemical
properties
protein length:208 AA
molecular weight:22493,5 Da
isoelectric point:9,07
hydropathy:0,13
Representative Protein Details
Accession
5GIW3
Protein name
5GIW3
Sequence length
216 AA
Molecular weight
24418,13930 Da
Isoelectric point
9,37875
Sequence
MNVLVERLSPTIQRKLMKTKGQILYTVMDFTLRTLLSALMALLFILLAVKAAESRHPETFVSVELSVPAYKGVQHSFADIEDMARNIYYESRGQPDLGQVAVAHVVLNRLRAGVFANTVHGVIYQRLPVCQFSWVCQPRYPIHDQVAWTKAMVIAQNVLEGNIADPTSGSLYFHVTTLGLSSQDDKIICIKDHVFYNKKPIVYAKDNNRTTPTRRI
Other Proteins in cluster: phalp2_17535
Total (incl. this protein): 2 Avg length: 212,0 Avg pI: 9,22

Protein ID Length (AA) pI
5GIW3 216 9,37875
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19867
5DmT3
416 36,1% 141 9.960E-24
2 phalp2_15462
87Yrl
67 35,1% 145 1.189E-22
3 phalp2_26324
1bAeu
16 36,1% 144 1.677E-20
4 phalp2_32502
1hod3
9 29,5% 159 9.300E-17
5 phalp2_3334
38dW3
10 29,0% 196 3.170E-16
6 phalp2_37567
3Q8a9
8 32,2% 161 6.757E-15
7 phalp2_38710
V6TC
7 29,7% 141 1.425E-13
8 phalp2_23168
4KMdc
1 25,3% 154 9.967E-12
9 phalp2_640
1dgUf
3 24,1% 174 1.004E-08
10 phalp2_17317
4iq6x
1 31,2% 157 1.823E-08

Domains

Domains
Unannotated
Hydro_2
Representative sequence (used for alignment): 5GIW3 (216 AA)
Member sequence: 2utEq (208 AA)
1 216 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF07486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
2utEq
Method AlphaFoldv2
Resolution 83.71
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5GIW3) rather than this protein.
PDB ID
5GIW3
Method AlphaFoldv2
Resolution 84.94
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50