Protein

Protein accession
1OAI7 [EnVhog]
Representative
49rGR
Source
EnVhog (cluster: phalp2_21788)
Protein name
1OAI7
Lysin probability
96%
PhaLP type
endolysin
Probability: 94% (predicted by ML model)
Protein sequence
MLIALIILFSVTLIGVRASSKVITYPPLEEVNELRFLKCIELVENSRGKTGKRGEYGIYQIHPATWKEHSNIPMLMVPESVQRKVALNILRHYAKIIERRGDVVNNYSLAIAWCAGPYAKRISSHACNYAYRVINLYPATQ
Physico‐chemical
properties
protein length:141 AA
molecular weight:16046,6 Da
isoelectric point:9,57
hydropathy:0,02
Representative Protein Details
Accession
49rGR
Protein name
49rGR
Sequence length
180 AA
Molecular weight
20531,20400 Da
Isoelectric point
9,85246
Sequence
MNTAKVIKAAIAMIVVAVMINVVWSSNKPIKQWPKVTEVDEQKMLESIRQIENSGWRTGARGEKGPYQIHPNTWDEHCNIPVSLAPKSYHDRIALSILRQFGSILRKRNIEVNAYNLALAWCSGPYYKKPSSRAVSYASRLRNYYESINESQYSQSQQLPGVRYKPAGERDSRSFAQGVR
Other Proteins in cluster: phalp2_21788
Total (incl. this protein): 10 Avg length: 161,4 Avg pI: 9,64

Protein ID Length (AA) pI
49rGR 180 9,85246
2RTAP 145 9,99055
4e4R5 141 9,16194
4rRq5 202 9,76781
50Wn3 174 9,39848
5lV2I 202 9,66525
5wdXn 146 9,74029
7XcSD 145 9,69587
hYcM 138 9,50794
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24226
2SbQh
14 36,6% 112 5.455E-21
2 phalp2_23156
4IKzD
8 34,7% 115 1.684E-19
3 phalp2_16115
4fBEl
12 24,4% 127 7.834E-10
4 phalp2_101
55Nbc
7 23,3% 124 1.654E-08
5 phalp2_16968
7Zc1N
2 21,8% 137 4.599E-04

Domains

Domains
Unannotated
Disordered region
Representative sequence (used for alignment): 49rGR (180 AA)
Member sequence: 1OAI7 (141 AA)
1 180 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (49rGR) rather than this protein.
PDB ID
49rGR
Method AlphaFoldv2
Resolution 80.11
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50