Protein

Protein accession
A0A7M1RSX2 [UniProt]
Representative
1dQRh
Source
UniProt (cluster: phalp2_28351)
Protein name
Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase-like domain-containing protein
Lysin probability
95%
PhaLP type
VAL
Probability: 97% (predicted by ML model)
Protein sequence
MKFDNKTFLLKYEAWKNGADYWKDIRGINLGGNTQAEEPSPEELLLMDQNVLSILNAYNEGKDANIAEDIIKPLPFDTPLNEEHPILHKYKGGKDDSINTFVNRMGPLVGQLLNRYGYGDAAFYNVMRLLAYESNYGRSRVARRQHNYGGVGWNGKTYNTYKSDADFVKDYVRLMHTRYGAALRAKSTLDYARALKQKGYYTDSLENYSRNLRGMDSLVKAANYHRINHKDAYNYNVKFDDLVLDYEDTKNASPIIINSPSTRLPGTIRADVPTTLLGPTLEEIKAKQLRDLNKYKQLMYDSITLPSLPNILNLLPSNNFGKDSYGLKFWWRRGNNLHFKGGKDERIAGVNPLLQDKFNQDINLELGLPKNHFASHVFKDSDGFGYTLTDFYDAGQLDDVVAKPSEYDKKLFSITKKMFPNKLVRDLVYLDLAYNDEVSDYPFNDRLLNMILLYKLSGSPSITNKSKSKYNIIRSNYIPYYNHINAYPGSLVSELSHAYLYNNSIRKPIMPLDNKGVNGSDYKRRGSIEHLAHEVILPNLVNFIETGGKKYLKKAKIDANKEYDKHGHWWWNKGKD
Physico‐chemical
properties
protein length:576 AA
molecular weight:66209,1 Da
isoelectric point:9,12
hydropathy:-0,67
Representative Protein Details
Accession
1dQRh
Protein name
1dQRh
Sequence length
153 AA
Molecular weight
17168,97080 Da
Isoelectric point
9,22125
Sequence
MDMKVQSILNSYDTGKDSGLADDITKPMPYDTPLPEEHPILHKYKGGKNGGNDSISSFVSRLGPLVGQQLTRYGYGDTAYYNVMRQLAYESSYGKSDVARKQHNYGGVGWNGKTYTTYKSDADFVKDYVRLMHNRYGAALRARSTQDYAKALK
Other Proteins in cluster: phalp2_28351
Total (incl. this protein): 5 Avg length: 303,4 Avg pI: 9,31

Protein ID Length (AA) pI
1dQRh 153 9,22125
A0A7M1RSD5 378 9,08722
A0A8D9PFD2 70 10,18518
A0AAE7RX81 340 8,93043
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9152
6w7kU
5 58,0% 124 1.162E-34
2 phalp2_12447
pNVl
16 44,1% 111 1.113E-18
3 phalp2_32468
17iN5
5 31,8% 116 1.352E-05

Domains

Domains [InterPro]
Disordered region
GLUCO
Representative sequence (used for alignment): 1dQRh (153 AA)
Member sequence: A0A7M1RSX2 (576 AA)
1 153 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured phage cr124_1
[NCBI]
2772090 Crassvirales > Suoliviridae > Burzaovirus > Burzaovirus faecalis
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MT774406 [NCBI]
CDS location
range 59958 -> 61688
strand +
CDS
ATGAAGTTTGACAACAAGACATTTTAGTAGAAGTATGAGGCGTGGAAGAATGGCGCTGATTACTGGAAGGATATTAGAGGAATCAACTTGGGTGGAAACACCCAGGCTGAGGAACCTAGTCCAGAAGAGTAGCTGTAGATGGATCAGAATGTATAGTCTATACTTAATGCTTATAATGAAGGAAAGGATGCTAATATAGCTGAAGACATTATTAAGCCATTACCTTTTGACACTCCATTAAATGAAGAGCATCCTATACTTCATAAATATAAAGGTGGAAAAGATGATTCTATTAATACTTTTGTTAACAGAATGGGCCCTCTTGTAGGACAATAGCTGAACAGATATGGTTATGGTGATGCTGCGTTTTACAATGTAATGCGTTAGCTTGCATACGAATCTAATTATGGTAGATCTAGAGTTGCTAGAAGACAACACAATTATGGTGGAGTAGGCTGGAATGGTAAGACTTACAATACATATAAGAGCGATGCGGATTTCGTTAAGGATTATGTAAGGCTTATGCATACACGATATGGAGCAGCACTTAGAGCTAAATCTACATAGGATTATGCTAGGGCTCTCAAACAGAAGGGGTATTACACAGATTCTCTCGAGAATTATTCTAGAAACCTTCGAGGAATGGACAGCTTGGTAAAGGCGGCCAACTATCACAGAATTAATCATAAAGACGCTTACAACTATAATGTTAAGTTTGATGATCTTGTGTAGGATTATGAAGACACAAAGAATGCTAGTCCTATAATTATCAATTCGCCATCTACAAGATAGCCTGGTACTATTAGAGCGGATGTTCCAACAACTTTACTTGGTCCGACTTAGGAAGAGATAAAGGCTAAGCAATAGCGTGATCTTAATAAGTATAAACAGTAGATGTACGATAGTATAACATAGCCTTCACTTCCGAATATACTAAACTTGCTTCCATCTAATAACTTTGGCAAAGACTCTTATGGCTAGAAGTTCTGGTGGAGAAGAGGAAATAATTTGCACTTCAAAGGTGGTAAAGATGAAAGAATAGCAGGCGTGAATCCATTGTAGCAAGATAAGTTTAATCAAGATATAAATTTAGAACTTGGATTGCCTAAAAATCATTTCGCATCTCATGTTTTTAAAGACTCTGATGGTTTTGGATATACACTGACTGATTTTTATGATGCAGGACAACTTGATGACGTTGTTGCAAAACCATCAGAATACGATAAAAAGCTTTTTAGTATAACAAAAAAGATGTTTCCTAATAAATAGGTGAGAGATCTTGTTTATTAGGATCTTGCTTATAATGACGAAGTTTCAGATTATCCTTTTAATGATAGATTGCTAAATATGATATAGTTATACAAGCTTTCTGGATCCCCGTCTATTACTAATAAAAGCAAATCGAAATATAATATAATTCGTAGTAATTATATTCCATACTATAATCATATAAACGCGTATCCTGGATCACTTGTTTCAGAATTATCTCATGCATATTAGTATAATAATTCAATAAGGAAACCTATTATGCCATTAGATAACAAAGGTGTCAATGGATCAGATTACAAAAGAAGAGGAAGTATAGAGCATTAGGCTCATGAGGTAATCTAGCCAAATTTGGTAAATTTCATAGAAACTGGTGGCAAGAAGTATTTAAAGAAAGCTAAAATAGATGCAAATAAAGAATACGACAAACATGGTCATTGGTGGTGGAATAAAGGAAAGGATTAA

Gene Ontology

Description Category Evidence (source)
GO:0004040 amidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1dQRh) rather than this protein.
PDB ID
1dQRh
Method AlphaFoldv2
Resolution 71.85
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50