Protein

Protein accession
A0A5S8Y0I9 [UniProt]
Representative
6S30T
Source
UniProt (cluster: phalp2_319)
Protein name
Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MATIVDALVVTLGFDLSAFKRGKSEAGAATKKLTAEERAAAKEIEERNKKAAESFRSIRNEVLALVAIFTAGVGIKQFAESTINSAVNLGYMAQNLQMSTRDLAAWQRAAERAGGSAEGITAALQASQNDVSKLKFGQVTEGVQWFLRMGGSVKDLKDGNSYLLARARIISNMFKTDPGRARFIAQQMGIGDGEFNFLKQGEGAVLALVDAQKKNSAVTEQQAAQALKLRNAWLDLRDRLQYVGTTVLLELMPTFEKLLGKLQSMADWVADHKADISAWIDRAVTAVQQFVEWADKGAQAVGGWKNVLIAFAGLKLLSMASGVLSLAGALLKLGGALGGVSTAGASALPILGRLLGIAGLALYSQGLNDGEDQTRLTQPGDTWDGDPVGKARAAANNGSLNDRRRYLMGRLKEAGYTDAQAAGITGSLQQESQLDPNAVNKTSGNYGLAQWGKARAAQFEKQFGKPIQQSTFGEQVDFMLWELKNTEKTADQRIRMAKTPEFAAEVHAREYERPGANEINIPRRQQYAREAAGIGADQPGAVATADEAQRANALKIAQQTATAAAPAVSNSTSTSTTSNETNINGPITVHTQATDAAGIARDIGGAMKRYGFVVPQANTGLS
Physico‐chemical
properties
protein length:622 AA
molecular weight:66254,1 Da
isoelectric point:9,18
hydropathy:-0,25
Representative Protein Details
Accession
6S30T
Protein name
6S30T
Sequence length
309 AA
Molecular weight
32143,60630 Da
Isoelectric point
9,15369
Sequence
VGGWKNVLIAFAGLKLLSMASGVLSLAGALFKLGGALGGVSTAGASALPILGRLLGIAGLALYSQGLNEGEEQTRLTQPGDTWDGDPVGKARAAANSGSLADRRRYLVGRLKEAGYTDAQVAGIAGSLQQESQLDPTAVNKTSGAAGIAQWLGPRARQFEKQFGHSLAQSTFGEQVDFMLWELKNTEKQADQRLRMAKTADAAAEIHSREYERPGAAEANIARRQQYAREVFAGLGQANAAQIAQQTAAAAAPAGGNTSSTTTNTNEMHVNGPITVHTQATDAVGVARDLGGALRRYSFVVPQANTGLS
Other Proteins in cluster: phalp2_319
Total (incl. this protein): 4 Avg length: 539,3 Avg pI: 9,15

Protein ID Length (AA) pI
6S30T 309 9,15369
A0A3B8DY94 613 9,13873
A0A3B8DYD5 613 9,13873
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18863
1GZe6
1 47,0% 202 4.737E-48
2 phalp2_38467
1jrO8
5 39,6% 202 2.883E-24
3 phalp2_34736
3atQ7
4 36,5% 208 1.719E-17
4 phalp2_14835
7srGX
1 30,1% 219 3.972E-12
5 phalp2_19142
1KMw9
1 25,8% 317 3.989E-07
6 phalp2_25983
6FLLr
2 26,4% 212 5.305E-07

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Burkholderia phage PE067
[NCBI]
1735698 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KT803877 [NCBI]
CDS location
range 12505 -> 14373
strand +
CDS
TTGGCGACGATCGTCGATGCTCTCGTTGTAACACTCGGCTTCGACCTGTCCGCCTTCAAGCGCGGCAAGAGCGAGGCCGGCGCGGCTACGAAGAAGCTCACCGCCGAGGAGCGGGCCGCCGCCAAGGAAATCGAGGAGCGCAACAAGAAGGCGGCCGAATCGTTCCGCAGCATCCGCAACGAGGTGCTCGCGCTCGTCGCGATCTTCACGGCCGGCGTCGGCATCAAGCAGTTCGCCGAGAGCACGATCAACTCCGCGGTGAACCTGGGCTACATGGCCCAGAACCTCCAGATGAGCACGCGCGATCTCGCCGCGTGGCAGCGCGCCGCTGAGCGCGCGGGCGGCTCAGCCGAAGGCATCACGGCCGCGCTGCAGGCCTCCCAGAATGACGTCTCGAAGCTGAAGTTCGGCCAAGTCACCGAGGGCGTGCAGTGGTTCCTGCGCATGGGTGGCTCCGTCAAGGATCTGAAGGACGGCAACAGCTACCTGCTCGCCCGCGCGCGGATCATCTCCAACATGTTCAAGACCGACCCGGGCCGCGCGCGCTTCATCGCGCAGCAGATGGGCATCGGTGACGGCGAGTTCAATTTCCTGAAGCAGGGCGAGGGCGCGGTGCTCGCGCTCGTCGACGCGCAGAAGAAGAACTCGGCCGTCACGGAACAACAGGCGGCGCAGGCGCTGAAGCTGCGCAACGCATGGCTCGACCTGCGAGACCGGTTGCAGTACGTCGGCACGACCGTCCTGCTCGAGCTGATGCCGACGTTCGAGAAGCTGCTCGGCAAGCTGCAGAGCATGGCGGACTGGGTGGCCGACCACAAGGCGGATATCAGCGCATGGATCGACCGCGCGGTGACGGCGGTGCAGCAGTTCGTCGAGTGGGCTGACAAGGGCGCGCAGGCCGTCGGCGGCTGGAAGAATGTGCTGATCGCGTTCGCGGGCCTGAAGTTGCTTTCGATGGCCTCGGGCGTGCTGTCGCTGGCTGGCGCGCTGCTGAAGCTCGGCGGCGCTCTCGGTGGCGTCAGCACCGCTGGCGCGAGCGCGCTACCGATCCTCGGGCGCCTGCTCGGTATCGCCGGGCTCGCGCTGTACAGCCAGGGGCTCAACGATGGCGAGGATCAAACGCGCCTCACGCAGCCCGGCGATACGTGGGATGGCGACCCGGTCGGTAAGGCGCGCGCGGCCGCAAACAACGGCTCGCTGAACGATCGCCGCCGTTACCTGATGGGACGTCTGAAGGAAGCCGGCTATACCGACGCTCAGGCGGCGGGCATCACCGGAAGCCTGCAACAGGAAAGCCAGCTCGACCCGAACGCCGTCAACAAGACATCCGGCAACTACGGGCTCGCGCAATGGGGGAAGGCACGCGCAGCGCAGTTCGAGAAGCAGTTCGGCAAGCCGATTCAGCAGTCGACCTTCGGCGAGCAGGTCGACTTCATGCTCTGGGAGCTGAAGAACACCGAGAAGACGGCCGACCAGCGCATCAGGATGGCGAAGACACCCGAATTCGCTGCCGAGGTACATGCGCGCGAATACGAGCGGCCCGGCGCAAATGAGATCAACATCCCGCGCCGGCAGCAGTATGCGCGCGAAGCTGCAGGCATTGGCGCCGATCAACCGGGCGCGGTAGCGACCGCAGACGAAGCGCAACGCGCGAACGCACTGAAGATCGCACAGCAAACCGCAACGGCGGCCGCGCCGGCTGTCAGTAACTCGACATCGACCAGCACGACGTCGAACGAGACAAACATCAACGGGCCGATCACCGTGCACACGCAAGCGACGGACGCCGCGGGCATTGCGCGCGACATCGGCGGCGCCATGAAGCGATACGGCTTTGTCGTGCCGCAGGCCAACACGGGACTGAGCTGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6S30T) rather than this protein.
PDB ID
6S30T
Method AlphaFoldv2
Resolution 71.93
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50