Protein
- UniProt accession
- A0A1D7RAG5 [UniProt]
- Protein name
- Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase-like domain-containing protein
- PhaLP type
-
VAL
evidence: ML prediction
probability: 76 % (predicted by ML model)
- Protein sequence
-
MATVQKSTKINFYKFVATKDVSPSAKGADEGTVATVKVLNKNTEALNNIGSVLNGIAKIAQDLKKVMLMQLDAQQRKNVGTFDAQYTKTKKEKKSGFVAGIVGKSVSFLEGLLTTFSNLFKLLVVIPALKWLSDPKNQETIATALRIIRSVVTFIFDWAKFGITNTIDGLYNLLRDDASWWDRIVGLGQAIAGIGAVVLGIRYLSNPLKLVKDITNGVRALVRFVLGGGRGGGGGGKKPRGRFGGAMRLLGGAAIVGGSAYAISQMNQPEEKSQGGSVKILPSRSQGGWISGPQSGYKVSLDGGRSTSFIGHGTEYVARKANGGAFVVPFNTPGTKTQPHLTQKRLGEAKRLGYFSNGGEITGNNEQKWSKVMGLGKAAGAKYPELVAAQFALESAWGTALSAKNNFFGIKATGNESATVSNTREVINGRSVYVDARFKNFSTPFDAINHLVTQWYKDYRGYTGVNNAPDAFAAASKLQKEGYATDPQYTKHLSRLMQQYSNLRGAQVTAPTQQPSGNPFMNFFGNLANSVLGIGGANAAEHGGNDNQGRAPEYTGGTADVVGISHPDTGAGYGIKDQKDQHGRPLAFSQPAAEQFSKALKASGMNLGAYIASTGRSDAKNNSVGGHPNSHHMYGEAIDMNGDGYEWMRKNGRKYGWQYVYNHGPGSAHFKYVGPGAGSTPKLGPPGSKPKVGPSVTGGSSSSTSTHSSGTREETKKKVTLADIMGGRSTRPVPLENYKTGNINSHGDDRSIIDATEERNRARQRANEKSSQMVQAAIEAVRLSNTSNEQIIMQAQQGIQQAMQMGSGNSQPQVIPTGGGRGAVKSVVSSLASSINPLRGIFK
- Physico‐chemical
properties -
protein length: 843 AA molecular weight: 89683,00000 Da isoelectric point: 9,92351 aromaticity: 0,07829 hydropathy: -0,42586
Domains
Domains [InterPro]
Taxonomy
Name | Taxonomy ID | Lineage | |
---|---|---|---|
Phage |
Synechococcus phage S-RIM2 [NCBI] |
687800 | Kyanoviridae > Nerrivikvirus > Nerrivikvirus srim2 |
Host | No host information |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
AON98388.1
[NCBI]
Genbank nucleotide accession
KX349230
[NCBI]
CDS location
range 9132 -> 11663
strand +
strand +
CDS
ATGGCAACGGTACAAAAAAGTACTAAAATCAATTTCTACAAGTTTGTAGCAACAAAAGATGTCTCACCATCTGCGAAGGGTGCTGATGAGGGAACGGTTGCAACTGTAAAAGTCCTTAATAAAAATACTGAAGCACTTAATAATATTGGTAGTGTCTTGAATGGCATTGCTAAGATCGCTCAAGACCTTAAAAAGGTCATGTTGATGCAACTTGATGCACAACAAAGAAAGAATGTTGGAACTTTTGACGCACAATATACCAAAACCAAAAAAGAGAAAAAGAGTGGATTTGTTGCTGGTATTGTAGGCAAATCAGTATCCTTCTTGGAGGGATTATTAACGACATTTTCCAACCTCTTTAAGTTATTAGTTGTTATACCTGCTCTCAAATGGTTATCAGATCCTAAGAATCAAGAAACAATAGCAACAGCACTGAGAATTATTAGATCAGTAGTTACATTCATCTTTGATTGGGCAAAGTTTGGTATTACCAACACCATTGATGGACTGTATAATCTTCTAAGGGACGATGCGTCTTGGTGGGATAGGATAGTTGGTCTTGGTCAAGCAATTGCAGGTATTGGGGCGGTTGTCCTGGGTATCAGATATCTTTCAAATCCTCTGAAACTTGTAAAGGACATTACCAACGGCGTTAGAGCACTCGTCAGATTCGTCCTTGGTGGCGGTAGAGGAGGTGGAGGAGGTGGCAAGAAACCTCGCGGACGCTTTGGTGGAGCAATGCGCTTGCTCGGTGGCGCAGCGATCGTTGGTGGTTCTGCCTATGCCATCAGTCAGATGAACCAACCTGAGGAGAAATCCCAAGGAGGTTCAGTAAAGATTCTTCCAAGTAGATCACAAGGAGGTTGGATTAGTGGACCACAATCGGGATATAAGGTATCTTTGGACGGGGGGAGATCAACCTCGTTCATCGGACATGGAACTGAGTATGTTGCTAGAAAGGCAAATGGGGGAGCTTTCGTCGTTCCTTTTAATACTCCTGGAACAAAAACACAACCCCATCTGACTCAGAAGAGACTTGGTGAGGCAAAGAGACTTGGATATTTCTCTAATGGTGGAGAGATAACTGGTAATAATGAACAGAAATGGTCCAAGGTCATGGGTCTTGGCAAAGCAGCAGGTGCTAAGTATCCAGAACTTGTCGCTGCACAGTTTGCTCTAGAAAGTGCATGGGGCACAGCACTATCTGCTAAAAACAACTTCTTCGGTATCAAAGCAACTGGTAATGAATCCGCCACAGTTTCCAACACAAGAGAGGTTATTAATGGTCGAAGTGTTTATGTTGATGCAAGATTTAAGAACTTTAGCACACCTTTTGATGCTATCAATCATCTAGTAACACAATGGTATAAAGATTATAGGGGGTATACTGGTGTTAACAATGCACCAGACGCTTTTGCTGCTGCGTCTAAGTTGCAGAAAGAAGGTTATGCTACAGATCCTCAATATACCAAGCATCTATCCAGATTGATGCAACAATATTCTAATCTAAGAGGGGCACAGGTTACGGCACCAACACAACAACCTAGTGGTAATCCCTTTATGAACTTCTTTGGGAATCTAGCGAACTCTGTACTGGGTATCGGTGGCGCAAACGCAGCTGAGCATGGAGGTAATGATAATCAAGGTAGAGCACCTGAATATACTGGCGGTACGGCTGATGTGGTAGGTATTTCTCATCCTGATACAGGTGCTGGATATGGCATTAAAGATCAGAAGGATCAACATGGTAGACCCTTAGCATTCTCTCAACCTGCTGCAGAACAGTTTTCAAAGGCGTTGAAGGCATCTGGAATGAACCTTGGTGCATATATTGCTAGCACGGGTCGTAGTGATGCAAAGAACAACTCTGTTGGTGGGCATCCAAACTCACACCATATGTACGGTGAAGCGATTGACATGAATGGTGATGGATATGAATGGATGAGAAAGAATGGTAGAAAATATGGTTGGCAATACGTTTATAATCATGGACCTGGTAGTGCTCACTTCAAATATGTTGGACCTGGTGCAGGATCTACTCCTAAGTTAGGTCCTCCTGGCAGCAAACCCAAGGTGGGACCTTCCGTTACTGGGGGTTCTAGTAGTTCGACTTCGACGCATAGTAGTGGAACCAGAGAAGAAACTAAGAAGAAAGTTACACTTGCTGATATCATGGGTGGTAGATCTACTAGACCTGTACCACTAGAAAACTATAAAACTGGCAATATAAATTCACATGGTGATGATCGATCTATAATCGATGCGACTGAAGAGAGAAATCGAGCAAGACAAAGGGCAAATGAGAAGTCTTCCCAAATGGTTCAAGCGGCAATAGAAGCAGTTAGACTATCTAATACTAGTAATGAACAAATTATTATGCAGGCACAACAAGGTATTCAGCAAGCAATGCAAATGGGTAGTGGCAATAGTCAACCTCAGGTCATTCCAACAGGCGGTGGAAGAGGTGCTGTTAAGTCTGTTGTTTCCTCACTCGCGTCCTCTATTAATCCTCTGAGAGGTATCTTCAAATGA
Genbank protein accession
AOO01387.1
[NCBI]
Genbank nucleotide accession
KX349244
[NCBI]
CDS location
range 9135 -> 11666
strand +
strand +
CDS
ATGGCAACGGTACAAAAAAGTACTAAAATCAATTTCTACAAGTTTGTAGCAACAAAAGATGTCTCACCATCTGCGAAGGGTGCTGATGAGGGAACGGTTGCAACTGTAAAAGTCCTTAATAAAAATACTGAAGCACTTAATAATATTGGTAGTGTCTTGAATGGCATTGCTAAGATCGCTCAAGACCTTAAAAAGGTCATGTTGATGCAACTTGATGCACAACAAAGAAAGAATGTTGGAACTTTTGACGCACAATATACCAAAACCAAAAAAGAGAAAAAGAGTGGATTTGTTGCTGGTATTGTAGGCAAATCAGTATCCTTCTTGGAGGGATTATTAACGACATTTTCCAACCTCTTTAAGTTATTAGTTGTTATACCTGCTCTCAAATGGTTATCAGATCCTAAGAATCAAGAAACAATAGCAACAGCACTGAGAATTATTAGATCAGTAGTTACATTCATCTTTGATTGGGCAAAGTTTGGTATTACCAACACCATTGATGGACTGTATAATCTTCTAAGGGACGATGCGTCTTGGTGGGATAGGATAGTTGGTCTCGGTCAAGCAATTGCAGGTATTGGGGCGGTTGTCCTGGGTATCAGATATCTTTCAAATCCTCTGAAACTTGTAAAGGACATTACCAACGGCGTTAGAGCACTCGTCAGATTCGTCCTTGGTGGCGGTAGAGGGGGTGGAGGAGGTGGCAAGAAACCTCGCGGACGCTTTGGTGGAGCAATGCGCTTGCTCGGTGGCGCAGCGATCGTTGGTGGTTCTGCCTATGCCATCAGTCAGATGAACCAACCTGAGGAGAAATCCCAAGGAGGTTCAGTAAAGATTCTTCCAAGTAGATCACAAGGAGGTTGGATTAGTGGACCACAATCGGGATATAAGGTATCTTTGGACGGGGGGAGATCAACCTCGTTCATCGGACATGGAACTGAGTATGTTGCTAGAAAGGCAAATGGGGGAGCTTTCGTCGTTCCTTTTAATACTCCTGGAACAAAAACACAACCCCATCTGACTCAGAAGAGACTTGGTGAGGCAAAGAGACTTGGATATTTCTCTAATGGTGGAGAGATAACTGGTAATAATGAACAGAAATGGTCCAAGGTCATGGGTCTTGGCAAAGCAGCAGGTGCTAAGTATCCAGAACTTGTCGCTGCACAGTTTGCTCTAGAAAGTGCATGGGGCACAGCACTATCTGCTAAAAACAACTTCTTCGGTATCAAAGCAACTGGTAATGAATCCGCCACAGTTTCCAACACAAGAGAGGTTATTAATGGTCGAAGTGTTTATGTTGATGCAAGATTTAAGAACTTTAGCACACCTTTTGATGCTATCAATCATCTAGTAACACAATGGTATAAAGATTATAGGGGGTATACTGGTGTTAACAATGCACCAGACGCTTTTGCTGCTGCGTCTAAGTTGCAGAAAGAAGGTTATGCTACAGATCCTCAATATACCAAGCATCTATCCAGATTGATGCAACAGTATTCTAATCTAAGAGGGGCACAGGTTACGGCACCAACACAACAACCTAGTGGTAATCCCTTTATGAACTTCTTTGGGAATCTAGCGAACTCTGTACTGGGTATCGGTGGCGCAAACGCAGCTGAGCATGGAGGTAATGATAATCAAGGTAGAGCACCTGAATATACTGGCGGTACGGCTGATGTGGTAGGTATTTCTCATCCTGATACAGGTGCTGGATATGGCATTAAAGATCAGAAGGATCAACATGGTAGACCCTTAGCATTCTCTCAACCTGCTGCAGAACAGTTTTCAAAGGCGTTGAAGGCATCTGGAATGAACCTTGGTGCATATATTGCTAGCACGGGTCGTAGTGATGCAAAGAACAACTCTGTTGGTGGGCATCCAAACTCACACCATATGTACGGTGAAGCGATTGACATGAATGGTGATGGATATGAATGGATGAGAAAGAATGGTAGAAAATATGGTTGGCAATACGTTTATAATCATGGACCTGGTAGTGCTCACTTCAAATATGTTGGACCTGGTGCAGGATCTACTCCTAAGTTAGGTCCTCCTGGCAGCAAACCCAAGGTGGGACCTTCCGTTACTGGGGGTTCTAGTAGTTCGACTTCGACGCATAGTAGTGGAACCAGAGAAGAAACTAAGAAGAAAGTTACACTTGCTGATATCATGGGTGGTAGATCTACTAGACCTGTACCACTAGAAAACTATAAAACTGGCAATATAAATTCACATGGTGATGATCGATCTATAATCGATGCGACTGAAGAGAGAAATCGAGCAAGACAAAGGGCAAATGAGAAGTCTTCCCAAATGGTTCAAGCGGCAATAGAAGCAGTTAGACTATCTAATACTAGTAATGAACAAATTATTATGCAGGCACAACAAGGTATTCAGCAAGCAATGCAAATGGGTAGTGGCAATAGTCAACCTCAGGTCATTCCAACAGGCGGTGGAAGAGGTGCTGTTAAGTCTGTTGTTTCCTCACTCGCGTCCTCTATTAATCCTCTGAGAGGTATCTTCAAATGA
Gene Ontology
Description | Category | Evidence (source) | |
---|---|---|---|
GO:0004040 | amidase activity | Molecular function | Inferred from Electronic Annotation (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available.