Protein

UniProt accession
J9PKF7 [UniProt]
Protein name
Putative endo-beta-N-acetylglucosaminidase
PhaLP type
VAL (virion-associated lysin)

evidence: ML prediction

probability: 99 % (predicted by ML model)

Protein sequence
MANRESFIFDVNADISQAMSGLEKVKRAMDDIDSLRKKGEGKGRTTSNKDMLDAMRIAKQAAQEYRNLQKQLKDLDRQIKTSSNKMLNKDQSAELKKRTGLNMNTKKEYQNFIKNQQREMKRMQDSIRRAQNDMARFGQTYSKNFQTNYQQKGIMHVDTKNLGEAKKVVQGVLDETRKTSGELDDVIKKIKEAKKLDRRAESLSRRASVSNYMSHQQASNFTKDLNTSRVDYRNYKDANVTRLTQISTDITKFAQQISDIERKPNATQDDITKRSKLQENIAALDREWSARSELNKVLEETTANMERYNQSLQGVEKKPERGTWRGMAYERAPAIALAVMGAVTAAVGKLYNQGGSLSREMRGNEVYIGQQTGEAGGSWRSNIRNNAMEAGLKNRLGFTGVEMLGFQESYLTAKGFTNRKDLTSAMEGQATFSRATGIDASETKDFFNTAYRSGGLSGLQTKSFQNAFLGAIKQSGMEGREKDQLKALDGILNGMSQNRSISSQDMMRTVGLQSVLAGSGVGALQGSKGGALMASMDEGIRKGFDDPSLRVLFGQGTKFQGMEGRRALRKQMEQGVSNVDNVNTMIDAAMAQGGGSRDAQIEALTSLAHRMGIDMSDQQSEGLFKLKEQGKLTKGNIDKIMKDSAKEGAKESKKRQENYEKSSAATDSQSESVTAKQAVGINDMGEAVRKANAALGGLPAPLYGAVVAVGAFTASMLAAAVAFRGAGMIRGGLAQTYGNGGGGGGAGGGVGGTGRGAGTAAGAGAAASQGTFAEGVVPSTGGVSNGATRQYQGANNGGIFSGIKNFFDPSARAGGLPNLGTGEALAAGATVAGGAAAAKSFPNLFKGGLKDSIALKGLNKIMLPLAALSAYSTITSAPDEKKGEATGSAVGGIGGGILGGAAAGAAAGSIIPGAGTLVGGAIGLAGGIVGSIFGSSVGGGIGSWFDSDEPKDTAPAEMTSEPTVAPTNTSNITGAPSVSTMPNANGAVDVSTMPNMGGTANQITNMVDKENTNTKKQTEGKKTDNLAYERENLSLYERMLVKAEQILAQARAQNGIMGMGGAGGAGGTNGINGFTGGGSLKFLPDGQKWSNSNLTQHDLGTTDQKLTAEDLDSWINSKAPEGSMMRGMGAVFLKAGQETGLDPRYLIAHAAEESAWGTSKIARDKGNFFGIGAFDDSPYSSAFEFKDGSGSAAERGIMGGAKWIADKYYGKGRTTLDAMHKAGYATNSDWATNIASIMKGAPSGSGSGNVTATINVNVKGDEKVSDKIKSSSDMKKVGTNVGNMLGFFSREMVVV
Physico‐chemical properties
protein length: 1295 AA
molecular weight: 136381.00000 Da
isoelectric point: 9.42478
aromaticity: 0.05637
hydropathy: -0.49722
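
The record does not say how the aromaticity and hydropathy values were computed. A minimal sketch, assuming the usual definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle score, often called GRAVY), could look like the following. The function names and the choice of scale are assumptions made for illustration, not part of the record.

```python
# Kyte-Doolittle hydropathy scale.
# Assumption: the record does not name the scale it used.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Illustration only: the first 30 residues of the protein sequence above.
fragment = "MANRESFIFDVNADISQAMSGLEKVKRAMD"
print(len(fragment), round(aromaticity(fragment), 5), round(gravy(fragment), 5))
```

Running both functions on the full 1295-residue sequence should give values comparable to those listed, provided the same definitions were used.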

Domains

Domains [InterPro]
Protein sequence: J9PKF7 (residues 1-1295)

Taxonomy

       Name                            Taxonomy ID  Lineage
Phage  Bacillus phage Bastille [NCBI]  57477        Herelleviridae > Bastillevirus > Bastillevirus bastille
Host   Bacillus cereus [NCBI]          1396         Bacteria > Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus

Coding sequence (CDS)

GenBank protein accession
AEQ34224.1 [NCBI]
GenBank nucleotide accession
JF966203 [NCBI]
CDS location
range 22347 -> 26234
strand +
CDS
TTGGCTAATAGAGAATCGTTTATTTTTGATGTAAACGCCGATATATCGCAAGCCATGTCTGGTTTAGAAAAAGTTAAACGAGCTATGGACGATATTGATTCACTTCGTAAAAAAGGTGAAGGAAAAGGTCGTACCACTTCTAACAAAGACATGTTAGATGCAATGCGTATAGCAAAACAAGCAGCTCAAGAATATAGAAATCTTCAAAAGCAGCTAAAAGATTTAGATAGACAAATCAAGACATCCTCTAACAAGATGTTAAACAAAGACCAGTCTGCTGAGTTAAAGAAACGCACAGGTCTTAACATGAACACCAAGAAGGAATATCAAAACTTCATCAAGAACCAACAACGTGAAATGAAGCGTATGCAAGATAGTATCAGACGAGCTCAAAACGACATGGCGCGCTTTGGTCAAACATACTCTAAGAACTTCCAAACGAACTACCAGCAAAAAGGTATCATGCACGTTGATACGAAGAATTTAGGTGAAGCGAAGAAAGTTGTTCAAGGTGTCTTAGACGAGACACGAAAAACATCTGGGGAATTAGACGATGTTATTAAAAAGATTAAAGAAGCGAAGAAGCTGGATAGACGTGCAGAGAGTTTATCTCGTAGAGCAAGTGTGTCTAACTATATGTCACATCAACAAGCATCTAACTTCACGAAAGACTTGAACACTTCACGTGTAGACTATCGTAACTATAAAGATGCTAACGTTACTAGGTTAACACAAATCTCAACAGATATCACGAAGTTTGCTCAACAAATTTCTGATATTGAGAGGAAACCTAACGCTACTCAAGATGATATCACTAAGCGTAGTAAATTGCAAGAGAATATTGCAGCATTAGATAGAGAGTGGTCTGCTCGCTCTGAATTGAACAAAGTTCTGGAAGAAACTACAGCTAACATGGAGCGTTACAATCAATCCTTACAAGGTGTAGAAAAGAAACCAGAACGTGGTACTTGGAGAGGTATGGCTTATGAACGTGCTCCCGCTATCGCACTAGCAGTAATGGGCGCAGTAACAGCCGCCGTAGGTAAGCTTTATAATCAAGGTGGTTCGTTAAGTAGAGAGATGCGCGGTAATGAAGTTTATATCGGTCAACAAACTGGTGAAGCTGGTGGCTCTTGGAGAAGTAACATCCGTAACAACGCTATGGAAGCAGGTCTTAAAAATAGACTAGGATTCACAGGCGTAGAGATGCTAGGATTCCAAGAAAGCTATTTAACAGCTAAAGGTTTCACAAACCGCAAAGACTTGACATCAGCGATGGAAGGACAAGCTACATTTAGTCGAGCAACAGGTATCGATGCATCAGAAACGAAAGACTTCTTCAACACCGCTTACCGTAGTGGTGGTTTATCTGGTCTTCAAACTAAATCATTCCAGAATGCGTTCTTAGGGGCGATAAAACAATCTGGTATGGAAGGTCGAGAAAAAGACCAGCTTAAAGCCTTAGACGGTATCTTGAATGGTATGTCACAGAATCGTTCTATCTCTTCTCAGGACATGATGAGAACAGTAGGACTACAATCTGTTTTAGCAGGTTCTGGTGTAGGCGCACTCCAAGGTAGTAAGGGTGGCGCGTTAATGGCGAGTATGGATGAAGGTATCCGTAAAGGATTCGATGACCCATCTCTACGTGTACTATTCGGTCAAGGTACGAAGTTCCAAGGTATGGAAGGACGTCGTGCTCTCCGTAAGCAAATGGAGCAAGGAGTATCAAACGTTGACAATGTTAACACCATGATTGATGCAGCTATGGCGCAAGGTGGAGGTAGTCGAGACGCGCAAATTGAAGCTTTAACATCATTAGCTCACCGTATGGGTATCGATATGAGTGACCAGCAATCAGAAGGGTTATTCAAGCTTAAGGAACAAGGTAAGCTTACAAAAGGCAATATCGATAAAATCATGAAGGATAGCGCTAAGGAAGGTGCTAAAGAATCTAAGAAACGTCAAGAAAACTATGAGAAGTCAAGTGCAGCTACAGA
CAGTCAGAGTGAGTCAGTAACCGCTAAACAAGCTGTAGGTATCAATGATATGGGTGAAGCCGTTCGTAAAGCCAATGCAGCTCTAGGCGGATTACCAGCCCCATTATACGGAGCAGTCGTAGCCGTAGGAGCATTCACAGCTTCTATGTTAGCCGCAGCAGTAGCCTTTAGAGGTGCTGGAATGATTCGTGGTGGTTTAGCCCAAACGTACGGTAACGGAGGCGGAGGCGGAGGCGCTGGTGGCGGTGTAGGTGGTACTGGTCGTGGAGCTGGTACAGCCGCAGGTGCAGGAGCTGCCGCATCACAAGGTACATTTGCAGAAGGTGTAGTACCATCTACAGGCGGTGTATCTAACGGTGCAACCAGACAGTATCAAGGCGCTAACAACGGCGGTATCTTTAGTGGTATCAAGAACTTCTTCGACCCTAGCGCTAGAGCTGGTGGATTACCTAACTTAGGAACTGGAGAAGCGCTTGCAGCAGGAGCAACAGTTGCAGGTGGAGCGGCGGCAGCTAAAAGTTTCCCTAATCTGTTTAAAGGCGGATTAAAAGATAGCATAGCTCTTAAAGGTCTTAATAAGATTATGCTACCTCTCGCAGCACTATCCGCTTACAGTACAATCACTTCTGCACCAGATGAGAAGAAGGGAGAAGCAACAGGTAGCGCAGTAGGCGGTATCGGCGGTGGTATTCTTGGTGGAGCAGCAGCAGGGGCAGCAGCAGGTTCAATAATTCCAGGGGCTGGTACACTGGTCGGAGGAGCTATTGGTCTAGCTGGAGGGATTGTCGGAAGTATATTCGGTAGTTCTGTCGGCGGAGGTATCGGTAGTTGGTTCGATTCTGATGAACCTAAAGATACAGCTCCAGCAGAGATGACTAGTGAGCCAACAGTAGCTCCTACGAACACATCGAACATTACAGGAGCTCCATCCGTTTCAACCATGCCTAACGCTAATGGTGCAGTTGACGTATCTACAATGCCTAATATGGGCGGCACAGCTAACCAGATTACAAACATGGTAGATAAAGAAAACACAAACACGAAAAAGCAAACGGAAGGTAAGAAAACGGACAACCTTGCTTACGAACGCGAGAACTTATCCCTATATGAACGTATGTTAGTGAAGGCAGAGCAGATACTTGCTCAAGCTAGAGCGCAGAACGGTATTATGGGTATGGGTGGCGCTGGAGGCGCTGGTGGAACTAACGGTATTAACGGTTTCACTGGTGGAGGCTCTTTGAAGTTCTTACCAGATGGTCAGAAGTGGTCAAACAGCAACTTGACACAACATGACTTAGGTACTACAGACCAGAAGCTAACAGCCGAAGATTTAGATAGTTGGATTAACTCTAAAGCGCCAGAAGGCTCAATGATGCGCGGTATGGGTGCAGTATTCTTAAAAGCTGGACAAGAAACTGGACTTGACCCACGTTACTTAATCGCACATGCGGCTGAGGAGTCTGCTTGGGGTACATCTAAAATCGCTAGAGACAAGGGTAACTTCTTCGGAATCGGTGCATTCGATGATAGCCCATACTCAAGTGCCTTTGAATTCAAAGACGGTTCGGGTTCTGCTGCTGAGAGAGGTATCATGGGTGGTGCGAAATGGATTGCCGATAAGTACTACGGAAAAGGTCGAACAACTCTTGACGCGATGCACAAAGCTGGATACGCAACAAACTCTGATTGGGCTACAAACATTGCAAGCATCATGAAGGGCGCACCGTCTGGTTCGGGCTCTGGTAATGTGACTGCCACAATCAACGTCAACGTTAAAGGAGATGAAAAAGTATCTGACAAAATAAAGAGCTCTAGTGACATGAAGAAAGTCGGTACAAACGTCGGAAACATGTTAGGGTTCTTCTCTAGAGAGATGGTAGTGGTGTAA
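
The CDS coordinates, the CDS sequence and the protein length can be cross-checked against each other. The sketch below is illustrative only and its helper names are hypothetical. It assumes the range is 1-based and inclusive, that the standard codon assignments apply, and that the first codon is read as methionine even though it is TTG (an alternative start codon in the bacterial genetic code).

```python
from itertools import product

# Standard codon table, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is always read as M, which covers
    alternative start codons such as TTG.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# 26234 - 22347 + 1 = 3888 nt, i.e. 1296 codons: 1295 residues plus one stop codon.
n_nt = cds_length(22347, 26234)
print(n_nt, n_nt // 3 - 1)

# The first six codons of the CDS above give the first six residues of the protein.
print(translate("TTGGCTAATAGAGAATCG"))  # MANRES
```

The same `translate` helper could be applied to the full CDS (with the line break removed) to compare the result with the protein sequence listed at the top of the record.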

Gene Ontology

GO term     Description       Category            Evidence (source)
GO:0004040  amidase activity  Molecular function  Inferred from Electronic Annotation (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.