Protein

UniProt accession
Q2WC39 [UniProt]
Protein name
Lysozyme
PhaLP type
VAL

evidence: ML prediction

probability: 99 % (predicted by ML model)

Protein sequence
MAIERQAVQGLPQVQATSPNVMTFAPQQVGGVEAGVASTSGSRFIEDLIRAASSVADVTTGILNQKIEEDKVVQMERAYNGLMPSEDATRGGARANMLVKAQLLANDEAARMKDMATRFQGTDDEWTQLMVDSRNEMQNKLFQQYPELQGDKDTMRMVTNVFQEQQPQIWATRTQHKLDREQADREDTFDGRVASTWDPNIDPEASGYALQERIREGLTQGLLPEQMHKKLVQRAISLAQGGDVSMAEALKYVKDDKGVSVYAKNPQLITAITSGNAVWARNNVADVTRMSFEVKESYLAGDLTDEELLERAQHINNLTGNSVFSNPELEALMRQRAKQNAELGAMQDMRRELYSDRLTGFQGKTDKEKKAYIDVIKQDSQLYADQQIKQRGLDPYSQEAEAIRGAVEVQRLQFMNSKGLVDDTFESRIKAMESMLSPEHFAKGEPQELMTIRQLWEQLPEESRGVFGDTVNGYMDNYNTALQMGETPLQAARFAREAQQKFSRTEKETKKFNSAIGDALDEVSGAGWFDGKTEVSDLGKAIAEEELRAKANMLWSSGMRNMDSIKKALITWGNKRYTQSEDAKTSGGYFIKGDYTSASDMLMSVGKGVNPTDVPLALGRYVETQMPELKKELQEWETKDDVYIDYNEQKGTFVIRAGAAGRPLSGVIPVTSLDTTSLLDSAYQKKVEERDKGEYVHPYRTDIGAQEPMPAKPTAKDIGKLGLANFLMSSAFASGENLPSNFEINYRGNMQQFYDKLAMDENKDKVGFNKATGTFTPYKDAHGESIGYGHFLTEEEKRDGYIKIGDELVPYRGSMSQLTESKARALMEQDARKHVPPTRDWKIPFDQMHPAQQRGLMDLTYNLGKGGIQNSPRALAAFKAGKLTEGFIEMLGTASSEGKRIPGLLKRRAEAYNMAAAGGVPKITEVETREDGSMWVKFGGPMPAGSVSAWTHKRIGADGWYQVYEAAPTKLAKDSKVGKVKL
Physico‐chemical
properties
protein length:982 AA
molecular weight:109363,00000 Da
isoelectric point:5,46009
aromaticity:0,07536
hydropathy:-0,64348

Domains

Domains [InterPro]
Protein sequence: Q2WC39
1 982
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage K1E (Bacteriophage K1E)
[NCBI]
344022 Autographiviridae > Vectrevirus >
Host Escherichia coli
[NCBI]
562 Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
CAJ29444.1 [NCBI]
Genbank nucleotide accession
AM084415 [NCBI]
CDS location
range 29620 -> 32568
strand +
CDS
ATGGCAATTGAGCGACAAGCAGTACAAGGTCTGCCACAAGTGCAGGCCACTTCTCCTAATGTCATGACCTTTGCACCTCAACAAGTGGGAGGTGTGGAGGCTGGCGTGGCTTCTACCTCCGGTAGTAGGTTTATCGAAGACCTTATTCGTGCAGCCAGCAGTGTGGCTGATGTTACCACTGGTATCCTTAATCAGAAGATTGAGGAAGATAAGGTTGTTCAAATGGAACGGGCATATAACGGACTAATGCCTTCTGAGGATGCAACTCGTGGTGGCGCTCGTGCTAACATGCTTGTCAAAGCTCAACTGCTAGCTAATGATGAAGCAGCACGAATGAAAGACATGGCTACTCGTTTCCAAGGGACGGATGACGAGTGGACACAACTTATGGTTGACTCTCGTAATGAGATGCAGAATAAGCTGTTCCAGCAATACCCTGAGTTGCAAGGTGACAAAGATACTATGCGTATGGTCACTAATGTCTTCCAAGAACAGCAGCCTCAGATTTGGGCTACACGAACCCAGCATAAACTTGACCGTGAACAAGCAGACCGGGAGGATACCTTTGACGGGCGAGTGGCTTCTACTTGGGATCCTAATATTGACCCTGAAGCATCTGGCTATGCTTTACAGGAACGAATCCGCGAAGGTCTTACTCAAGGATTACTACCTGAACAGATGCACAAGAAGTTAGTCCAGCGAGCAATTTCACTTGCACAAGGCGGTGATGTTAGCATGGCTGAAGCCCTGAAGTATGTGAAGGACGATAAGGGTGTTTCTGTTTATGCTAAGAATCCACAGCTTATCACAGCCATCACTAGTGGTAATGCAGTTTGGGCTAGGAATAATGTAGCTGATGTAACTCGTATGTCTTTCGAAGTTAAAGAATCATACCTTGCAGGTGATTTAACTGATGAAGAATTGTTGGAACGAGCACAGCACATTAATAATCTGACAGGTAACTCTGTCTTCTCTAATCCAGAACTAGAGGCACTGATGCGCCAACGGGCTAAGCAGAATGCAGAGCTAGGTGCAATGCAGGATATGCGACGTGAGCTTTACTCCGACCGCCTGACTGGCTTCCAAGGTAAGACTGATAAAGAGAAGAAGGCTTACATTGATGTTATCAAACAGGATAGCCAACTTTATGCAGACCAGCAAATCAAACAACGTGGCTTGGACCCTTACAGTCAAGAGGCTGAAGCTATTCGTGGTGCAGTGGAAGTGCAGCGCCTGCAATTCATGAACTCCAAAGGTTTAGTGGATGATACCTTTGAATCTCGTATCAAGGCTATGGAATCCATGCTATCACCTGAGCACTTTGCTAAAGGTGAACCACAGGAGTTAATGACCATTCGTCAGTTGTGGGAGCAGTTACCTGAAGAAAGTCGAGGTGTCTTCGGTGACACTGTGAACGGTTATATGGATAACTACAATACTGCATTACAAATGGGAGAGACACCTTTGCAGGCTGCAAGGTTTGCCCGTGAAGCACAGCAGAAATTCTCTCGTACTGAGAAGGAAACCAAGAAGTTCAACTCCGCTATTGGAGATGCACTGGATGAGGTATCTGGTGCTGGCTGGTTTGATGGTAAAACCGAGGTGTCAGACTTAGGTAAAGCTATTGCGGAAGAAGAGTTACGAGCTAAGGCCAATATGTTGTGGTCTAGCGGTATGCGTAACATGGATTCTATCAAGAAGGCTTTAATCACTTGGGGCAATAAACGCTACACTCAATCAGAGGATGCAAAGACTTCCGGTGGCTATTTCATTAAAGGTGATTACACTTCTGCATCTGATATGCTTATGTCAGTTGGGAAAGGTGTAAACCCTACCGATGTCCCTCTGGCGCTTGGTAGGTATGTAGAAACACAGATGCCAGAATTGAAGAAGGAGCTTCAAGAGTGGGAAACTAAGGATGATGTGTACATTGATTACAATGAACAGAAAGGAACTTTTGTGATTCGTGCTGGTGCAGCAGGTCGCCCTCTTTCTGGAGTAATCCCTGTAACTTCTTTGGATACCACTTCACTACTGGATTCTGCCTATCAGAAGAAAGTAGAAGAACGAGATAAAGGCGAGTATGTTCATCCCTATCGTACAGATATCGGTGCACAAGAACCAATGCCAGCTAAGCCAACTGCCAAAGATATTGGTAAATTAGGATTAGCTAACTTCCTCATGTCTTCTGCTTTTGCTTCTGGTGAGAATCTACCTTCTAACTTCGAGATTAACTATCGAGGCAATATGCAACAATTCTATGACAAGCTAGCTATGGATGAGAATAAAGATAAAGTTGGCTTTAATAAGGCAACTGGAACCTTTACTCCATATAAAGACGCTCACGGTGAGTCTATCGGTTACGGTCATTTCTTAACGGAAGAAGAGAAGCGAGACGGGTATATTAAGATTGGCGATGAACTAGTTCCCTATCGAGGGTCTATGTCTCAGCTTACAGAGAGTAAGGCTCGCGCTCTTATGGAGCAAGATGCTAGGAAGCATGTGCCTCCTACTCGTGACTGGAAGATTCCGTTTGACCAGATGCATCCTGCACAGCAACGTGGCTTGATGGATTTAACCTACAATTTAGGTAAAGGTGGAATCCAGAACTCACCGCGTGCTCTTGCTGCATTCAAAGCTGGTAAGCTTACGGAAGGCTTTATCGAAATGCTGGGTACTGCATCAAGTGAAGGTAAACGTATTCCGGGCCTACTGAAGCGACGCGCTGAGGCATACAATATGGCAGCTGCTGGTGGTGTACCTAAGATCACAGAAGTGGAGACGAGGGAAGATGGCTCTATGTGGGTTAAGTTTGGTGGACCTATGCCAGCAGGCTCTGTTTCTGCGTGGACGCATAAACGTATTGGAGCCGATGGCTGGTATCAGGTTTATGAGGCTGCACCTACCAAGTTAGCTAAAGACTCTAAGGTAGGTAAAGTTAAATTGTAG

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity Molecular function Inferred from Electronic Annotation (UniProt)
GO:0009253 peptidoglycan catabolic process Biological process Inferred from Electronic Annotation (InterPro)
GO:0016998 cell wall macromolecule catabolic process Biological process Inferred from Electronic Annotation (InterPro)
GO:0031640 killing of cells of another organism Biological process Inferred from Electronic Annotation (UniProt)
GO:0042742 defense response to bacterium Biological process Inferred from Electronic Annotation (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 lysozyme
aka muramidase
D-glucosamine residues in chitodextrins
Hydrolases
Glycosylases
Glycosidases, i.e. enzymes hydrolyzing O- and S-glycosyl compounds
match to sequence model evidence used in automatic assertion
ECO:0000256
RuleBase:RU003788

Tertiary structure

No tertiary structures available.