Protein

Protein accession: I2E8W3 [UniProt]
Representative: 72syZ
Source: UniProt (cluster: phalp2_36845)
Protein name: Lysozyme
Lysin probability: 92%
PhaLP type: VAL (probability: 99%, predicted by ML model)
Protein sequence:
MGKVQLYKFNNKNFVNNGDMILEPSKCTLKMELVTGLNEVTMEHEYDEEERWKYISRDDVIKVSTPYKKIPEQLYRIYDIEKNLDYMSVKARHIFYDLVDIYIKDLTREENIFDVRCVNCNGQQALDKILKGTEFKGHSDIEKIATSYYFRKKIVQAIGGDDKENSFLSRWGGELFLNNFDIYLNKRVGEDNNVRIAYKKNLTGLVETIDMDSLITRVIPTGYDGICISGTTPWVDSPLINNYSHVFESEQRFEDIKLKGTQNNKGENAEGFDTQEQVNEALKNKGKELISKGIDKPLVNYSVQFIPLSSTEEYKEYKSLEEILLGDTVYIEHKPLDINIKARCISFEYDCLNEEMLNCEIGNYLDTYASAQADKSVTFDTIAGSFDGDGNLGGENIIGAINAMKAPLLAQRDRAKKLDIVAWIQEVLDPNDPDFGCVQGGTKGILLSDKRLSDNSGWDFKTAITPKGIIADELIGILTTVLIRNMDKSFEIDLKKAGGALFRNNGKDAIKIENNMIKLYNWKKNGDYIGALMSLVQGDDENKPLIGLANDIDSAMSLGYAVEGKTKVPSYIAFDKYNILDDSSGKPVRIYEEVDFKGNKVYNIDIRSDNGKNNIMVGDHFINITTPDNEIVVSGSGTRIGKDKSLYYDARTGELRCNDLVLNGVIKNTSGTTVFDPNSPIGGGGVDTLGNVSKGIPSRKYFRYVKGIEGLQQYPGNIGDGQITYGYGVTKANEPTYFAKLGNPPCSEGTASKVLFELIPDKYGSLVKNQMIKDGVDLNKVPIHIFDAFVDLCYNSGYYNSRMYRAWIRGASLDSIYNDWLTYATMPGTIFENGLKRRRKEEAEMFKNANYIMSPIGILNSSGSQIGTVKGDGYFPPIESNNFKTINNEYGNGWIIPVSNGHVTATFPYYPSGAPHSGIDFGVPIGTPVRASKPGKVIKRRELTTSYGKYLFIDHGGGLITIYAHNSELLVNEGDTVKAGQVIARSGNTGNSSGPHCHWELRVNGTAQNIAPSLKVGDLV
Physico-chemical properties
protein length: 1020 AA
molecular weight: 113625,1 Da
isoelectric point: 5,51
hydropathy: -0,41
Representative Protein Details
Accession: 72syZ
Protein name: 72syZ
Sequence length: 995 AA
Molecular weight: 102339,36080 Da
Isoelectric point: 9,34381
Sequence:
MSDIEGLIRRPTEPSPSSITPLLGTNRLQSAVDHMSKSVDKVSGWLDKASDRLAKMSAPTAGMGSGTTLNGGKQNGGGSSIGGVRMLGAGASNGAASGTPHSFLGMPYQQNIPTMTQQYGAPTQTGGKSNGGAVNWPRYGAALGGAWGAATVQAGLDRTQSFVASNTSAQLMTRSVIGGDWQRLRTGVVHDNWTATSDQDAFAGVANLRASGFTYGSRQWNSQMSQAKNYSMMDPTVSGADAGARIAAPQQIATTNYLLGRGINTRGKTGMAVADQMVDKFIKSNKSWSQKDYDSLWGAQGGARLATNQIAKVDPNAAALFQDRLQGTMQAQMHGMTKEAYRSAIDSGNWDALKQYGFSETDLNKTKEKAARQRNIDDNMAGGFSTGLNNAVDAIGAFQEALEKVTDGPLGAMAGYGHGWAGSSGLLGSATHIGGSMAGGYFGGKMASSLAKNGVKGTAINAASSGAGAVAKGASGVASVAGKAGKFMKGVPLLGLGIDAATTAFQGDERDRRKQNAMSHGWGDGMATYMSWRDTFASNLTFGLYGGAPDTQDPSSGGSAGERYGMFGTGGEKGGQAVKPVGNAPITAPFGRYKKSGKAHYGIDFGVPSGTPVRANRSGKVVVAGWSNTGFGNHVRIDIGDGTIEIYGHLSQLNVRVGQTIKAGDVIGKSGSSGNSTGPHLHFEVRKGGGGTGNAVDPSSYLKGADNPAPVSADAGSGSSVGSAGKGADGLGSARSMTASPWGVGEGQIVSGALGSLVGSMRSGSSGEGDSGAADAPSSPATAASGQFRDVLARAGFKGTSLGMAYAIMMAESGGNAHAHNTNKATGDNSYGLFQINMLGGMGPERRSKFHLANNDALFDPLTNAKVAYDMSKGGSDWHDWSTYKRGDYKKYLGGQQKSYDVGSTNIDVDQVARVHKGEMILDPVTADEMRKALSSNRPTDLFGSKGGKGGGIRIDNLTIQTAHQFTDSAATDLAKQFLSIVNNHDEVSTIMGGN
Other Proteins in cluster: phalp2_36845
Total (incl. this protein): 4 Avg length: 884,0 Avg pI: 7,67

Protein ID Length (AA) pI
72syZ 995 9,34381
4MSie 994 9,54514
A0A343X839 527 6,26107
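
The cluster summary above can be cross-checked against the listed members. The following is a minimal Python sketch, assuming that the fourth member counted in "Total (incl. this protein)" is this entry itself (1020 AA, pI 5,51, taken from the physico-chemical properties). The helper name `parse_decimal_comma` and the tuple layout are illustrative and not part of PhaLP.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma (e.g. '9,34381') to float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI as printed on the page)
members = [
    ("I2E8W3", 1020, "5,51"),        # this protein (assumed to be the 4th member)
    ("72syZ", 995, "9,34381"),
    ("4MSie", 994, "9,54514"),
    ("A0A343X839", 527, "6,26107"),
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(parse_decimal_comma(pi) for _, _, pi in members) / len(members)

print(f"Avg length: {avg_length:.1f}")  # 884.0
print(f"Avg pI: {avg_pi:.2f}")          # 7.67
```

Under that assumption, both values agree with the "Avg length" and "Avg pI" figures reported for the cluster.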
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26491 (27CrY) 1 27,6% 1123 4.239E-51

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

Role Name Taxonomy ID Lineage
Phage Clostridium phage PhiS63 [NCBI] 1187894 No lineage information
Host Clostridium perfringens [NCBI] 1502 Firmicutes > Clostridia > Clostridiales > Clostridiaceae > Clostridium >

Coding sequence (CDS)

CDS Source ID
CDS Source: JQ660954 [NCBI]
CDS location: range 12154 -> 15216, strand +
CDS:
ATGGGGAAAGTACAATTATATAAATTTAATAATAAGAATTTTGTTAATAATGGAGATATGATATTAGAACCTTCTAAATGTACTTTGAAAATGGAATTAGTAACTGGATTAAATGAAGTAACAATGGAGCATGAATATGATGAAGAGGAAAGATGGAAATATATTTCTAGAGATGATGTTATAAAAGTTAGTACACCATATAAAAAAATACCAGAACAACTTTATAGAATCTACGATATAGAAAAAAACTTAGATTATATGAGTGTAAAAGCAAGGCATATATTCTATGACTTAGTTGACATATATATTAAGGATTTAACTAGAGAGGAAAATATATTTGATGTTAGATGTGTAAATTGTAATGGACAACAAGCATTAGATAAAATTTTAAAAGGGACAGAGTTTAAAGGACATTCAGATATAGAAAAGATAGCAACATCTTATTATTTTAGAAAGAAGATAGTACAAGCAATAGGTGGAGATGATAAAGAAAATTCTTTTTTAAGTAGATGGGGAGGAGAATTATTTTTAAATAATTTTGATATTTACTTAAATAAAAGAGTAGGAGAAGATAATAATGTTAGAATTGCATATAAAAAGAATTTAACAGGTTTAGTTGAAACAATAGATATGGATAGTTTAATAACTAGAGTAATACCAACTGGTTATGATGGTATCTGTATAAGTGGAACAACTCCATGGGTAGATTCTCCATTAATTAATAATTATAGTCATGTGTTTGAATCAGAGCAAAGATTTGAAGATATTAAGTTAAAAGGGACACAAAATAATAAAGGTGAAAATGCAGAGGGGTTTGATACACAAGAGCAGGTTAATGAAGCGTTAAAAAATAAAGGTAAGGAGCTTATAAGTAAAGGGATAGATAAGCCGTTAGTAAATTATAGTGTTCAATTTATTCCTTTATCAAGTACTGAAGAATATAAAGAGTATAAGTCACTAGAAGAGATTTTATTAGGAGATACAGTATATATAGAACATAAACCATTAGATATTAATATTAAAGCTAGATGTATATCATTTGAATATGATTGTTTAAATGAAGAAATGTTGAATTGTGAAATAGGAAATTATTTAGACACATATGCATCAGCACAAGCAGATAAGAGTGTAACATTTGATACTATTGCAGGAAGCTTTGATGGTGATGGGAATTTAGGTGGCGAAAATATAATTGGAGCAATAAATGCTATGAAAGCTCCATTATTAGCACAAAGAGATAGAGCAAAAAAATTAGATATAGTTGCTTGGATTCAAGAAGTTTTGGATCCAAATGATCCTGATTTTGGATGTGTTCAAGGTGGGACAAAAGGGATTCTTTTATCTGATAAAAGATTAAGCGATAATTCAGGGTGGGACTTTAAAACGGCTATAACTCCAAAAGGAATAATAGCTGACGAATTAATAGGTATTTTAACAACAGTTTTAATACGGAATATGGATAAAAGCTTTGAGATAGATTTAAAGAAAGCTGGCGGAGCTTTATTTAGAAACAATGGAAAAGATGCAATAAAGATAGAAAATAATATGATTAAGCTTTATAATTGGAAAAAGAACGGCGACTATATAGGTGCGTTAATGTCATTGGTTCAAGGAGATGATGAAAATAAGCCATTGATTGGGTTAGCAAATGATATTGATAGTGCAATGTCTCTTGGATATGCTGTTGAAGGTAAGACAAAAGTTCCTTCTTATATTGCGTTTGATAAATATAACATTTTAGATGATTCAAGTGGAAAGCCAGTAAGAATATATGAAGAAGTAGATTTTAAAGGGAACAAAGTTTATAACATAGATATTCGTTCAGACAATGGCAAGAATAATATAATGGTCGGGGATCATTTCATAAATATAACTACACCTGATAATGAAATTGTAGTTTCAGGTTCAGGAACAAGAATAGGAAAAGATAAATCTTTGTATTATGATGCTAGAACTGGAGAATTGAGATGTAATGACCTAGTATTAAATGGTGTTATTAA
AAATACAAGTGGTACTACTGTGTTTGATCCAAACTCTCCTATAGGTGGAGGCGGTGTAGATACTCTAGGAAATGTTAGTAAAGGAATACCTTCAAGAAAATACTTCAGGTATGTTAAAGGAATAGAAGGACTACAACAATATCCAGGTAATATTGGAGATGGCCAAATAACATATGGTTATGGAGTTACTAAAGCTAATGAGCCAACATACTTTGCTAAATTAGGTAATCCACCTTGTTCTGAAGGAACAGCATCTAAAGTTTTATTTGAATTAATACCAGATAAATATGGTAGCCTTGTAAAAAACCAAATGATTAAAGACGGTGTAGATCTTAATAAAGTGCCTATACATATATTTGATGCTTTTGTAGATTTATGTTATAACTCAGGATATTACAATTCTCGTATGTACAGAGCTTGGATAAGGGGAGCTAGTTTAGATAGTATCTATAACGATTGGCTAACATATGCAACTATGCCTGGAACAATTTTCGAAAACGGATTAAAACGTAGAAGAAAGGAAGAAGCTGAAATGTTTAAAAATGCTAACTACATTATGTCTCCTATTGGAATTTTAAATTCAAGTGGAAGTCAAATAGGTACAGTAAAAGGGGATGGATATTTTCCACCTATAGAAAGTAATAACTTTAAAACAATAAATAATGAGTATGGCAATGGTTGGATTATTCCAGTAAGTAATGGACATGTAACAGCAACATTCCCTTATTATCCTTCAGGGGCTCCACATTCAGGAATAGATTTTGGTGTTCCTATAGGTACACCAGTTAGAGCTTCAAAGCCAGGTAAAGTTATAAAAAGAAGAGAATTAACTACAAGCTATGGCAAATATTTATTTATAGATCATGGCGGTGGATTAATTACTATTTATGCTCATAATAGCGAGTTGCTAGTAAATGAAGGTGATACAGTAAAAGCAGGACAAGTTATAGCTAGAAGTGGTAATACTGGTAATTCATCAGGCCCACATTGTCATTGGGAACTTAGAGTTAATGGTACAGCACAAAATATAGCTCCTTCTTTAAAAGTTGGAGATTTAGTGTAA
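
The CDS coordinates and the reported protein length can be checked against each other with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual convention for GenBank feature locations) and that the CDS includes its terminal stop codon, which matches the sequence above ending in TAA. The function names are hypothetical.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def encoded_residues(nt_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given nucleotide length."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    # The stop codon does not encode a residue.
    return codons - 1 if includes_stop else codons


nt = cds_length(12154, 15216)
print(nt)                    # 3063 nucleotides
print(encoded_residues(nt))  # 1020 residues
```

The result, 1020 residues, agrees with the protein length listed in the physico-chemical properties.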

Gene Ontology

GO ID Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0004222 metalloendopeptidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion (ECO:0000256) RuleBase:RU003788

Tertiary structure

PDB ID upi00025f779e_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
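
The confidence bands in the legend can be expressed as a small lookup. This is a minimal sketch with a hypothetical function name. The legend uses strict inequalities and does not say where the exact values 90, 70 and 50 fall, so assigning each boundary value to the lower band (with 50 placed in "Low") is an assumption made here.

```python
def classify_plddt(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence band."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"      # assumption: exactly 90 falls here
    if plddt >= 50:
        return "Low"       # assumption: exactly 70 and exactly 50 fall here
    return "Very low"


for score in (95.2, 82.0, 61.5, 34.9):
    print(score, classify_plddt(score))
```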

The structures below correspond to the cluster representative (72syZ) rather than this protein.
PDB ID 72syZ
Method AlphaFold v2
Resolution 51.75
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50