Protein

Protein accession: A0A1W5S489 [UniProt]
Representative: 4NArk
Source: UniProt (cluster: phalp2_3616)
Protein name: Lysozyme
Lysin probability: 98%
PhaLP type: VAL (probability: 99%, predicted by ML model)
Protein sequence
MSDYIQMIAEFEEDGEFRPQAYKDGTQYSIGFGTRTTDPNEMEGTGLINEEEAYRRLEEWTSKDRVFIQEVGKREGLEWSDNELDALTSFTYNLGKDGLTQLVSGRDKATIADKMNEYVNFQGKPLNGLVRRRKAESELFSTPVTQAVEVPEPSQPVDRSYLDYTEAGVPASILRADGSVSEGEDPSISELWGASTYRNWVWNSLDRASEEVANPEPNFLLSQEQLEELSGKTESGYSKYNESELAFLSQSVSPENLSHRIDRIEADKAAKQTLEQAGMEGVGMEMLAGVTDPLLLPTMFLGGVGVAGKFKTAKAVATSMLSGASQNVAAEYLLKQGDTQRTDEDLMIAAAGGAIFSGSITAGAIGIKTGLNTRIVRANQIEAENKQALTEASVGLEYKKADSALATEVPNPVQRKSFMTEKEIIAKLQEEVGTRQDTISSKNIKKAKSGFNEYRKRQLAKIEKLKETPFKKPSSRNKQIKQLQVSIEEAQAARDNLIAENNAKLSTNSKLDQLQNGKIPDDLLDRYKEMKMESGEFDADQPTRDTVSLPVRKEQVEDPEEGVEVKDDVQSMGAMKVSSEFKDIATYDNLLPDTEVEEISNQVYDAATLGFNTPRVSRMASKAAGFRSTSTIVDTAPDNATRGLGIQTLKNGTRTIEGHQSIEEVADTLFHRNVPDYSVYEDAFDQYAKSKKVGILSKDRVRLKEEFDKEIVLAQARGEVEPNRASSEDSPVIKAAKARSRIYERSLKLNKDGRTIGFESVEHSNSYHSVVFDSSKILTAKSGSGEDAIAHADRVINTIAKAYQNGKIKLSRENAVRLAETQVARSFAYKHGTFNKVMSDNEYKLLDKELEANGVDITVREELKQNLFNKEDLDNMSPRAMFSLSPDLTASTGGVRMVDLIDTSMNRVMKYASDAAAHRGLSLQGYRSRHQWMRAVEEARKQSMNELRKELDNPNKKVAENAARELAKVERGDYADLLIDSMSLIFKEPLQSGSDAVEDLSKILRKQTSITRLRSTGLMSIPEYAIAMARNGGLSVISQLPSARRFDLRTTSIQKDEFMKAFSDSISATGHQEYLFGAQFYNNSDFDDATKTRLGNILNKVQGKMMNVTMTVNAFRTFQHGGEEMVARSIIKNLKDLSTSGKMTSNIKSSLVKVGGLSEDQVNQMITHFNNNPELDIFDSIRSMEPELNIAVSTAVRNTIGSSFMRMGVGETVPYANREMGKVLTSLLNFTIGSWEKMVVRGVKSDGLGLMASMFAGQVALAVMSQYAYVYSRAAGKEGEDRRKFIEKSLEDEGMFWGVTNRVGFLAAPMLPMQMLASARLLPEEITASPTKAGVNSMGIPSVDMGADYLKAIGSSGDLISSQFTDEYMGNKDREKAYNNIKRVLPWVDSPVYNAATGILD
Physico-chemical properties
protein length: 1401 AA
molecular weight: 155039,6 Da
isoelectric point: 5,33
hydropathy: -0,51
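The page does not say how these properties were computed. As a minimal sketch, assuming the hydropathy value is a GRAVY score on the Kyte-Doolittle scale and the molecular weight is a sum of average residue masses plus one water, they could be derived from a sequence as follows. The function names `gravy` and `molecular_weight` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the page does not name one).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Approximate average molecular weight in Da."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


# First 10 residues of the protein sequence listed above, used as a short demo.
prefix = "MSDYIQMIAE"
print(len(prefix), round(gravy(prefix), 2), round(molecular_weight(prefix), 1))
```

Passing the full 1401 AA sequence to the same functions should give values close to those listed, provided the assumptions about the scale and mass table hold.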
Representative Protein Details
Accession: 4NArk
Protein name: 4NArk
Sequence length: 231 AA
Molecular weight: 26815,47230 Da
Isoelectric point: 9,78019
Sequence
MSQLPTKFLFDEGVKEILMSLFSLASTAYEVDYVAKQLKSRPEPIEQKIDALKIADAKNLSPEFNSAVDKLLVYYVNKGLPKLHKYKQVRNYSGISDKLFDFIKYHEKFSPYPYADYKQTSIGYGTKALPNDRKISKLEATRRLHREVQKHRSEVIKDSKRWGYKWTPHQIDALTSFRYNVGNLKYLTSNGNRTNRQISEKILEYDKAGGKSLPGLTKRRKAESQMFLFGK
Other Proteins in cluster: phalp2_3616
Total (incl. this protein): 20; Avg length: 324,5 AA; Avg pI: 9,07

Protein ID Length (AA) pI
4NArk 231 9,78019
10Mxn 179 9,31699
17fUo 239 9,76317
1SKXS 175 9,20320
1sAlh 175 9,69658
1yBrK 225 9,56248
1ywQH 223 9,61509
39b0H 174 9,78238
3RbZR 179 9,29939
3VwJZ 175 9,65545
4VgnU 181 8,62963
6BSSe 226 9,67885
6M66I 298 9,50968
8AQNk 239 9,44276
8BxZv 239 9,35264
8Co08 179 9,39590
jxkm 175 9,65577
jzVQ 175 9,45347
A0A1W5S4D1 1401 5,33169
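The table above lists 19 proteins while the summary line reports 20 members including this protein. A minimal sketch, assuming the 20th member is this protein itself (1401 AA, pI 5,33, taken from the physico-chemical properties) and that values use decimal commas, reproduces the summary averages. The helper name `parse_decimal_comma` is illustrative.

```python
# Rows copied from the cluster table: protein ID, length (AA), pI (decimal comma).
ROWS = """\
4NArk 231 9,78019
10Mxn 179 9,31699
17fUo 239 9,76317
1SKXS 175 9,20320
1sAlh 175 9,69658
1yBrK 225 9,56248
1ywQH 223 9,61509
39b0H 174 9,78238
3RbZR 179 9,29939
3VwJZ 175 9,65545
4VgnU 181 8,62963
6BSSe 226 9,67885
6M66I 298 9,50968
8AQNk 239 9,44276
8BxZv 239 9,35264
8Co08 179 9,39590
jxkm 175 9,65577
jzVQ 175 9,45347
A0A1W5S4D1 1401 5,33169"""


def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma, e.g. '9,78019'."""
    return float(text.replace(",", "."))


members = []
for line in ROWS.splitlines():
    protein_id, length, pi = line.split()
    members.append((protein_id, int(length), parse_decimal_comma(pi)))

# Assumption: the member missing from the table is this protein itself.
members.append(("A0A1W5S489", 1401, parse_decimal_comma("5,33")))

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)
print(len(members), avg_length, round(avg_pi, 2))
```

Under that assumption the mean length comes to 324.45 AA and the mean pI to about 9.07, in line with the rounded values in the summary line.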
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_9007 (4WhBz) 18 43,3% 157 2.555E-48
2 phalp2_31936 (7CFw1) 8 41,4% 181 2.482E-42
3 phalp2_16162 (4wmxe) 2 41,1% 153 1.640E-33
4 phalp2_23797 (1duFR) 19 34,1% 158 2.447E-24
5 phalp2_27061 (2hzSW) 1 30,4% 151 3.466E-20
6 phalp2_31073 (1Mrll) 7 29,2% 140 2.486E-16
7 phalp2_35318 (7dmzX) 68 33,5% 164 2.486E-16
8 phalp2_39279 (49SM8) 4 29,0% 193 3.217E-14
9 phalp2_17663 (6R9yG) 6 28,9% 183 4.355E-14
10 phalp2_10624 (2nrBp) 12 28,5% 154 1.080E-13

Domains

Domains [InterPro]
Domain tracks: Unannotated, GH24
Pfam accessions: PF00959
Representative sequence (used for alignment): 4NArk (231 AA)
Member sequence: A0A1W5S489 (1401 AA)
[Domain architecture figure: axis 1 to 231 AA (representative sequence). Legend: EAD, CBD, Linker, Disordered, Unannotated.]
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.

Taxonomy

  Name Taxonomy ID Lineage
Phage Marinomonas phage CPP1m [NCBI] 1965370 Autographiviridae > Murciavirus > Murciavirus CPP1m
Host No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
KY626176 [NCBI]
CDS location
range 31979 -> 36184
strand +
CDS
ATGTCTGATTACATACAAATGATAGCAGAGTTCGAAGAGGATGGAGAGTTCCGTCCTCAAGCCTACAAGGATGGCACTCAGTACTCTATCGGATTTGGGACACGAACTACTGACCCTAACGAAATGGAGGGTACGGGTCTCATTAATGAGGAAGAAGCTTACCGTAGGTTAGAGGAGTGGACTTCAAAAGACAGAGTATTTATACAAGAGGTAGGTAAGCGAGAAGGTCTTGAATGGTCTGATAACGAACTAGATGCACTGACAAGCTTTACATATAACTTAGGAAAAGATGGGCTTACTCAGCTAGTATCAGGAAGAGATAAGGCCACTATTGCAGATAAGATGAATGAGTACGTTAACTTCCAAGGAAAACCTCTCAATGGTCTCGTAAGGCGTAGGAAGGCAGAAAGCGAACTCTTCAGCACTCCAGTCACTCAGGCGGTAGAAGTCCCTGAGCCTTCCCAGCCAGTAGACCGCTCTTATCTAGACTACACAGAAGCAGGTGTTCCTGCTTCTATTCTTAGGGCGGACGGCTCAGTGTCGGAAGGGGAAGACCCTAGCATATCTGAGCTATGGGGGGCATCTACATATCGTAACTGGGTATGGAACTCTCTTGATAGGGCAAGTGAAGAGGTTGCTAACCCAGAGCCTAACTTCCTTCTCAGCCAAGAGCAGCTTGAAGAACTGAGCGGTAAAACGGAATCAGGGTACTCTAAGTACAACGAAAGTGAACTGGCCTTTCTCTCTCAGTCAGTTTCACCTGAGAATCTTTCACATAGGATTGATCGAATCGAGGCGGATAAGGCCGCAAAACAAACTCTAGAACAGGCAGGTATGGAAGGAGTGGGGATGGAGATGCTGGCTGGCGTCACCGATCCTCTCTTACTTCCAACTATGTTTCTCGGTGGTGTTGGGGTTGCTGGTAAGTTTAAAACTGCTAAAGCGGTTGCCACGTCTATGTTATCAGGTGCTAGTCAGAACGTTGCGGCGGAGTACCTTCTTAAGCAAGGTGATACTCAAAGGACTGATGAGGATCTTATGATAGCCGCCGCTGGTGGTGCAATCTTCTCAGGCTCTATAACTGCTGGCGCTATAGGTATTAAGACAGGATTAAATACTCGCATTGTAAGAGCTAATCAGATAGAGGCGGAGAACAAGCAAGCTCTTACTGAGGCGTCAGTAGGATTAGAGTACAAGAAGGCTGATTCTGCACTAGCTACGGAAGTCCCAAACCCTGTACAGCGTAAAAGCTTTATGACAGAGAAAGAGATTATCGCAAAGTTGCAAGAGGAGGTTGGTACTCGCCAAGATACCATCTCCTCTAAGAATATCAAGAAAGCAAAATCAGGGTTCAATGAGTACCGCAAGAGACAGTTAGCTAAGATCGAGAAGCTAAAGGAGACTCCTTTTAAGAAGCCGTCCTCTCGTAACAAGCAAATTAAACAGTTGCAGGTTTCGATAGAAGAGGCACAGGCCGCAAGGGATAATCTAATAGCAGAGAATAACGCTAAGCTTTCTACGAACTCTAAGCTAGACCAGTTGCAAAATGGGAAGATACCTGATGACCTGCTTGATAGGTATAAGGAGATGAAGATGGAATCTGGGGAGTTCGATGCAGATCAGCCCACCAGAGATACGGTATCTCTCCCTGTAAGGAAGGAGCAGGTAGAAGACCCAGAAGAGGGAGTTGAAGTTAAGGACGATGTTCAGTCTATGGGTGCTATGAAGGTAAGCTCTGAGTTTAAGGATATTGCAACTTACGATAACCTACTGCCTGATACAGAGGTTGAAGAGATCTCTAATCAAGTCTATGATGCAGCAACATTAGGGTTTAATACTCCACGGGTTTCAAGGATGGCATCTAAGGCAGCAGGGTTCAGGAGTACGAGTACCATAGTAGACACTGCACCAGATAATGCGACTAGGGGTTTAGGTATTCAGACATTGAAGAATGGTACAAGGACTATCGAGGGTCATCAGTCTATTGAGGAAGTGGCTGA
TACCTTATTCCACAGGAACGTACCTGATTACTCTGTTTATGAGGACGCATTTGACCAGTACGCTAAGTCTAAGAAGGTCGGTATACTTTCTAAGGACAGGGTAAGACTTAAAGAGGAGTTTGATAAGGAGATCGTATTAGCGCAGGCTAGAGGTGAGGTAGAGCCAAACAGGGCTTCATCAGAAGATAGTCCAGTAATAAAGGCGGCTAAAGCTAGGTCAAGGATCTACGAGAGAAGTCTTAAACTCAACAAGGATGGGCGGACTATTGGATTCGAAAGTGTAGAGCATTCTAACTCTTACCACTCAGTAGTATTTGACTCCTCAAAGATACTAACAGCCAAGTCTGGTTCAGGAGAGGATGCAATCGCACACGCGGATAGGGTTATTAACACGATAGCAAAGGCTTATCAGAATGGTAAGATAAAGCTAAGCCGTGAGAATGCAGTTAGACTTGCAGAGACACAAGTAGCTAGGTCTTTCGCATACAAGCACGGAACCTTTAATAAGGTCATGTCTGATAATGAGTACAAGCTACTAGACAAGGAATTAGAGGCTAATGGTGTAGATATTACAGTGAGGGAGGAGCTTAAGCAGAATCTCTTTAATAAGGAAGACTTAGATAATATGTCACCTCGTGCGATGTTCTCACTATCTCCAGACCTTACGGCATCTACTGGCGGTGTACGTATGGTGGACTTAATTGACACAAGTATGAATAGGGTTATGAAGTACGCGAGTGATGCAGCGGCTCATAGAGGCCTGTCCCTTCAGGGCTATAGGTCACGTCATCAATGGATGCGAGCAGTTGAAGAGGCTCGTAAGCAGTCTATGAATGAGCTTAGGAAAGAGCTAGATAATCCTAACAAGAAAGTGGCGGAGAATGCAGCTAGGGAGCTTGCAAAAGTCGAAAGAGGCGATTATGCAGATCTTCTAATAGACTCTATGAGCTTAATCTTTAAAGAGCCATTGCAATCTGGTTCGGATGCAGTAGAGGACTTATCGAAGATTCTAAGAAAGCAGACATCTATTACTCGACTGAGATCGACTGGGCTTATGTCTATCCCTGAGTATGCTATAGCTATGGCACGTAATGGAGGTCTTAGTGTTATCAGTCAACTGCCTAGCGCCAGAAGGTTCGACCTTCGAACTACCAGCATCCAGAAAGATGAGTTTATGAAAGCCTTCTCGGATTCTATATCGGCTACAGGTCACCAAGAGTATCTATTCGGTGCTCAGTTTTATAACAACTCAGACTTTGATGATGCGACTAAGACGAGACTCGGTAATATCCTCAATAAGGTGCAGGGTAAGATGATGAACGTAACCATGACGGTTAATGCCTTTAGGACATTCCAACATGGAGGTGAGGAGATGGTTGCTAGGAGTATTATAAAGAACCTCAAGGATCTATCTACTTCTGGCAAGATGACTTCAAACATAAAAAGCTCGCTGGTAAAAGTTGGAGGCTTATCAGAGGATCAAGTTAATCAGATGATAACTCACTTTAATAATAACCCTGAGCTTGACATCTTCGACAGTATTCGGTCTATGGAGCCTGAGCTAAACATAGCGGTATCCACAGCGGTAAGGAATACTATCGGTAGCTCTTTTATGAGGATGGGTGTTGGTGAGACAGTGCCTTACGCTAATAGAGAGATGGGTAAAGTTCTGACTTCGTTGCTTAACTTTACTATAGGCTCTTGGGAGAAAATGGTTGTCCGTGGTGTTAAGTCTGACGGGTTAGGACTAATGGCCTCTATGTTTGCTGGCCAAGTTGCATTGGCTGTAATGTCCCAATATGCCTATGTCTATTCTCGGGCAGCAGGTAAAGAGGGTGAAGACAGGAGGAAGTTCATAGAGAAGAGCCTTGAGGATGAAGGGATGTTCTGGGGAGTAACAAATAGGGTAGGCTTCTTAGCAGCACCTATGCTCCCTATGCAGATGCTTGCGAGTGCGAGACTCCTCCCAGAGGAGATCACTGCATCACCAACAAAGGCAG
GAGTTAACAGCATGGGTATACCTTCTGTTGATATGGGCGCAGATTACCTTAAAGCTATCGGTAGCTCTGGTGACCTTATATCCTCTCAGTTCACGGATGAGTATATGGGCAATAAGGACAGAGAGAAGGCTTACAACAATATCAAGAGAGTATTGCCTTGGGTAGACTCTCCTGTGTACAACGCAGCTACAGGTATACTGGATTAA
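The CDS coordinates and the protein length can be cross-checked with a little arithmetic and a codon lookup. The sketch below assumes inclusive 1-based coordinates and the standard genetic code; the function names `cds_length` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


n_nt = cds_length(31979, 36184)   # range given under "CDS location"
n_codons = n_nt // 3              # includes the terminal stop codon
print(n_nt, n_codons, n_codons - 1)

# First 30 and last 15 nucleotides of the CDS shown above.
print(translate("ATGTCTGATTACATACAAATGATAGCAGAG"))
print(translate("GGTATACTGGATTAA"))
```

The range spans 4206 nt, that is 1402 codons, or 1401 residues plus a stop codon, which agrees with the listed protein length. The translated ends, `MSDYIQMIAE` and `GILD*`, match the start and end of the protein sequence.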

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion (ECO:0000256) RuleBase:RU003788

Tertiary structure

PDB ID
upi000a0813a0_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
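The confidence bands above can be expressed as a small lookup. This is a sketch only: the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption, and the function name `plddt_band` is illustrative.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: boundary values (90, 70, 50) fall into the lower band.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.0, 75.5, 60.0, 42.0):
    print(value, plddt_band(value))
```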

The structures below correspond to the cluster representative (4NArk) rather than this protein.
PDB ID
4NArk
Method AlphaFold v2
Resolution 83.91
Chain position -