Protein
- Protein accession
- A0A1W5S489 [UniProt]
- Representative
- 4NArk
- Source
- UniProt (cluster: phalp2_3616)
- Protein name
- Lysozyme
- Lysin probability
- 98%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MSDYIQMIAEFEEDGEFRPQAYKDGTQYSIGFGTRTTDPNEMEGTGLINEEEAYRRLEEWTSKDRVFIQEVGKREGLEWSDNELDALTSFTYNLGKDGLTQLVSGRDKATIADKMNEYVNFQGKPLNGLVRRRKAESELFSTPVTQAVEVPEPSQPVDRSYLDYTEAGVPASILRADGSVSEGEDPSISELWGASTYRNWVWNSLDRASEEVANPEPNFLLSQEQLEELSGKTESGYSKYNESELAFLSQSVSPENLSHRIDRIEADKAAKQTLEQAGMEGVGMEMLAGVTDPLLLPTMFLGGVGVAGKFKTAKAVATSMLSGASQNVAAEYLLKQGDTQRTDEDLMIAAAGGAIFSGSITAGAIGIKTGLNTRIVRANQIEAENKQALTEASVGLEYKKADSALATEVPNPVQRKSFMTEKEIIAKLQEEVGTRQDTISSKNIKKAKSGFNEYRKRQLAKIEKLKETPFKKPSSRNKQIKQLQVSIEEAQAARDNLIAENNAKLSTNSKLDQLQNGKIPDDLLDRYKEMKMESGEFDADQPTRDTVSLPVRKEQVEDPEEGVEVKDDVQSMGAMKVSSEFKDIATYDNLLPDTEVEEISNQVYDAATLGFNTPRVSRMASKAAGFRSTSTIVDTAPDNATRGLGIQTLKNGTRTIEGHQSIEEVADTLFHRNVPDYSVYEDAFDQYAKSKKVGILSKDRVRLKEEFDKEIVLAQARGEVEPNRASSEDSPVIKAAKARSRIYERSLKLNKDGRTIGFESVEHSNSYHSVVFDSSKILTAKSGSGEDAIAHADRVINTIAKAYQNGKIKLSRENAVRLAETQVARSFAYKHGTFNKVMSDNEYKLLDKELEANGVDITVREELKQNLFNKEDLDNMSPRAMFSLSPDLTASTGGVRMVDLIDTSMNRVMKYASDAAAHRGLSLQGYRSRHQWMRAVEEARKQSMNELRKELDNPNKKVAENAARELAKVERGDYADLLIDSMSLIFKEPLQSGSDAVEDLSKILRKQTSITRLRSTGLMSIPEYAIAMARNGGLSVISQLPSARRFDLRTTSIQKDEFMKAFSDSISATGHQEYLFGAQFYNNSDFDDATKTRLGNILNKVQGKMMNVTMTVNAFRTFQHGGEEMVARSIIKNLKDLSTSGKMTSNIKSSLVKVGGLSEDQVNQMITHFNNNPELDIFDSIRSMEPELNIAVSTAVRNTIGSSFMRMGVGETVPYANREMGKVLTSLLNFTIGSWEKMVVRGVKSDGLGLMASMFAGQVALAVMSQYAYVYSRAAGKEGEDRRKFIEKSLEDEGMFWGVTNRVGFLAAPMLPMQMLASARLLPEEITASPTKAGVNSMGIPSVDMGADYLKAIGSSGDLISSQFTDEYMGNKDREKAYNNIKRVLPWVDSPVYNAATGILD
- Physico‐chemical
properties -
protein length: 1401 AA molecular weight: 155039,6 Da isoelectric point: 5,33 hydropathy: -0,51
Representative Protein Details
- Accession
- 4NArk
- Protein name
- 4NArk
- Sequence length
- 231 AA
- Molecular weight
- 26815,47230 Da
- Isoelectric point
- 9,78019
- Sequence
-
MSQLPTKFLFDEGVKEILMSLFSLASTAYEVDYVAKQLKSRPEPIEQKIDALKIADAKNLSPEFNSAVDKLLVYYVNKGLPKLHKYKQVRNYSGISDKLFDFIKYHEKFSPYPYADYKQTSIGYGTKALPNDRKISKLEATRRLHREVQKHRSEVIKDSKRWGYKWTPHQIDALTSFRYNVGNLKYLTSNGNRTNRQISEKILEYDKAGGKSLPGLTKRRKAESQMFLFGK
Other Proteins in cluster: phalp2_3616
| Total (incl. this protein): 20 | Avg length: 324,5 | Avg pI: 9,07 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 4NArk | 231 | 9,78019 |
| 10Mxn | 179 | 9,31699 |
| 17fUo | 239 | 9,76317 |
| 1SKXS | 175 | 9,20320 |
| 1sAlh | 175 | 9,69658 |
| 1yBrK | 225 | 9,56248 |
| 1ywQH | 223 | 9,61509 |
| 39b0H | 174 | 9,78238 |
| 3RbZR | 179 | 9,29939 |
| 3VwJZ | 175 | 9,65545 |
| 4VgnU | 181 | 8,62963 |
| 6BSSe | 226 | 9,67885 |
| 6M66I | 298 | 9,50968 |
| 8AQNk | 239 | 9,44276 |
| 8BxZv | 239 | 9,35264 |
| 8Co08 | 179 | 9,39590 |
| jxkm | 175 | 9,65577 |
| jzVQ | 175 | 9,45347 |
| A0A1W5S4D1 | 1401 | 5,33169 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_9007
4WhBz
|
18 | 43,3% | 157 | 2.555E-48 |
| 2 |
phalp2_31936
7CFw1
|
8 | 41,4% | 181 | 2.482E-42 |
| 3 |
phalp2_16162
4wmxe
|
2 | 41,1% | 153 | 1.640E-33 |
| 4 |
phalp2_23797
1duFR
|
19 | 34,1% | 158 | 2.447E-24 |
| 5 |
phalp2_27061
2hzSW
|
1 | 30,4% | 151 | 3.466E-20 |
| 6 |
phalp2_31073
1Mrll
|
7 | 29,2% | 140 | 2.486E-16 |
| 7 |
phalp2_35318
7dmzX
|
68 | 33,5% | 164 | 2.486E-16 |
| 8 |
phalp2_39279
49SM8
|
4 | 29,0% | 193 | 3.217E-14 |
| 9 |
phalp2_17663
6R9yG
|
6 | 28,9% | 183 | 4.355E-14 |
| 10 |
phalp2_10624
2nrBp
|
12 | 28,5% | 154 | 1.080E-13 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Marinomonas phage CPP1m [NCBI] |
1965370 | Autographiviridae > Murciavirus > Murciavirus CPP1m |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
KY626176
[NCBI]
CDS location
range 31979 -> 36184
strand +
strand +
CDS
ATGTCTGATTACATACAAATGATAGCAGAGTTCGAAGAGGATGGAGAGTTCCGTCCTCAAGCCTACAAGGATGGCACTCAGTACTCTATCGGATTTGGGACACGAACTACTGACCCTAACGAAATGGAGGGTACGGGTCTCATTAATGAGGAAGAAGCTTACCGTAGGTTAGAGGAGTGGACTTCAAAAGACAGAGTATTTATACAAGAGGTAGGTAAGCGAGAAGGTCTTGAATGGTCTGATAACGAACTAGATGCACTGACAAGCTTTACATATAACTTAGGAAAAGATGGGCTTACTCAGCTAGTATCAGGAAGAGATAAGGCCACTATTGCAGATAAGATGAATGAGTACGTTAACTTCCAAGGAAAACCTCTCAATGGTCTCGTAAGGCGTAGGAAGGCAGAAAGCGAACTCTTCAGCACTCCAGTCACTCAGGCGGTAGAAGTCCCTGAGCCTTCCCAGCCAGTAGACCGCTCTTATCTAGACTACACAGAAGCAGGTGTTCCTGCTTCTATTCTTAGGGCGGACGGCTCAGTGTCGGAAGGGGAAGACCCTAGCATATCTGAGCTATGGGGGGCATCTACATATCGTAACTGGGTATGGAACTCTCTTGATAGGGCAAGTGAAGAGGTTGCTAACCCAGAGCCTAACTTCCTTCTCAGCCAAGAGCAGCTTGAAGAACTGAGCGGTAAAACGGAATCAGGGTACTCTAAGTACAACGAAAGTGAACTGGCCTTTCTCTCTCAGTCAGTTTCACCTGAGAATCTTTCACATAGGATTGATCGAATCGAGGCGGATAAGGCCGCAAAACAAACTCTAGAACAGGCAGGTATGGAAGGAGTGGGGATGGAGATGCTGGCTGGCGTCACCGATCCTCTCTTACTTCCAACTATGTTTCTCGGTGGTGTTGGGGTTGCTGGTAAGTTTAAAACTGCTAAAGCGGTTGCCACGTCTATGTTATCAGGTGCTAGTCAGAACGTTGCGGCGGAGTACCTTCTTAAGCAAGGTGATACTCAAAGGACTGATGAGGATCTTATGATAGCCGCCGCTGGTGGTGCAATCTTCTCAGGCTCTATAACTGCTGGCGCTATAGGTATTAAGACAGGATTAAATACTCGCATTGTAAGAGCTAATCAGATAGAGGCGGAGAACAAGCAAGCTCTTACTGAGGCGTCAGTAGGATTAGAGTACAAGAAGGCTGATTCTGCACTAGCTACGGAAGTCCCAAACCCTGTACAGCGTAAAAGCTTTATGACAGAGAAAGAGATTATCGCAAAGTTGCAAGAGGAGGTTGGTACTCGCCAAGATACCATCTCCTCTAAGAATATCAAGAAAGCAAAATCAGGGTTCAATGAGTACCGCAAGAGACAGTTAGCTAAGATCGAGAAGCTAAAGGAGACTCCTTTTAAGAAGCCGTCCTCTCGTAACAAGCAAATTAAACAGTTGCAGGTTTCGATAGAAGAGGCACAGGCCGCAAGGGATAATCTAATAGCAGAGAATAACGCTAAGCTTTCTACGAACTCTAAGCTAGACCAGTTGCAAAATGGGAAGATACCTGATGACCTGCTTGATAGGTATAAGGAGATGAAGATGGAATCTGGGGAGTTCGATGCAGATCAGCCCACCAGAGATACGGTATCTCTCCCTGTAAGGAAGGAGCAGGTAGAAGACCCAGAAGAGGGAGTTGAAGTTAAGGACGATGTTCAGTCTATGGGTGCTATGAAGGTAAGCTCTGAGTTTAAGGATATTGCAACTTACGATAACCTACTGCCTGATACAGAGGTTGAAGAGATCTCTAATCAAGTCTATGATGCAGCAACATTAGGGTTTAATACTCCACGGGTTTCAAGGATGGCATCTAAGGCAGCAGGGTTCAGGAGTACGAGTACCATAGTAGACACTGCACCAGATAATGCGACTAGGGGTTTAGGTATTCAGACATTGAAGAATGGTACAAGGACTATCGAGGGTCATCAGTCTATTGAGGAAGTGGCTGATACCTTATTCCACAGGAACGTACCTGATTACTCTGTTTATGAGGACGCATTTGACCAGTACGCTAAGTCTAAGAAGGTCGGTATACTTTCTAAGGACAGGGTAAGACTTAAAGAGGAGTTTGATAAGGAGATCGTATTAGCGCAGGCTAGAGGTGAGGTAGAGCCAAACAGGGCTTCATCAGAAGATAGTCCAGTAATAAAGGCGGCTAAAGCTAGGTCAAGGATCTACGAGAGAAGTCTTAAACTCAACAAGGATGGGCGGACTATTGGATTCGAAAGTGTAGAGCATTCTAACTCTTACCACTCAGTAGTATTTGACTCCTCAAAGATACTAACAGCCAAGTCTGGTTCAGGAGAGGATGCAATCGCACACGCGGATAGGGTTATTAACACGATAGCAAAGGCTTATCAGAATGGTAAGATAAAGCTAAGCCGTGAGAATGCAGTTAGACTTGCAGAGACACAAGTAGCTAGGTCTTTCGCATACAAGCACGGAACCTTTAATAAGGTCATGTCTGATAATGAGTACAAGCTACTAGACAAGGAATTAGAGGCTAATGGTGTAGATATTACAGTGAGGGAGGAGCTTAAGCAGAATCTCTTTAATAAGGAAGACTTAGATAATATGTCACCTCGTGCGATGTTCTCACTATCTCCAGACCTTACGGCATCTACTGGCGGTGTACGTATGGTGGACTTAATTGACACAAGTATGAATAGGGTTATGAAGTACGCGAGTGATGCAGCGGCTCATAGAGGCCTGTCCCTTCAGGGCTATAGGTCACGTCATCAATGGATGCGAGCAGTTGAAGAGGCTCGTAAGCAGTCTATGAATGAGCTTAGGAAAGAGCTAGATAATCCTAACAAGAAAGTGGCGGAGAATGCAGCTAGGGAGCTTGCAAAAGTCGAAAGAGGCGATTATGCAGATCTTCTAATAGACTCTATGAGCTTAATCTTTAAAGAGCCATTGCAATCTGGTTCGGATGCAGTAGAGGACTTATCGAAGATTCTAAGAAAGCAGACATCTATTACTCGACTGAGATCGACTGGGCTTATGTCTATCCCTGAGTATGCTATAGCTATGGCACGTAATGGAGGTCTTAGTGTTATCAGTCAACTGCCTAGCGCCAGAAGGTTCGACCTTCGAACTACCAGCATCCAGAAAGATGAGTTTATGAAAGCCTTCTCGGATTCTATATCGGCTACAGGTCACCAAGAGTATCTATTCGGTGCTCAGTTTTATAACAACTCAGACTTTGATGATGCGACTAAGACGAGACTCGGTAATATCCTCAATAAGGTGCAGGGTAAGATGATGAACGTAACCATGACGGTTAATGCCTTTAGGACATTCCAACATGGAGGTGAGGAGATGGTTGCTAGGAGTATTATAAAGAACCTCAAGGATCTATCTACTTCTGGCAAGATGACTTCAAACATAAAAAGCTCGCTGGTAAAAGTTGGAGGCTTATCAGAGGATCAAGTTAATCAGATGATAACTCACTTTAATAATAACCCTGAGCTTGACATCTTCGACAGTATTCGGTCTATGGAGCCTGAGCTAAACATAGCGGTATCCACAGCGGTAAGGAATACTATCGGTAGCTCTTTTATGAGGATGGGTGTTGGTGAGACAGTGCCTTACGCTAATAGAGAGATGGGTAAAGTTCTGACTTCGTTGCTTAACTTTACTATAGGCTCTTGGGAGAAAATGGTTGTCCGTGGTGTTAAGTCTGACGGGTTAGGACTAATGGCCTCTATGTTTGCTGGCCAAGTTGCATTGGCTGTAATGTCCCAATATGCCTATGTCTATTCTCGGGCAGCAGGTAAAGAGGGTGAAGACAGGAGGAAGTTCATAGAGAAGAGCCTTGAGGATGAAGGGATGTTCTGGGGAGTAACAAATAGGGTAGGCTTCTTAGCAGCACCTATGCTCCCTATGCAGATGCTTGCGAGTGCGAGACTCCTCCCAGAGGAGATCACTGCATCACCAACAAAGGCAGGAGTTAACAGCATGGGTATACCTTCTGTTGATATGGGCGCAGATTACCTTAAAGCTATCGGTAGCTCTGGTGACCTTATATCCTCTCAGTTCACGGATGAGTATATGGGCAATAAGGACAGAGAGAAGGCTTACAACAATATCAAGAGAGTATTGCCTTGGGTAGACTCTCCTGTGTACAACGCAGCTACAGGTATACTGGATTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0003796 | lysozyme activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
| GO:0016998 | cell wall macromolecule catabolic process | biological process | None (UniProt) |
| GO:0030430 | host cell cytoplasm | cellular component | None (UniProt) |
| GO:0031640 | killing of cells of another organism | biological process | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
Enzymatic activity
| EC Number | Entry Name | Reaction Catalyzed | Classification | Evidence | Source |
|---|---|---|---|---|---|
| 3.2.1.17 | None | Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. |
match to sequence model evidence used in automatic assertion
ECO:ECO:0000256 |
RuleBase:RU003788 |
Tertiary structure
PDB ID
upi000a0813a0_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(4NArk)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50