Protein
- Protein accession
- A0A8S5U9N3 [UniProt]
- Representative
- 3gW4A
- Source
- UniProt (cluster: phalp2_37519)
- Protein name
- lysozyme
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 97% (predicted by ML model) - Protein sequence
-
MAKQLCIDVSEHNGTIDWAAVKKAGINYAIIRDGYGTSHVDNCFVRNMEGAIAHGVHIGIYHFSYALSAAGAKAEAEYVLKLIQPYKDKIVLPVFFDFEYDTVDYAKKQGVTLGKEAFNAHTVAFCDTIQAAGYRAGVYYNLDYLHRYVDIDRIGKYVQWYAQYSSTASATTWDLWQYSSSYTITGCAGKFDVSVLKNSGSVTNSRKYKLGWNKDDKGWWYADTESTYYKSRWAKINDKWYSFDKEGYMLSNTWQVEAGGDTYYLGAEGDMQTNMVVGLGADGKLQPIEPWYHTLGEVPQGYRKELDKLVDAGKLKGKSGSGDDMVLDMPLSALRVLIILSR
- Physico‐chemical
properties -
protein length: 342 AA molecular weight: 38453,9 Da isoelectric point: 6,25 hydropathy: -0,35
Representative Protein Details
- Accession
- 3gW4A
- Protein name
- 3gW4A
- Sequence length
- 385 AA
- Molecular weight
- 44401,57940 Da
- Isoelectric point
- 9,63617
- Sequence
-
MTVTYRQYDARWGNIVYTKGGSTLAHAGCGDTSVAMLATNNPKYANVTPKDVVPFMKKHGYNYKGTTWDGITKGLEHYGFVTARSSNAETIFKWLDGGFFDRGIINFEAGKQGGVTWTSGGHYVVFSAYEKRGNKHWFYTRDPGMRKNDGWHCFEKTMSELIRMIFVCYLPSEHRKPEVIQPKKKTNIKCIDVSEAQGKIDWKKVKADGIKYAIIRAGYGWTHVDKCFKQNIKGAHEAGLKIGIYWFGYAYKKEHAIAEAEGCLRTILPYRKWIDLPVFYDWEYDSMKYAKKHGVKPNKSLITTFNKLFCSRVKAAGFKVGVYYNLDYKKNYLNLKKLDGFYKWLAYYTKTKQKNIAVQQYTNKGKVNGINGRVDRDWIVNEKLL
Other Proteins in cluster: phalp2_37519
| Total (incl. this protein): 4 | Avg length: 389,8 | Avg pI: 7,97 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 3gW4A | 385 | 9,63617 |
| 3iwKw | 489 | 9,66164 |
| A0A8S5NDY4 | 343 | 6,32637 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_819
41pAM
|
11 | 28,3% | 536 | 3.909E-112 |
| 2 |
phalp2_10790
3TNSu
|
2 | 28,0% | 406 | 7.695E-46 |
| 3 |
phalp2_24349
3WAOt
|
2 | 23,3% | 424 | 3.886E-33 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Siphoviridae sp. ctKNZ79 [NCBI] |
2825440 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
BK016045
[NCBI]
CDS location
range 10470 -> 11498
strand -
strand -
CDS
ATGGCTAAACAGCTGTGTATTGACGTGTCCGAGCACAACGGCACAATCGACTGGGCCGCAGTCAAAAAGGCTGGGATTAACTATGCCATTATCCGGGACGGCTACGGAACGAGCCACGTTGACAACTGCTTTGTGCGCAATATGGAGGGCGCGATTGCCCATGGTGTCCATATTGGTATCTACCATTTTTCTTACGCTCTGAGCGCCGCCGGAGCCAAAGCGGAAGCAGAATACGTCCTCAAACTGATCCAGCCGTACAAGGACAAGATCGTATTGCCGGTATTTTTTGATTTTGAGTATGATACCGTGGACTACGCAAAAAAGCAGGGCGTGACCCTGGGCAAGGAGGCGTTCAACGCCCACACCGTGGCGTTTTGCGATACCATTCAGGCGGCGGGGTACCGGGCTGGTGTCTACTACAACCTGGACTATCTGCACCGTTACGTGGACATTGACCGGATCGGCAAATACGTGCAGTGGTACGCCCAGTATTCGTCCACCGCTTCGGCGACCACCTGGGATTTGTGGCAGTACTCCAGCAGCTACACGATTACAGGTTGCGCCGGTAAGTTTGATGTCAGCGTGCTTAAAAACTCCGGCAGCGTCACCAACAGCCGGAAGTACAAGCTGGGATGGAACAAGGATGATAAGGGCTGGTGGTATGCGGACACAGAATCCACCTATTACAAGTCCAGATGGGCAAAGATCAACGACAAGTGGTACAGCTTTGACAAGGAGGGCTATATGTTGAGCAATACGTGGCAGGTAGAGGCCGGGGGTGACACCTACTATCTCGGCGCGGAAGGAGATATGCAGACCAATATGGTTGTAGGCCTTGGCGCAGACGGGAAACTCCAGCCCATTGAGCCGTGGTATCACACTCTGGGCGAGGTGCCGCAGGGATACCGCAAGGAGCTGGACAAGCTGGTGGACGCCGGGAAGCTCAAGGGCAAGAGTGGAAGCGGAGATGATATGGTGCTGGATATGCCGCTGAGCGCACTGAGGGTGCTGATTATCCTGAGCCGATAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0003796 | lysozyme activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
| GO:0016052 | carbohydrate catabolic process | biological process | None (UniProt) |
| GO:0016998 | cell wall macromolecule catabolic process | biological process | None (UniProt) |
Enzymatic activity
| EC Number | Entry Name | Reaction Catalyzed | Classification | Evidence | Source |
|---|---|---|---|---|---|
| 3.2.1.17 | None | Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. |
match to sequence model evidence used in automatic assertion
ECO:ECO:0000256 |
ARBA:ARBA00000632 |
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(3gW4A)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50