Protein
- Protein accession
- A0AAF0K542 [UniProt]
- Representative
- 5EHHM
- Source
- UniProt (cluster: phalp2_30561)
- Protein name
- Endolysin
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MRLENNLGGDNLYGNFRLYIEGIEVPFSSASITNTYRGLPTAQIVLAPWRGFSEISKGYFPKVHLFYRDFNQGLAPYDMKKMVGGVATMNTLEAQSAMAEAAETFNKLERHAYKLIFGGVVTSVTDSKSLAGEGGSASLVLSCVHPYYLLNEILFKYIGTNPQETTARDSAFGIMVDGGNVGSLYTIEQALAGVRAPGDGQVTLEDGGKDTSVLSDTMADNYWRLQGIPGVIVALWNSFKRSAYSTDSQKNKTIMTKMYIPLLESGLKYFSKMTGHPVIEGAIQKDSKTPPTQAPATTQTAATGSSPKAVPGLTTMSSRGKKELIEFSEGRHYKLYDDATGKTIAHALDATGTPTIGVGHAVDKTDTQFDGKTLSDAQVDNLLADDISNSERAVAKLGLTLSQGQFDALVDLTFNVGPGGPGRDGALRQSNGQPSTLVRLITEGNTQGAANEFMSWVYSGNKVSQGLVNRRKRDQAFFLDAPLPSAQSLAANAYSSRGSNEEMATNNGKLIPGNMNGMLPRAIAEQLLHILSGHMTSGGTGEVMSFGQLIDAFLEVVEYDHIIMNSPAKVGISGQPELIDHVYKPKMFAYYAPICNVLLPYMYESYSVNIGTDQVPSRVIYNGMPFTADSAANSGTTVNMHNYVAPHSVRVALADGGNLAQTLISLVASPGKYEWGSGVRTADGTLPYWYAVLSHQAASGNVGDLANGTTLENLKAAWNKLYPEDPTLNPWDTDLSGIDAYQRLFFTGVDVEFSNAYSQARQGSVDGVFNPFIIPGYPMDVVDPSPLRESYHGFCTSVTHNIDATGHASTSIGMSSAYTFSEIATCYIPGTYPWLLAQLKMDENLKMYGNTPAYQRACQYYNDVLGVGAADPTILEAYSTGRPIPVKRNMGVWEIGEADTLTGSTHPTLYNTTLGNLNLVARNIASLLDIEKEHAMVTETFVDIDMWNQGSPTVEEVRKGPWVNDQSGTQDSSTLRGRDMECSAFLDYDSEAETPDPIIVPGYRETNVPPTVGQPGLPLQQATVQGQKADVLPKLGKRCSVGALDATSQAKLAECHPDLVKVVMKVIETRPVVITSGYRTSAKQAALENGGGKYAQGAGKGSRHLKYPSWAVDMVPCNNLNASSRAVMQNFAEYVLSVANSMGISLRWGGNFSTYDPVHFDLGLG
- Physico‐chemical
properties -
protein length: 1165 AA molecular weight: 125662,0 Da isoelectric point: 5,38 hydropathy: -0,24
Representative Protein Details
- Accession
- 5EHHM
- Protein name
- 5EHHM
- Sequence length
- 232 AA
- Molecular weight
- 24548,44270 Da
- Isoelectric point
- 7,87721
- Sequence
-
MRTSQQGRDRIKKHEALRLVSYRDAVGVWTIGWGNTGSEYAREGVRISKSKAQALFDGDIAEAEAIVDKAVSVPLSQEQYDALVSFVFNIGPGSKHKDGFVRLKNGKPSTLLRLLNEGKYEAAAQQFNRWVYGGGRILGGLVKRRAEETALFLEGTTSEVTTTSDTGPESGVIGESAPKGNEAPAAAALGGVAAVAAPLLQAASGATPALVATIFAALAAAYLIWKLTKEDE
Other Proteins in cluster: phalp2_30561
| Total (incl. this protein): 5 | Avg length: 395,2 | Avg pI: 8,43 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 5EHHM | 232 | 7,87721 |
| 6PRH1 | 255 | 9,93079 |
| A0A6G5Y6V0 | 160 | 9,51716 |
| A0A8K1BE17 | 164 | 9,45714 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_33221
5jbqV
|
447 | 49,3% | 154 | 1.274E-47 |
| 2 |
phalp2_10966
4MkR6
|
70 | 42,5% | 160 | 5.476E-46 |
| 3 |
phalp2_28113
7zmZV
|
50 | 45,2% | 157 | 7.492E-46 |
| 4 |
phalp2_2632
6RhYr
|
14867 | 46,0% | 152 | 4.909E-45 |
| 5 |
phalp2_2498
5GMvl
|
297 | 45,6% | 162 | 6.015E-44 |
| 6 |
phalp2_2176
3Yhf4
|
930 | 42,2% | 168 | 3.937E-43 |
| 7 |
phalp2_6830
8n5Jv
|
121 | 46,8% | 158 | 5.384E-43 |
| 8 |
phalp2_8252
7B99h
|
46 | 45,3% | 152 | 7.363E-43 |
| 9 |
phalp2_14481
4Lj5N
|
23 | 38,8% | 211 | 1.883E-42 |
| 10 |
phalp2_4532
3QBcV
|
9 | 47,3% | 150 | 3.522E-42 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Salmonella phage Arash [NCBI] |
3038319 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
OQ632216
[NCBI]
CDS location
range 8089 -> 11586
strand +
strand +
CDS
ATGCGTTTAGAAAATAATCTCGGCGGGGACAACCTCTACGGGAACTTCCGGCTGTACATCGAGGGGATCGAGGTCCCCTTCTCTTCAGCGAGCATCACGAATACATACCGCGGACTGCCGACGGCTCAGATCGTCTTGGCACCGTGGCGGGGGTTCTCGGAGATCTCTAAGGGGTACTTCCCGAAGGTTCACCTCTTCTACCGTGACTTCAACCAAGGCTTAGCGCCTTATGACATGAAGAAGATGGTCGGCGGTGTAGCCACTATGAATACGTTAGAGGCTCAATCGGCGATGGCAGAGGCCGCTGAGACCTTCAACAAGCTCGAACGTCACGCCTACAAGCTGATCTTCGGCGGTGTCGTCACATCCGTTACCGACTCGAAGTCATTAGCCGGCGAGGGTGGTTCAGCGAGCTTAGTGCTCTCCTGCGTTCACCCGTATTACCTGCTGAACGAAATCCTCTTCAAGTACATCGGTACCAACCCTCAGGAAACGACGGCCCGGGACTCAGCCTTCGGTATCATGGTGGACGGCGGTAACGTCGGCTCTCTGTATACCATCGAGCAGGCGCTGGCGGGTGTACGTGCCCCTGGCGACGGGCAGGTGACGCTGGAAGACGGTGGCAAGGACACGTCCGTGCTGTCCGATACGATGGCGGATAACTATTGGCGTCTTCAGGGTATCCCAGGGGTGATCGTGGCCCTCTGGAACTCCTTCAAGCGTTCGGCGTACTCTACCGATTCGCAGAAGAACAAGACGATCATGACGAAGATGTACATCCCGCTGCTGGAAAGCGGCTTGAAGTACTTTAGCAAAATGACCGGGCACCCGGTGATCGAAGGCGCTATCCAGAAGGATTCCAAGACGCCTCCGACCCAGGCGCCAGCGACGACCCAGACCGCGGCAACGGGCTCCTCCCCGAAAGCCGTCCCCGGTCTGACAACCATGTCGTCCCGTGGCAAGAAGGAGTTGATTGAGTTTTCTGAAGGCCGTCACTACAAGCTGTACGATGACGCCACCGGTAAGACAATCGCTCACGCACTCGATGCCACCGGCACCCCGACAATTGGGGTTGGCCATGCGGTAGACAAGACAGATACCCAGTTCGATGGCAAGACACTCTCTGACGCTCAGGTTGACAACCTGTTGGCCGATGACATCTCTAACTCTGAGCGTGCCGTAGCCAAGCTCGGCCTCACCCTCTCTCAGGGCCAGTTCGATGCACTGGTGGATCTGACCTTCAACGTCGGTCCAGGCGGTCCAGGTAGAGATGGAGCTCTTCGTCAGTCTAACGGGCAGCCATCGACTCTCGTTCGCTTGATCACTGAGGGGAATACTCAAGGTGCTGCGAATGAGTTCATGTCATGGGTATACTCTGGAAATAAAGTCTCCCAGGGTCTGGTGAATCGCCGTAAGCGTGACCAAGCGTTCTTCTTAGATGCTCCGCTTCCGTCGGCTCAGAGTCTGGCTGCGAATGCCTACTCTTCCCGCGGTTCCAACGAGGAAATGGCCACAAACAACGGTAAGCTGATTCCGGGGAATATGAATGGTATGCTCCCACGTGCGATAGCTGAACAGCTGCTGCACATCCTGAGTGGCCACATGACCTCAGGTGGGACCGGTGAAGTGATGTCTTTCGGCCAACTGATCGATGCCTTCTTGGAAGTCGTTGAGTATGACCATATCATCATGAACTCACCGGCTAAGGTGGGGATTTCCGGACAGCCGGAACTGATCGATCACGTCTACAAGCCGAAGATGTTCGCGTACTACGCCCCGATCTGTAACGTACTGCTTCCGTACATGTACGAATCCTACTCGGTGAACATCGGGACCGATCAGGTGCCTTCACGTGTGATCTACAACGGGATGCCTTTCACGGCTGACTCCGCGGCTAACTCCGGTACGACCGTGAACATGCACAACTACGTCGCCCCTCACTCTGTCCGTGTGGCGCTGGCTGACGGCGGTAACCTAGCCCAGACCCTTATCTCTCTGGTCGCGTCGCCTGGGAAGTACGAGTGGGGTTCCGGTGTGCGTACGGCGGACGGTACTCTGCCTTACTGGTATGCCGTGCTGTCGCATCAAGCAGCCTCCGGTAATGTTGGTGACTTAGCGAATGGCACAACGCTCGAGAACCTGAAGGCTGCGTGGAATAAACTGTATCCGGAGGACCCGACGCTGAACCCGTGGGACACCGATCTGTCAGGGATTGATGCCTACCAGCGTCTGTTCTTCACCGGTGTCGACGTGGAGTTCTCGAATGCGTACTCACAGGCCCGCCAGGGAAGCGTAGACGGCGTTTTCAACCCGTTTATCATCCCAGGGTACCCAATGGACGTCGTCGATCCAAGTCCTCTCAGAGAGTCCTACCACGGCTTCTGTACGAGTGTGACTCACAACATTGACGCGACGGGGCACGCTTCGACGTCGATCGGTATGTCCTCAGCATACACCTTCTCGGAGATCGCGACGTGCTACATTCCGGGAACCTACCCATGGCTCCTCGCTCAGCTGAAGATGGACGAGAACCTGAAGATGTACGGGAATACGCCGGCCTATCAGCGTGCCTGCCAGTACTACAATGACGTGCTCGGTGTGGGTGCAGCGGATCCGACGATCCTCGAAGCGTACTCTACCGGTCGTCCTATTCCGGTGAAGCGTAACATGGGTGTCTGGGAGATCGGCGAAGCGGATACACTGACAGGATCAACCCACCCGACGCTATACAACACGACGCTCGGTAACCTGAACCTCGTGGCCCGTAACATCGCGTCACTGCTGGATATCGAGAAGGAACACGCCATGGTAACAGAGACATTCGTGGATATCGACATGTGGAACCAAGGTTCTCCGACCGTTGAAGAGGTCCGTAAGGGTCCGTGGGTGAATGATCAGTCTGGAACGCAGGATTCTTCTACGCTCCGCGGTCGTGATATGGAGTGCTCAGCCTTCCTGGATTATGACTCAGAGGCCGAAACGCCAGACCCGATCATTGTCCCGGGTTACCGTGAGACAAACGTGCCGCCGACGGTAGGTCAACCTGGTCTGCCGCTGCAACAAGCAACCGTACAGGGTCAGAAAGCAGATGTTCTGCCTAAACTCGGTAAGCGTTGTTCAGTTGGTGCCCTGGATGCGACCTCTCAGGCGAAGTTGGCCGAGTGTCACCCAGATCTGGTTAAGGTCGTGATGAAAGTCATCGAGACCCGTCCAGTGGTCATCACCTCCGGCTACCGTACGTCTGCTAAGCAGGCGGCCCTGGAGAATGGTGGTGGTAAATATGCTCAAGGTGCAGGCAAAGGTTCTCGTCACCTCAAATATCCATCATGGGCAGTCGACATGGTACCGTGTAACAACCTGAACGCGTCTTCACGTGCAGTAATGCAGAACTTCGCGGAGTACGTTCTGAGCGTGGCGAACTCAATGGGCATCAGTCTGCGTTGGGGTGGGAACTTCTCAACGTATGATCCAGTGCACTTCGACTTAGGACTCGGTTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0003796 | lysozyme activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
| GO:0016998 | cell wall macromolecule catabolic process | biological process | None (UniProt) |
| GO:0030430 | host cell cytoplasm | cellular component | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
| GO:0044659 | viral release from host cell by cytolysis | biological process | None (UniProt) |
Enzymatic activity
| EC Number | Entry Name | Reaction Catalyzed | Classification | Evidence | Source |
|---|---|---|---|---|---|
| 3.2.1.17 | None | Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. |
match to sequence model evidence used in automatic assertion
ECO:ECO:0000256 |
ARBA:ARBA00000632 |
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(5EHHM)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50