Protein

Protein accession
E3W8H6 [UniProt]
Representative
734XP
Source
UniProt (cluster: phalp2_1050)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MNKLKRWVIASVGAVAFLVATISGVSANTLGIDVASYQGTTTSYFSQFKSYGDNFTMVKLGGRGGGEGSHYVNPKAYAQIHNADAVGMQTGGYFWGEFGDSVSEASYHAQLAVQDAQNAGLAKGSYIALDYEKGAGLTKANNTTAILTFMDQIYAAGYKPMFYSYTSYVNSYVDLTRINARYPNALWLAWYLTTAHQATPPMQYFPNYSNVKIWQYGDNHFGVDGNVMVVGSLDNDKPAEQTASKPSQSTNTPSTPAKTRYATFSGVYVADYWTKYNNKMYGVNIDMSIPVIDYNNYIPISALTLTDRYGNKLKNQYIQGNNGRMEYFTLNGKYKVISQTATTINVEIGGEPVSMMKSFAIIK
Physico‐chemical
properties
protein length:363 AA
molecular weight:39965,4 Da
isoelectric point:8,79
hydropathy:-0,28
Representative Protein Details
Accession
734XP
Protein name
734XP
Sequence length
367 AA
Molecular weight
40207,26670 Da
Isoelectric point
5,86331
Sequence
MGGVKVNKLKRWVVASIGAVAFFGAVITGVSANTNGIDVASYQGDTTQYFSSFKQVGDNFTMVKLGGRGGGEGVHYANPKAYNQINSAANVGMQTGGYFWGEFGDSVAEASYHAQLAVQDAQNAGLAKGSYIALDYEAGAGYNKANNTSAILTFMDAIYAAGYKPMFYSYTSYVESFVDLSRINARYPNALWLAWYLTTAHQATPPMQYFPNYSNVKIWQYADNHYGVDGNVMVVGSLDNNTPAENTAKPSNRPSDDQSSNKTHYATFSGVYVADYWTKWYGGYYGVNVDMGIPIIDYNNYMPISALTLTDKYGHKLSNQTIKGNNGQQEYFTLEGEYKVLSQTATSVQVEIAGEPVWMMKSFATIK
Other Proteins in cluster: phalp2_1050
Total (incl. this protein): 69 Avg length: 361,8 Avg pI: 8,15

Protein ID Length (AA) pI
734XP 367 5,86331
1bM6L 363 8,93121
1c5iM 363 8,32991
1c5kl 369 9,23073
1c9xi 367 8,60216
2TxFw 361 9,35496
3lGwv 367 8,61022
4AbCx 363 8,81252
5TuQp 364 9,04577
5oH2l 363 9,17367
5oeqq 364 8,61834
5tRF2 363 8,63685
5tVU4 363 8,80369
75W3u 363 8,80762
79roQ 355 4,52645
7BK7p 363 8,80794
7Bbi5 363 8,60932
7C5H4 367 6,37957
7YnFe 363 8,94043
7YzTz 356 6,20013
7cJMa 363 8,35248
7cRXx 363 8,61937
7dZHX 363 8,94326
7dr6k 363 9,03855
7gJzG 363 8,37588
7guBP 365 7,87870
7hILo 366 9,00941
7hglX 364 6,89635
7hnJx 363 6,57544
7lazr 363 9,06737
7nQCY 363 8,94313
7s3rz 362 5,51170
7sCQt 364 7,84163
7sNDq 363 8,61834
7sNFA 363 8,91619
7sNGA 363 8,36647
7sNIA 363 9,17393
7ssFU 363 7,84659
7ssHB 364 8,32824
7t05c 363 8,36924
7tH3i 363 8,63639
7tH8n 363 8,64548
7tH9l 363 8,78628
7tTMr 363 9,03842
7tTPh 363 7,89043
7tTRz 363 8,77777
7uEdO 367 8,32379
7uEgp 363 8,62788
7uEjd 363 8,61950
7uEl8 363 8,35686
7uEnh 365 7,17151
7uSEr 363 6,22935
7upYk 363 8,62788
7uq64 363 8,62827
7vjGm 363 8,80350
7wDbV 356 4,46939
7wDbv 356 4,49468
7wJed 364 8,64529
7wN5P 354 9,44638
7yrap 363 7,87476
7yrbl 363 8,63697
7z88S 363 8,82168
7z89N 294 6,07230
86Hq7 363 8,33978
8K2vo 363 8,79402
8fL4R 356 6,58601
8fejE 356 8,35505
A0A6G5YBL3 363 8,93140
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12156
6iQWP
264 39,5% 235 3.945E-47
2 phalp2_17635
6JmE3
482 30,2% 284 3.236E-21
3 phalp2_36914
7BzuU
65 26,8% 242 4.932E-18
4 phalp2_36883
7sfSC
39 27,3% 358 3.785E-17
5 phalp2_36076
6wAjV
94 26,0% 261 9.242E-14
6 phalp2_362
76MmF
5 26,0% 234 6.879E-13
7 phalp2_14854
7znRU
13 26,1% 237 2.119E-11
8 phalp2_4947
6Eg7U
122 28,5% 228 8.532E-10
9 phalp2_39885
1cJe4
2 25,1% 243 1.840E-07
10 phalp2_11324
7cdUH
5 24,7% 259 3.234E-07

Domains

Domains [InterPro]
Disordered region
GH25
Unannotated
Representative sequence (used for alignment): 734XP (367 AA)
Member sequence: E3W8H6 (363 AA)
1 367 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Leuconostoc phage phiMH1
[NCBI]
912321 Seongbukvirus > Seongbukvirus MH1
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
HM596271 [NCBI]
CDS location
range 32869 -> 33960
strand +
CDS
GTGAATAAATTAAAACGATGGGTGATTGCTTCTGTTGGAGCGGTTGCCTTTTTAGTTGCCACGATTTCAGGTGTTTCAGCCAATACATTGGGTATTGACGTTGCCAGTTATCAGGGCACAACCACAAGCTATTTCAGCCAGTTTAAGAGCTATGGTGATAACTTCACTATGGTTAAGCTGGGCGGACGTGGCGGCGGTGAGGGTAGTCATTATGTCAATCCTAAAGCCTACGCCCAAATTCATAACGCTGATGCCGTCGGTATGCAAACCGGTGGCTATTTCTGGGGTGAGTTTGGTGATTCAGTTAGTGAAGCGAGTTATCACGCACAATTAGCTGTACAAGATGCACAGAACGCTGGATTAGCCAAAGGTAGCTACATTGCATTGGACTATGAAAAAGGCGCTGGTCTCACTAAGGCTAACAACACTACGGCTATTTTGACGTTCATGGATCAGATTTACGCTGCTGGCTATAAGCCAATGTTTTATAGCTATACAAGCTACGTCAATTCGTACGTTGATTTAACACGTATCAATGCACGTTATCCTAATGCATTGTGGTTAGCCTGGTACTTGACCACGGCGCATCAAGCAACGCCACCTATGCAATACTTCCCAAATTATTCCAATGTGAAGATTTGGCAGTATGGTGATAATCACTTTGGTGTTGATGGTAATGTCATGGTGGTTGGTTCATTGGATAATGATAAGCCAGCCGAACAAACGGCTTCAAAGCCATCACAGTCAACGAACACACCAAGCACACCAGCAAAGACACGTTATGCAACCTTTAGTGGTGTCTACGTGGCTGACTACTGGACTAAGTACAATAACAAAATGTACGGTGTCAACATTGATATGAGTATTCCAGTGATTGATTACAACAACTATATTCCTATCTCAGCCTTAACTTTGACTGACCGATATGGTAATAAGTTGAAAAATCAATACATTCAAGGCAACAACGGACGTATGGAGTACTTCACTTTGAACGGTAAGTACAAGGTTATTTCACAGACAGCGACAACGATTAACGTTGAAATTGGTGGTGAGCCAGTTTCCATGATGAAGTCGTTTGCAATAATCAAATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi0001ecfa2e_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (734XP) rather than this protein.
PDB ID
734XP
Method AlphaFoldv2
Resolution 83.45
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50