Protein

Protein accession
A0A2D1GDJ1 [UniProt]
Representative
1Mw6C
Source
UniProt (cluster: phalp2_1467)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAVTRANVEATKNFIRARVGNPYVYGGALSPTNVRQGTDCSEVWQTVLEMVHGRYNPGRQSEGATTESYRYIPVGGVGPFGTIRVASWRDIPADAVAKIAFHHGPGGGASSHMWGDLDGMLIESAGSKGLVTNGRAMTIDNSYATAWAYLPGPIVEDGSPIPEDPNAVTWGIDISNHQGEMDLNRVKAEGFDFIWCKVSEGANYRDPFWPGNRDKARAAGLILAGYHYVRTGDPAAQAKTFVEHLGDKSIPAMLDFEDGSGNIEQFWAVKAEIEKLGVQVRLSYIPDWYWERIGKPDLSKVPGLIASEYVSGTGYASVLYPGNSSNFWKAYGGRTPDVLQFTDKALVAGKSVDANAFRGTPDKLRRLLGAGGDDFLSALSDAEQRLLFDRTNQVWGALFNPVGSESKYKAEGEGLIYKTKDMVRFIDKNVHEMLVERQAMMGNPEAIALVRREANKGDKWAKVVLDYIESEG
Physico‐chemical
properties
protein length:472 AA
molecular weight:51643,5 Da
isoelectric point:5,63
hydropathy:-0,36
Representative Protein Details
Accession
1Mw6C
Protein name
1Mw6C
Sequence length
436 AA
Molecular weight
47373,67620 Da
Isoelectric point
6,52429
Sequence
MTVLRSNVEATKAFIRARVGNPYVYGGALSNNVRQGTDCSEVWQTVLEMVLGRWRQGRQSEGATTESYRYIPVGGVGPFGTIRVANWRDIPGNAVARLAFHHGPGGGANSHMWGDLDGMRIESAGSKGLVTGGNARAIEDGYAHAWCYLPGPIDGGGAVPVESTEDLQRRLNTEGAGLEVDGEYGPLTASALAAYDKRVPYARQVKAEGERRGITPRGIQIAFSVVFVESGWKMYANSKVPESLKLPHDAVGSDHDSVGLFQQRCPMWGPANVLMDPTKSAGLFYDRLAKLDYNNQSRPPGDYAADVQRPAAQYRGRYQERMGHAVALYNRITTTPEPSDDLEALLMSDELFASRSIYRTSDQKTMSARDAIFGADACAHMTWIEQKALAGDSWAIGLVADLATGKLPASKDPWAVDRARHIISIIQNAIAAQETK
Other Proteins in cluster: phalp2_1467
Total (incl. this protein): 71 Avg length: 492,0 Avg pI: 5,88

Protein ID Length (AA) pI
1Mw6C 436 6,52429
2LVy0 426 5,36216
7dyXr 454 6,56885
ugp5 433 5,00782
A0A7D5JQW9 526 5,94743
A0A5J6T2E0 525 5,71814
G8I7N0 526 5,93083
A0A1C9M1F8 526 5,81499
A0A0F6YQU5 526 5,89599
G1D3T6 472 5,93316
A0A5J6T3K6 466 6,21622
Q854L4 473 5,57638
A0A385DV82 472 5,60196
A0A0K2CPS6 526 5,81687
A0A142F1U1 466 6,21622
A0A142K4R9 526 5,93083
A0A1D8EY32 526 5,81687
A0A1D9CAG5 526 5,81499
A0A1S6L2C0 465 6,21622
A0A2P1A0V6 526 5,93083
A0A2P1A2H9 526 5,81687
A0A2P1A2Y7 526 5,93083
A0A2S1PBB3 472 5,60196
A0A2Z5H0T0 465 6,21622
A0A2Z5H107 466 6,21622
A0A2Z5H6C2 526 5,81687
A0A345BM58 472 5,86956
A0A345L5Y8 466 6,21622
A0A385D1K5 472 5,51130
A0A385D246 472 5,60196
A0A385UI13 472 5,60196
A0A3G8FI38 526 5,71172
A0A481VR61 526 5,93083
A0A481VV02 472 5,60196
A0A481VWN3 466 6,21622
A0A481VXL8 466 6,21622
A0A481VXN4 466 6,29955
A0A481VYH2 466 6,21622
A0A481W0B0 472 5,60196
A0A481W2D8 472 5,60196
A0A481W320 516 5,67892
A0A481W7R5 526 5,77299
A0A482JES0 518 5,91645
A0A4P8GHX0 472 5,60196
A0A4Y6EPL8 526 5,81499
A0A515MG35 472 5,60196
A0A5B8WR32 472 5,60196
A0A5J6T8K7 517 5,52494
A0A5J6TCR7 516 5,90929
A0A5J6TN95 526 5,74508
A0A5J6TRS6 516 6,15199
A0A5J6TY85 526 5,81687
A0A5Q2WHV6 526 5,93083
A0A5Q2WS95 516 6,03666
A0A649V7M7 518 5,91645
A0A649V8X9 526 5,81687
A0A649VDG1 516 5,70620
A0A649VGJ1 466 6,21622
A0A649VPV9 473 5,64243
A0A6B9J644 466 6,21622
A0A6B9SZN9 465 6,21622
A0A7T1KRD6 473 5,64243
A0A7T3KBC0 466 6,21622
G1BR17 526 5,81499
A0AAE8BVC2 472 5,60196
A0AAE8YB30 472 5,60196
A0A976U9C0 526 5,81681
A0A977KLW9 466 6,21622
A0A977KQK3 466 6,21622
A0AA48Y4F3 527 6,29892
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23524
75PiN
82 41,3% 532 1.583E-146
2 phalp2_3134
870ZT
3 32,8% 429 1.242E-42

Domains

Domains [InterPro]
Unannotated
Unannotated
Unannotated
Unannotated
Unannotated
Representative sequence (used for alignment): 1Mw6C (436 AA)
Member sequence: A0A2D1GDJ1 (472 AA)
1 436 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage Klein
[NCBI]
2041536 Omegavirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MF919512 [NCBI]
CDS location
range 40341 -> 41759
strand +
CDS
ATGGCTGTCACTCGCGCTAACGTAGAAGCGACGAAGAATTTCATCCGGGCGCGGGTGGGCAACCCTTATGTCTATGGCGGGGCCCTTTCGCCGACCAACGTTCGACAGGGGACCGACTGCTCGGAGGTGTGGCAGACCGTCCTAGAGATGGTTCACGGCCGATACAACCCCGGCCGTCAGTCCGAGGGTGCCACCACCGAGTCCTACCGCTATATCCCCGTCGGGGGGGTGGGCCCATTCGGCACCATCCGGGTAGCCAGCTGGCGGGACATTCCCGCCGATGCCGTGGCGAAGATCGCCTTCCATCACGGCCCCGGCGGCGGTGCGAGCTCGCACATGTGGGGCGATTTGGACGGCATGCTCATCGAGTCGGCCGGCTCCAAGGGGCTAGTCACCAATGGTCGCGCCATGACCATCGACAACTCCTACGCCACCGCGTGGGCATACCTCCCCGGCCCAATCGTCGAAGACGGTAGCCCCATCCCCGAAGACCCCAACGCGGTCACCTGGGGAATTGACATCTCCAATCACCAGGGCGAGATGGACCTCAACCGCGTCAAGGCCGAGGGGTTCGACTTCATCTGGTGCAAGGTGTCGGAGGGCGCGAACTACCGCGACCCGTTCTGGCCGGGTAACCGGGATAAGGCCCGCGCTGCCGGCCTGATCTTGGCGGGCTATCACTACGTCCGCACGGGGGACCCTGCCGCTCAGGCGAAAACCTTCGTGGAGCACCTGGGGGATAAGTCCATCCCCGCCATGTTGGATTTCGAGGACGGCTCGGGGAACATCGAGCAGTTCTGGGCGGTCAAGGCCGAGATTGAAAAGCTCGGCGTCCAGGTCCGCCTGTCCTACATTCCTGACTGGTACTGGGAGCGCATCGGAAAGCCGGACCTGTCCAAGGTCCCTGGGCTTATTGCATCCGAATACGTCAGCGGAACCGGATACGCATCGGTGCTATATCCGGGCAATTCAAGCAACTTCTGGAAAGCGTACGGCGGCCGGACGCCCGACGTCCTGCAATTCACCGACAAGGCCCTTGTGGCTGGCAAGTCGGTCGACGCTAACGCTTTCCGGGGAACACCAGACAAGCTGCGCCGACTTTTGGGCGCGGGAGGAGATGACTTCTTGTCCGCATTGAGCGATGCAGAACAGCGGCTGCTCTTTGATCGCACCAACCAAGTATGGGGCGCCCTCTTTAATCCCGTCGGTTCGGAATCGAAATACAAGGCCGAGGGCGAGGGCCTGATCTATAAGACCAAGGACATGGTCCGGTTCATCGACAAAAACGTCCACGAGATGCTTGTCGAGCGCCAGGCCATGATGGGGAATCCTGAGGCTATCGCCCTCGTCCGCCGAGAGGCAAATAAGGGCGATAAGTGGGCCAAGGTAGTCCTCGACTACATCGAAAGCGAGGGCTAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi000c0c1d45_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1Mw6C) rather than this protein.
PDB ID
1Mw6C
Method AlphaFoldv2
Resolution 87.24
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50