Protein

Protein accession
A0A6M3BF03 [UniProt]
Representative
5oGmc
Source
UniProt (cluster: phalp2_144)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKFKTKLITLVVAFLAAISFALPSHVNAAKGDIGVDWAVYQGANGKYGTNNKFAIIQAGGTQGGTLYDQWTYASQVKAAQANGLKVHTYLWYGVGGSADIGRQALDYFLPKILTPKGSIVALDYEDGASSSMAANTDAILYGMRRIAQAGYTPMYYSYKPYTLAHVDYQRILAEFPNSLWIAAYPDYQIRALPDYGVFPSMPGIALYQFTSMHAAGGLDGNVDLLGVTDNGYSQQPVAPSKPATPSQPSTSTAASDTDYAQTGVFKPSATVNIRTGAGTGYASVGSYAPGESVIYDHVYIRGTYVWARYLSYSGRYHYVALGVNGGESYGSRSSGYTSPVSHTYYTVRSGDSFWSIASKYGISMYTLAANNGKSIYSLIYPGESLYIR
Physico‐chemical
properties
protein length:388 AA
molecular weight:41843,4 Da
isoelectric point:8,77
hydropathy:-0,12
Representative Protein Details
Accession
5oGmc
Protein name
5oGmc
Sequence length
191 AA
Molecular weight
20467,28440 Da
Isoelectric point
7,82764
Sequence
MDGVAIWQYTSAFGLSQGLDGNIDLLGVTDNGYSKQPATPSVPVTPAPSQPAKSNAASDTDYAQNGVFKPSATVNIRTGAGTGYASVGSYAPGESVIYDHVYIRGTYVWARYLSYSGRYHYVALGVNGGESYGSRSSGYTSPVSHTYYTVRSGDSFWSIASKYSISMYTLAVNNGKSIYSLIYPGESLYIR
Other Proteins in cluster: phalp2_144
Total (incl. this protein): 8 Avg length: 274,1 Avg pI: 8,42

Protein ID Length (AA) pI
5oGmc 191 7,82764
6gabs 241 8,92283
78Tzy 145 9,99391
7RoCH 237 6,05559
7yJJQ 210 8,58482
A0A2Z6BEJ9 388 8,30180
A0AAF0AQ69 393 8,89530
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_25839
5j1l4
12 78,5% 191 3.942E-92
2 phalp2_38217
7u45O
5 35,8% 198 6.418E-30
3 phalp2_29108
6ZCjl
6 33,0% 121 2.241E-14
4 phalp2_28052
75Rgb
8 31,0% 190 4.052E-10
5 phalp2_15728
3kqKZ
1 31,2% 195 1.545E-08

Domains

Domains [InterPro]
Disordered region
SH3_5
LysM
Representative sequence (used for alignment): 5oGmc (191 AA)
Member sequence: A0A6M3BF03 (388 AA)
1 191 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01476, PF08460

Taxonomy

  Name Taxonomy ID Lineage
Phage Lactobacillus phage JNU_P10
[NCBI]
2686380 Pleetrevirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MN830259 [NCBI]
CDS location
range 11666 -> 12832
strand +
CDS
ATGAAATTTAAAACTAAACTAATCACCTTGGTAGTCGCCTTCTTGGCGGCTATTTCTTTTGCCTTGCCATCGCATGTCAATGCGGCCAAGGGCGATATCGGTGTGGACTGGGCAGTTTATCAGGGAGCCAACGGGAAATATGGCACGAATAACAAGTTCGCGATCATTCAGGCTGGCGGTACACAAGGCGGTACGCTGTATGATCAATGGACGTATGCAAGCCAAGTCAAGGCAGCACAGGCGAATGGGCTCAAAGTCCACACTTATCTGTGGTACGGGGTCGGTGGCAGTGCTGATATTGGCCGACAGGCACTTGACTACTTCCTTCCCAAGATTCTAACGCCGAAAGGTTCTATTGTTGCTCTTGATTATGAGGACGGTGCCTCATCAAGTATGGCTGCCAATACCGATGCCATTCTATATGGTATGCGGCGCATTGCGCAGGCGGGATATACGCCAATGTATTACAGCTACAAGCCGTATACACTGGCACACGTTGACTATCAGCGCATTCTAGCGGAGTTTCCCAATTCACTGTGGATCGCGGCCTATCCGGACTATCAGATTCGGGCATTGCCAGATTATGGTGTATTCCCGTCAATGCCCGGCATTGCGCTCTACCAATTTACTTCTATGCATGCGGCAGGCGGTCTTGATGGCAACGTTGACTTGCTTGGCGTAACCGATAATGGATACAGTCAACAACCTGTCGCGCCATCAAAGCCCGCAACACCATCGCAACCGTCTACTTCAACAGCAGCCAGTGATACCGACTATGCGCAAACTGGTGTTTTCAAGCCGTCCGCGACTGTTAACATTCGCACTGGTGCCGGCACCGGCTATGCATCCGTTGGTAGCTACGCACCCGGTGAAAGTGTGATTTATGATCACGTGTATATCCGTGGCACATATGTTTGGGCACGTTATCTCAGCTACTCAGGCAGGTATCATTATGTTGCCTTGGGCGTGAATGGTGGGGAGAGCTATGGCTCGCGTTCGTCTGGATATACTTCGCCGGTAAGCCACACGTACTACACTGTCCGCTCTGGTGACAGCTTCTGGAGTATTGCCAGCAAGTATGGCATCAGCATGTACACGTTAGCGGCTAACAACGGTAAGTCAATCTACAGCCTGATCTACCCGGGCGAAAGCCTGTATATCAGGTAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi000297c32d_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5oGmc) rather than this protein.
PDB ID
5oGmc
Method AlphaFoldv2
Resolution 73.40
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50