Protein

Protein accession
A0A4D6AST6 [UniProt]
Representative
75Rgb
Source
UniProt (cluster: phalp2_28052)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MALFGVDISEHNGFIDFDQLKNNVDFVIIRSSWGSFAEDLRARRNASECERVGIPYGFYHYSYACNLGDAQAEVNAFLNFTRQFHPSMPLYIDMEDADGWKANNGGVSWETSTAICRLFCDHVEAAGYWAGVYASLYWFQNMGDLSRYTNWVAQWQVTACSVPTDIWQFTSDGIVGGISGRVDSNYMYRDLRSVYTGQVPEPRSEVPQQATAPVSTRTYTVQEGDTLSAIAALFGTSYQELAAINGITNPDLIYPGQVLQVTGSTQASSSTTYTVESGDTLSAIADMYGTNYQHLAALNGIENPDLIYPGQVLRIE
Physico‐chemical
properties
protein length:316 AA
molecular weight:34988,3 Da
isoelectric point:4,43
hydropathy:-0,20
Representative Protein Details
Accession
75Rgb
Protein name
75Rgb
Sequence length
159 AA
Molecular weight
16730,35700 Da
Isoelectric point
4,05003
Sequence
MGACAVPTDIWQFTSDGIVGGISGRVDSNYMYRDLRSVYTGQIPEPRPAVPAPAAPASTGNYTVQEGDTLSAIAALYGTSYQELAAINGIANPDLIYPGQVLQVTGNPQAPSSTTYTVESGDTLSAIADMYGTNYQHLAAINGIENPDLIYPGQVLQIE
Other Proteins in cluster: phalp2_28052
Total (incl. this protein): 8 Avg length: 208,8 Avg pI: 7,34

Protein ID Length (AA) pI
75Rgb 159 4,05003
1gEfa 161 9,08716
5TLGE 197 9,09728
6nq2I 233 9,43342
6pdnu 229 8,83341
6sV3n 223 9,23324
7X1jM 152 4,54799
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_29475
1HHLO
21 68,3% 98 1.064E-37
2 phalp2_13670
5RffS
11 35,8% 162 3.593E-24
3 phalp2_32517
1lz2n
1 43,4% 122 1.397E-21
4 phalp2_26344
1gJRB
4 32,8% 146 6.698E-21
5 phalp2_224
5VoQx
6 28,6% 164 1.881E-18
6 phalp2_15728
3kqKZ
1 29,8% 171 2.573E-18
7 phalp2_5680
4CMV3
1 26,6% 154 4.810E-18
8 phalp2_17221
3xRQT
3 31,6% 155 2.487E-15
9 phalp2_5364
2oeSV
1 27,6% 206 1.799E-10
10 phalp2_144
5oGmc
8 31,3% 188 2.131E-09

Domains

Domains [InterPro]
Disordered region
LysM
LysM
Representative sequence (used for alignment): 75Rgb (159 AA)
Member sequence: A0A4D6AST6 (316 AA)
1 159 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01476

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage Javan100
[NCBI]
2547973 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MK448838 [NCBI]
CDS location
range 29515 -> 30465
strand +
CDS
ATGGCATTATTTGGAGTAGATATCAGTGAACACAATGGTTTTATTGACTTTGATCAGTTGAAAAACAATGTTGACTTTGTTATTATCCGTTCGTCTTGGGGTAGCTTTGCGGAAGATTTACGAGCACGACGAAATGCATCTGAATGCGAGCGGGTTGGCATTCCATACGGTTTCTACCATTATAGCTATGCTTGCAACTTGGGAGATGCACAAGCTGAGGTCAATGCCTTTTTGAACTTTACTCGTCAATTCCATCCTTCTATGCCACTCTATATTGATATGGAAGATGCAGACGGCTGGAAAGCAAACAATGGGGGTGTGAGCTGGGAAACTTCCACGGCTATCTGCCGACTGTTTTGTGACCATGTCGAAGCAGCAGGCTACTGGGCTGGTGTGTATGCGAGCCTGTACTGGTTCCAGAATATGGGCGACTTATCCCGTTATACGAATTGGGTAGCTCAGTGGCAGGTGACTGCTTGCTCTGTTCCAACAGACATCTGGCAGTTTACCAGTGATGGAATTGTTGGTGGTATCAGTGGCCGAGTAGATTCCAATTATATGTATCGGGATTTGCGTTCTGTTTATACCGGACAGGTACCAGAACCTCGATCAGAAGTGCCACAACAAGCAACAGCCCCAGTTTCGACTAGGACTTACACGGTTCAAGAGGGGGATACACTATCAGCTATTGCGGCTCTTTTTGGAACAAGTTATCAGGAATTAGCAGCTATCAATGGGATTACCAATCCTGACTTGATTTATCCGGGGCAAGTTTTGCAGGTTACTGGAAGTACGCAAGCATCTAGCAGTACCACTTATACGGTTGAAAGTGGGGACACCCTATCAGCCATTGCGGACATGTATGGCACGAACTACCAGCACTTAGCTGCCCTTAATGGAATTGAAAATCCGGATTTGATTTATCCCGGTCAAGTGTTACGAATTGAATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi0006613c67_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (75Rgb) rather than this protein.
PDB ID
75Rgb
Method AlphaFoldv2
Resolution 74.58
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50