Protein

Protein accession
Q9B033 [UniProt]
Representative
6Q25R
Source
UniProt (cluster: phalp2_30676)
Protein name
Lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MHLSENGRLLIMRLEGGRLRAYQCRAGIWTIGYGHTEGVKPGDKISLDQALELFNHDVQWVGGRVNALVKVPLSQGQFEALCSFVFNVGRGAFAQSRLLKKLNAGDVAGVLADGLCFLCVRANHWQQRAVSFEQQAGEAQRERISARAGGFAATGYCG
Physico‐chemical
properties
protein length:158 AA
molecular weight:17207,5 Da
isoelectric point:9,17
hydropathy:-0,08
Representative Protein Details
Accession
6Q25R
Protein name
6Q25R
Sequence length
231 AA
Molecular weight
24522,74100 Da
Isoelectric point
7,84691
Sequence
MKFTAYEHMERWEGLRLDAYLCPAKVWTIGYGHTRTAKPGMKITEADAKALLESDLGWATAAVNRLVRVPLNQNQFDALVSLVYNIGEGGFAGSTVLRRLNAGDYAGAARAFEMWNKITDSAGRKVVSRGLENRRASERLLFETSAGHITPAEDADRTVIPADALPVPKVTAGAKGGAAVIALAAAASELAGQYGIALPVGAIEVLIVGGLTQLASFVAGYMKPSREGDHP
Other Proteins in cluster: phalp2_30676
Total (incl. this protein): 25 Avg length: 247,3 Avg pI: 8,14

Protein ID Length (AA) pI
6Q25R 231 7,84691
14LL0 248 6,33274
17ZEr 251 8,60783
1ljD4 231 9,21983
2sz2c 267 6,31944
35NoM 252 8,69197
3ngLa 225 5,68887
4RBKM 262 6,97036
4RBt5 252 9,24214
4Rv1V 254 9,53637
4exUA 268 8,89678
4g8Tb 291 9,06530
53zs8 241 8,92528
53zuz 241 5,95993
53zvB 241 9,34696
5gLJ3 263 9,45708
6Hp3r 263 9,26316
6PZHo 266 6,80007
6R6zo 265 7,83209
6Uy8w 211 10,42455
7lCu 265 7,84891
81gNC 234 8,79782
81pxt 236 6,91085
87hye 267 6,43272
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26378
1q5mp
207 50,2% 201 3.480E-63
2 phalp2_29738
1pBjt
830 45,3% 205 4.332E-59
3 phalp2_28113
7zmZV
50 56,2% 144 8.148E-56
4 phalp2_34878
4dXDH
390 53,0% 149 3.853E-46
5 phalp2_14481
4Lj5N
23 45,5% 156 1.084E-43
6 phalp2_19992
6T980
5 38,3% 185 3.793E-43
7 phalp2_6732
24SaC
23 40,9% 205 9.702E-43
8 phalp2_8851
2IyxA
73 46,6% 150 1.187E-41
9 phalp2_34911
4lF8s
21 47,9% 146 1.187E-41
10 phalp2_37325
8t1Gv
89 42,9% 163 1.061E-40

Domains

Domains [InterPro]
Unannotated
Disordered region
Representative sequence (used for alignment): 6Q25R (231 AA)
Member sequence: Q9B033 (158 AA)
1 231 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Phage GMSE-1
[NCBI]
148339 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
AF311646 [NCBI]
CDS location
range 1 -> 477
strand +
CDS
ATGCACCTCAGTGAAAACGGGCGCTTGCTCATCATGCGTCTTGAAGGGGGCCGCTTGCGCGCCTATCAGTGTCGCGCCGGGATCTGGACTATCGGCTACGGCCATACCGAGGGGGTAAAGCCCGGTGATAAAATATCGTTGGATCAGGCTCTCGAATTATTTAATCACGATGTGCAATGGGTCGGTGGACGCGTGAATGCGTTGGTGAAGGTGCCGCTCTCGCAGGGGCAGTTTGAGGCGCTCTGTTCGTTTGTCTTTAATGTCGGGCGCGGCGCCTTTGCTCAGTCGCGTTTGCTCAAAAAGCTCAATGCGGGCGATGTTGCCGGTGTGTTGGCGGATGGCCTGTGCTTTCTTTGTGTGCGCGCTAATCACTGGCAGCAGCGGGCCGTCAGTTTTGAGCAGCAGGCGGGCGAGGCGCAACGCGAGAGAATATCTGCGCGCGCAGGTGGTTTCGCGGCAACAGGCTATTGCGGATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi000009b1e5_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6Q25R) rather than this protein.
PDB ID
6Q25R
Method AlphaFoldv2
Resolution 69.15
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50