Protein

Protein accession
P19385 [UniProt]
Representative
1mUYf
Source
UniProt (cluster: phalp2_33827)
Protein name
Lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MVKKNDLFVDVASHQGYDISGILEEAGTTNTIIKVSESTSYLNPCLSAQVSQSNPIGFYHFAWFGGNEEEAEAEARYFLDNVPTQVKYLVLDYEDHASASVQRNTTACLRFMQIIAEAGYTPIYYSYKPFTLDNVDYQQILAQFPNSLWIAGYGLNDGTANFEYFPSMDGIRWWQYSSNPFDKNIVLLDDEKEDNINNENTLKSLTTVANEVIQGLWGNGQERYDSLANAGYDPQAVQDKVNEILNAREIADLTTVANEVIQGLWGNGQERYDSLANAGYDPQAVQDKVNEILNAREIADLTTVANEVIQGLWGNGQERYDSLANAGYDPQAVQDKVNELLS
Physico‐chemical
properties
protein length:342 AA
molecular weight:38247,5 Da
isoelectric point:4,18
hydropathy:-0,44
Representative Protein Details
Accession
1mUYf
Protein name
1mUYf
Sequence length
277 AA
Molecular weight
31074,76900 Da
Isoelectric point
4,21867
Sequence
MVKKNDLFVDVASHQGYNISGILEEAGTTNTIIKVSESTSYLNPCLSAQVSQSNPIGFYHFAWFGGNEEEAEAEARYFLANVPTQVKYLVLDYEDHASASVQANTNACLRFMQVIADAGYTPIYYSYKPFTLDNVDYQQILAQFPNSLWIAGYGLNDGTANFEYFPSMDGIRWWQYSSNPFDKNIVLLDDEEEDNISNENTIKNLTTVANEVIKGFWGNGQERYDSLTNAGYDPQAVQDKVNEILNAGEVVDLTTVANEVIKGFWGNGQERYDSLTN
Other Proteins in cluster: phalp2_33827
Total (incl. this protein): 73 Avg length: 299,8 Avg pI: 5,16

Protein ID Length (AA) pI
1mUYf 277 4,21867
13AlJ 330 5,52204
1HIpN 295 4,58943
1cwZh 279 4,86391
1cy1v 294 4,15984
1cy9q 282 4,83492
1jTOU 282 4,66343
1os7S 294 5,24143
2lQw4 297 5,47174
2mvhU 331 5,30248
3bXXm 329 5,44952
3rTuu 294 5,23131
3sIyu 329 5,33772
4VFSb 331 5,79084
5Kdxy 294 4,97969
5OxRT 331 5,45515
5VB7O 331 5,45515
5ZhKN 282 5,19795
66RR3 320 4,68799
6MRd4 335 8,67546
6ZI3E 211 4,20821
6ZMxA 282 4,62166
6bBwf 282 4,83492
6dSD3 299 4,53566
6eAjj 282 4,69970
6gjZM 331 5,45515
6hn1M 294 4,31569
6hxzJ 296 4,87465
6m3cd 331 5,31799
6ojM4 317 4,71578
6p92t 299 4,46393
6pWRs 282 4,82440
6qICv 294 4,71743
6rVle 317 4,74960
71X21 290 5,44469
75Y11 273 4,71004
75Y16 274 4,65030
76nB9 341 5,53267
79JAh 335 8,79460
7Ql1W 273 4,84737
7bPYo 331 5,47208
7bPnH 294 5,27184
7bPsq 329 5,47208
7bQaB 331 5,47208
7euiL 334 5,79084
7g62e 294 5,25820
7qJu1 294 5,25820
7t636 342 9,24336
7vSNf 328 5,23569
7wDdl 335 8,67546
7wDei 335 8,85707
83ZCu 296 4,81201
83x6G 299 4,61796
8K2uS 273 4,59500
8K2uW 272 5,16095
8K2uX 299 4,49866
8LKft 342 4,18178
8bAKQ 296 4,59153
8cbxf 257 4,46075
8feKC 299 4,57533
8hjyM 282 4,56465
8lR5h 296 4,74636
8lRVS 295 4,79092
8qUy2 282 4,62041
8tSFL 296 5,10456
CFkp 233 4,29608
Cp1W 271 4,97997
HsRT 282 4,62166
MRwH 246 4,30842
MXx3 339 4,66463
NoWi 229 4,17638
q6j6 271 4,99043
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28278
OsBb
251 80,5% 277 3.158E-134
2 phalp2_5780
7M9yJ
4 75,6% 189 1.816E-81
3 phalp2_12156
6iQWP
264 40,9% 227 1.635E-51
4 phalp2_22092
5WvnN
1 34,6% 182 8.642E-44
5 phalp2_39451
7Q7F4
6 29,0% 196 7.757E-20
6 phalp2_22096
5ZSom
2292 28,5% 294 1.922E-19
7 phalp2_33310
6kV0o
68 29,2% 294 1.176E-18
8 phalp2_26548
858CP
728 27,7% 270 2.905E-18
9 phalp2_6828
8fwke
2 25,9% 289 2.387E-17
10 phalp2_8064
5PT62
85 29,3% 191 5.875E-17

Domains

Domains [InterPro]
GH25
CW_7
Disordered region
Representative sequence (used for alignment): 1mUYf (277 AA)
Member sequence: P19385 (342 AA)
1 277 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183, PF08230

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptococcus phage Cp-7 (Bacteriophage Cp-7)
[NCBI]
10748 Salasmaviridae > Cepunavirus > Cepunavirus Cp7
Host Streptococcus pneumoniae
[NCBI]
1313 Firmicutes > Bacilli > Lactobacillales > Streptococcaceae > Streptococcus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
M34779 [NCBI]
CDS location
range 286 -> 1314
strand +
CDS
ATGGTTAAGAAAAATGATTTATTTGTAGACGTTGCAAGCCATCAAGGCTACGACATTTCAGGAATTTTAGAAGAAGCAGGGACAACAAACACAATTATTAAAGTGTCAGAAAGTACAAGCTATTTAAACCCTTGCTTGTCTGCTCAAGTGAGCCAGTCAAATCCTATCGGGTTTTATCATTTTGCTTGGTTTGGTGGAAATGAAGAAGAAGCAGAAGCAGAAGCACGCTATTTCCTTGATAACGTGCCTACACAAGTTAAATACCTTGTACTAGATTATGAAGACCATGCAAGCGCAAGCGTACAAAGAAACACTACCGCGTGCTTACGCTTTATGCAAATTATCGCAGAAGCTGGATATACACCTATTTATTATAGTTACAAACCGTTTACGCTTGATAATGTGGACTATCAGCAGATTTTAGCACAGTTCCCTAATTCTCTATGGATTGCAGGCTATGGCTTAAATGATGGTACAGCTAACTTTGAATACTTTCCAAGCATGGACGGTATCAGATGGTGGCAATATTCTAGTAACCCGTTTGACAAGAATATTGTACTGTTAGATGATGAGAAAGAAGATAATATAAACAATGAAAACACTCTAAAAAGCCTTACCACAGTAGCCAACGAGGTCATTCAGGGACTTTGGGGCAACGGTCAAGAACGTTATGACAGTTTAGCGAATGCAGGGTATGACCCCCAAGCGGTTCAAGACAAAGTGAATGAAATCTTAAACGCTAGAGAAATTGCAGACCTTACCACAGTAGCCAACGAGGTCATTCAGGGACTTTGGGGCAACGGTCAAGAACGTTATGACAGTTTAGCGAATGCAGGGTATGACCCCCAAGCGGTTCAAGACAAAGTGAATGAAATCTTAAACGCTAGAGAAATTGCAGACCTTACCACAGTAGCCAACGAGGTCATTCAGGGACTTTGGGGCAACGGTCAAGAACGTTATGACAGTTTAGCGAATGCAGGGTATGACCCCCAAGCGGTTCAAGACAAAGTGAATGAATTACTTTCATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. N/A UniProt

Tertiary structure

PDB ID
4CVD
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
5I8L
Method PDB
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
A0A068YG91
Method SMR
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
P19385
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi000257807b_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1mUYf) rather than this protein.
PDB ID
1mUYf
Method AlphaFoldv2
Resolution 89.34
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50