Protein

Protein accession
A0A8S5P9Z7 [UniProt]
Representative
5Lgvv
Source
UniProt (cluster: phalp2_23382)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKKIVDVSSYNGTVSWAKVKKYGCSGAILKIIRKDLERDKGFNRNYQACNENHINWGVYNYTYATTPKKARADMNLVCDILDKIDKSYFKYGVWFDLEDKVQAKLTKSQIADIVNAAQEIVEKRGYVFGVYTGKAYFEEHIDRKKVLCQNWWIARYYRGDARMQIAVNPNEKYKPTTANIAWQYTSKGRFPKIVSTGNSGNFDFNIMYREPFEKEGVKKEEKTEEKPKKVYTGSLPALKTKAYFARPDGIKTSKKRKDVKKIQNFLNWAIDAGLTVDGKYGAKTETAVAVFQELCGIRIDGKFGNITLAKAKTFKK
Physico‐chemical
properties
protein length:316 AA
molecular weight:36268,3 Da
isoelectric point:9,64
hydropathy:-0,64
Representative Protein Details
Accession
5Lgvv
Protein name
5Lgvv
Sequence length
128 AA
Molecular weight
14625,60690 Da
Isoelectric point
8,89820
Sequence
MRKLIDVSSYNGTVNWEKAKAYGCQGAILKIIRKDLKIDNGFNRNYQACNENELAWGVYNYSYAATATKAKSDMKLVCDILDKIDKTHFVYGVWFDIEDKVQASLNKTKIAEIINAAQQVVEKRGYLF
Other Proteins in cluster: phalp2_23382
Total (incl. this protein): 10 Avg length: 171,3 Avg pI: 8,72

Protein ID Length (AA) pI
5Lgvv 128 8,89820
1nur2 135 7,62497
5Pew7 128 7,62958
6b1U7 144 9,05222
6cIc0 125 9,28333
6g7PW 165 9,18560
6taHi 128 7,62958
7MQkB 129 8,73013
A0A8S5SNY0 315 9,55668
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_15198
D6Gl
21 46,4% 84 6.709E-27
2 phalp2_4390
2mvmf
118 32,2% 124 1.261E-26
3 phalp2_33300
6aZoh
2 34,8% 129 7.637E-25
4 phalp2_3380
3qQTc
8 31,3% 102 1.485E-21
5 phalp2_24879
a1g4
16 31,5% 133 3.825E-21
6 phalp2_25924
66tGM
6 34,3% 102 2.306E-19
7 phalp2_10753
3yWC4
1 25,3% 130 1.901E-17
8 phalp2_34287
3mtX2
6 34,3% 96 9.181E-17
9 phalp2_11218
6q99m
1 29,1% 127 3.235E-16
10 phalp2_31214
88VlE
13 33,3% 84 2.930E-15

Domains

Domains [InterPro]
Representative sequence (used for alignment): 5Lgvv (128 AA)
Member sequence: A0A8S5P9Z7 (316 AA)
1 128 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Siphoviridae sp. ct2QJ10
[NCBI]
2825315 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
BK015358 [NCBI]
CDS location
range 12910 -> 13860
strand -
CDS
ATGAAAAAAATTGTTGACGTATCTTCGTACAATGGGACTGTAAGCTGGGCGAAAGTAAAAAAATATGGATGTTCTGGAGCAATCTTAAAGATCATAAGAAAAGATTTAGAGAGGGATAAGGGATTCAATAGAAATTATCAGGCTTGTAACGAGAATCATATTAACTGGGGTGTGTACAATTATACATATGCTACTACCCCGAAAAAGGCAAGGGCAGATATGAATCTTGTTTGCGATATTCTTGACAAAATCGACAAGTCATATTTTAAATATGGCGTTTGGTTTGACCTTGAAGATAAGGTGCAGGCTAAGCTGACGAAATCCCAGATTGCGGATATTGTTAACGCCGCACAGGAGATTGTCGAAAAACGAGGTTATGTTTTCGGAGTATATACAGGAAAAGCGTATTTTGAGGAGCACATCGACCGTAAAAAAGTGCTTTGCCAGAACTGGTGGATTGCCCGTTATTATCGTGGGGATGCTCGTATGCAGATTGCTGTGAATCCTAACGAAAAGTATAAACCTACCACAGCTAATATTGCGTGGCAGTATACGTCCAAAGGGCGTTTCCCTAAGATTGTATCAACGGGAAATTCCGGAAATTTTGATTTTAATATCATGTATAGAGAACCTTTTGAAAAAGAAGGAGTTAAGAAAGAAGAAAAAACAGAAGAAAAGCCAAAGAAAGTTTATACAGGAAGTCTCCCTGCATTAAAAACTAAAGCGTATTTTGCAAGACCAGATGGAATTAAAACATCCAAAAAGAGAAAGGATGTTAAAAAAATTCAGAATTTCTTAAATTGGGCGATAGATGCAGGGTTAACTGTAGATGGAAAATATGGTGCAAAAACAGAAACAGCCGTTGCAGTATTCCAGGAATTGTGTGGAATCAGAATAGATGGCAAGTTTGGAAACATCACATTGGCAAAAGCAAAAACATTCAAGAAATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5Lgvv) rather than this protein.
PDB ID
5Lgvv
Method AlphaFoldv2
Resolution 97.28
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50