Protein

Protein accession
A0A8S5P376 [UniProt]
Representative
2VksN
Source
UniProt (cluster: phalp2_5443)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKEYGLDLSKHNGTIDFDEIKKNNSFVILRAGYGTSTKDPKFEEYYKQAKAAGLKVGAYWYSYALNVNDAKTEAKKCLEVIKGKQFEYPIYIDMEDADGYKKRNGMPSNSTLVGICNTFCSIVESNGYYVGIYASESWFNNQLKGLNKGRYDAWVANWGTNDGTLQKDKSSVCRLHQFTSNYKIGNKRFDRNISYLDYASVIKENGLNGFGIHDAVKPANPLDALTDEQVADEIIAGKGGWGDGDTRKKKLSARYDSIQSIVNRKLSKDKKPTVSYYKAFNSTSIVDGLKSIGVNSSLEYRKKIAKANGIKNYKGDASDNIKLLKLAKQGKLKKA
Physico‐chemical
properties
protein length:335 AA
molecular weight:37446,9 Da
isoelectric point:9,39
hydropathy:-0,67
Representative Protein Details
Accession
2VksN
Protein name
2VksN
Sequence length
237 AA
Molecular weight
27177,93640 Da
Isoelectric point
4,73215
Sequence
NADGIDLSAYDFAILRSNYGEIEDEKIEYWVKACEKAGIPYGLYCYDYAIDDDQAEAEAKYTIEMAQKYKPELGVWIDMEDEDKYKEKRGAWTKERALRTCELFCKCVQSAGFYAGIYSSSWVFKNWLNSSTLNQYDKWVAQWDTNDGDYHSDTSSMGTIHQYTSVDKHSGIGLDKNAMYVDFDHYKSKIVEKPQNEPENVQKDESKESDIKELNGLIGILISILKKLLSVFGKSGD
Other Proteins in cluster: phalp2_5443
Total (incl. this protein): 6 Avg length: 252,2 Avg pI: 5,65

Protein ID Length (AA) pI
2VksN 237 4,73215
3msB7 195 5,07711
612UF 304 4,88073
6h2uw 225 4,90335
6nXBN 217 4,88920
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_38437
1cCmC
101 42,2% 201 2.389E-60
2 phalp2_3425
3TK4D
923 35,9% 181 2.765E-41
3 phalp2_6199
5X5xO
102 31,4% 207 2.765E-41
4 phalp2_529
o3Q8
73 31,2% 208 6.298E-40
5 phalp2_31512
3WNwM
280 29,8% 208 1.374E-35
6 phalp2_13529
7Jyqc
660 35,4% 192 1.218E-34
7 phalp2_33310
6kV0o
68 34,7% 187 1.663E-34
8 phalp2_29740
1rkWk
100 32,4% 197 5.117E-33
9 phalp2_22740
8dpUt
356 32,5% 172 6.170E-32
10 phalp2_34771
3k1Sy
11 31,2% 205 1.569E-31

Domains

Domains [InterPro]
GH25
Disordered region
Representative sequence (used for alignment): 2VksN (237 AA)
Member sequence: A0A8S5P376 (335 AA)
1 237 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Siphoviridae sp. ctQtc11
[NCBI]
2825497 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
BK015325 [NCBI]
CDS location
range 34455 -> 35462
strand +
CDS
ATGAAAGAATATGGTTTGGATTTAAGTAAACATAATGGCACGATTGATTTTGATGAAATTAAAAAGAACAACAGTTTTGTTATTCTTCGTGCTGGATATGGGACAAGTACAAAAGACCCAAAGTTTGAAGAATATTATAAGCAAGCAAAAGCAGCAGGATTGAAAGTTGGTGCTTATTGGTATTCATATGCTTTAAATGTGAATGATGCTAAAACAGAGGCAAAGAAATGTTTAGAAGTTATTAAAGGCAAACAGTTTGAATATCCAATCTACATTGACATGGAAGATGCAGATGGCTATAAAAAAAGAAATGGAATGCCTAGTAATAGTACGCTTGTAGGTATATGCAATACATTTTGTAGTATTGTGGAATCTAATGGTTACTATGTTGGTATCTATGCTAGCGAAAGTTGGTTTAATAATCAATTGAAAGGATTAAATAAAGGTCGATATGATGCATGGGTAGCAAATTGGGGTACAAATGATGGTACATTACAAAAAGATAAATCTAGCGTATGTAGATTACATCAATTTACATCTAATTATAAAATTGGTAATAAAAGATTTGACAGAAATATCTCTTATTTAGATTATGCATCTGTTATTAAAGAAAATGGTTTAAATGGGTTTGGTATACATGATGCAGTTAAACCTGCTAATCCTCTAGATGCATTAACAGACGAACAAGTCGCTGATGAAATTATCGCTGGTAAAGGTGGTTGGGGTGATGGAGATACTCGTAAAAAGAAATTAAGTGCACGATATGATTCAATCCAATCAATTGTTAATAGAAAATTAAGTAAAGATAAAAAGCCTACAGTATCTTATTATAAGGCTTTCAATAGCACTTCAATTGTAGATGGTTTAAAGTCTATTGGTGTTAATTCATCTTTGGAATATAGAAAGAAAATTGCTAAAGCAAATGGCATTAAAAACTATAAAGGAGATGCATCAGACAACATCAAACTTCTAAAACTAGCAAAACAAGGGAAATTAAAGAAAGCATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2VksN) rather than this protein.
PDB ID
2VksN
Method AlphaFoldv2
Resolution 89.02
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50