Protein

Protein accession
A0A8S5PUF7 [UniProt]
Representative
7h79q
Source
UniProt (cluster: phalp2_379)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MKEYGLDLSKHNGTLDFNEIKKEGNSFVILRAGYGKLISQKDIKFEEYYKKAKSVGLKVGTYWYSYATTPSEAITEAKCFYDVIKGKQFEYPVFFDIENKDHHKGASKNDLVKMCKNFCDFMESKGYYAGVYANTDWFTNYISELPRGRYDEWEANYGSNDGTLQNVKLRRAPRIHQFTSNYKIGVKRFDRNISYYDYASVIKDKGLNGFSKQQTSNNEEISKKDENVSTVLKHKVGDVVSINGVYSSSDSAEKLNPYTKKGKITKIIASARNPYLLEDGNIGWVNDSCIVTQSSNALKVGSRVKIKSSAKNYATGEKIPDWVKKQTHTIQKISDEKVLLKEIASWVYKKDVE
Physico‐chemical
properties
protein length:353 AA
molecular weight:40322,1 Da
isoelectric point:9,19
hydropathy:-0,71
Representative Protein Details
Accession
7h79q
Protein name
7h79q
Sequence length
348 AA
Molecular weight
39908,03640 Da
Isoelectric point
9,32466
Sequence
MSLLYGLDISKYQGDFDFNNAPLKDFVIIRCGYTGQTSNKCYKDEKFDRNLREIKKTNIPYGFYYYSCAKNESDAKREADFIHSLLKNEKFELPLFIDYEYSKVINSLGTKKCNEVIESLGKKLESYGYFVGLYSYEYFLKNATDRFWLWIASYGTNDGTVQKDFNNGKYKMYQFTSQYKGKYDANVLYDASVMKTIKDKGLNGFSKQQTVDKVETPKKEENTSTSLKYKVGDVVSINGVYSSSDSNEKLNPYTKKGKITKIIANARNPYLLENGNIGWVNDSCIVIQSQSTLKVGSKVKIKSSAKKYATGQIIPDWVKKKTHTIKKINGGKVLLKEIVSWVYKKDVE
Other Proteins in cluster: phalp2_379
Total (incl. this protein): 2 Avg length: 350,5 Avg pI: 9,26

Protein ID Length (AA) pI
7h79q 348 9,32466
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10110
c1k0
16 34,1% 290 6.011E-52
2 phalp2_529
o3Q8
73 30,7% 315 1.749E-41
3 phalp2_34315
3GJC6
151 30,4% 296 6.968E-38
4 phalp2_10747
3nEcV
19 26,1% 371 1.073E-34
5 phalp2_31512
3WNwM
280 33,1% 226 3.637E-34
6 phalp2_32643
239ef
4 33,8% 251 9.078E-34
7 phalp2_29740
1rkWk
100 30,4% 266 9.890E-31
8 phalp2_15044
7hFHn
33 29,7% 232 1.897E-24
9 phalp2_15589
2brBA
13 29,2% 229 4.654E-24
10 phalp2_30592
67VKs
21 28,5% 270 6.275E-24

Domains

Domains [InterPro]
GH25
Unannotated
Unannotated
Representative sequence (used for alignment): 7h79q (348 AA)
Member sequence: A0A8S5PUF7 (353 AA)
1 348 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Siphoviridae sp. ctGuJ10
[NCBI]
2825418 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
BK015503 [NCBI]
CDS location
range 28583 -> 29644
strand +
CDS
ATGAAAGAGTATGGTTTAGACTTAAGCAAACATAATGGAACATTAGATTTTAATGAAATAAAAAAAGAAGGAAATAGTTTTGTTATTCTACGTGCTGGATATGGCAAGTTAATTTCACAGAAAGACATCAAGTTTGAAGAATATTATAAGAAAGCAAAAAGTGTTGGATTAAAAGTTGGTACTTATTGGTATTCATATGCAACAACACCAAGCGAAGCAATTACAGAAGCAAAATGTTTTTATGATGTAATTAAAGGCAAACAATTTGAATATCCAGTTTTCTTCGACATTGAAAACAAAGATCATCATAAAGGTGCATCTAAAAATGATTTAGTTAAAATGTGCAAAAACTTTTGTGATTTTATGGAAAGCAAAGGTTATTACGCTGGTGTCTATGCAAATACAGATTGGTTCACAAATTATATTTCAGAACTTCCAAGAGGACGATATGATGAGTGGGAAGCGAATTACGGTTCTAATGATGGTACATTACAAAATGTCAAATTAAGAAGGGCACCACGTATTCATCAGTTTACATCTAATTATAAAATTGGTGTTAAACGTTTCGATAGAAATATTTCTTATTATGATTATGCTTCTGTAATTAAAGACAAGGGGCTAAATGGCTTCTCTAAACAACAAACGTCCAACAATGAGGAAATATCTAAAAAAGATGAAAATGTTTCTACAGTTTTAAAACATAAAGTAGGTGATGTGGTATCTATCAATGGTGTTTATTCTTCATCTGATAGTGCAGAAAAACTCAATCCTTATACTAAAAAAGGAAAGATTACTAAAATTATTGCCAGTGCTAGAAACCCATATTTGTTAGAAGATGGTAATATTGGGTGGGTTAATGATTCTTGCATTGTAACTCAATCGTCAAATGCCTTAAAAGTTGGATCAAGGGTAAAGATTAAATCTAGTGCTAAGAATTATGCTACTGGTGAAAAAATTCCTGATTGGGTAAAGAAACAAACTCATACTATTCAAAAAATTAGTGATGAAAAAGTTTTGTTAAAGGAAATTGCGTCATGGGTATACAAAAAAGATGTAGAATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7h79q) rather than this protein.
PDB ID
7h79q
Method AlphaFoldv2
Resolution 90.36
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50