Protein

Protein accession
A0A6J7XEF1 [UniProt]
Representative
7XMsp
Source
UniProt (cluster: phalp2_36673)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTQRLDGIDVSRYQGTVDWVQVKASCAVPSKSLAAWKVTQGTRTLDPAAGTNRKNTATAGIYYRFGYHWLSASTNALSQAKWFLQNFAPLQPGEGVILDCEEAGITVPQITTWCAHVEAVTGRPVAIYTGVHVAGGAIWNSQLIFGSGQRMRWLAAYVSEAKARKACEPYGFDVWQWSSSGSVPGVAGRCDVNMIDHPTLCDLATGQTPPKPPVPPTEIEDLDMPVCTNAEVFGGFAPLIVKWVVMSNGSLRHIGEPEWHARGSEVGTRLSNADIKALGGSL
Physico‐chemical
properties
protein length:282 AA
molecular weight:30387,2 Da
isoelectric point:6,59
hydropathy:-0,11
Representative Protein Details
Accession
7XMsp
Protein name
7XMsp
Sequence length
304 AA
Molecular weight
33488,24380 Da
Isoelectric point
9,68362
Sequence
MTSRLDGIDVSRYQGKIDWPAVAEAGIFWAATKATQGTSYVDPTLTANRAGMANADIRYRQLYHWLNPAIGAARLVPALRKRDAYAQADFFLKTVGKLSPGEGVLLDAEQTGIRSDQVAWWCERVEATTGLPVEVYTGVYVAGGEIWKSATVFNGKRARVLAAYISEERMLKASAPWRPDCWQFTSTATVPGVPTPVDGDMIYNRARFDAACGLVKPPVKPVTVAKPVERRRVMLKGTLNVALLQQQLKQLGLYAAKVDGKFWKVTEAGVRMFQAQHRLSVTGVYDKFTADKMSAVLTEKAAKK
Other Proteins in cluster: phalp2_36673
Total (incl. this protein): 15 Avg length: 286,0 Avg pI: 6,38

Protein ID Length (AA) pI
7XMsp 304 9,68362
1Eekl 235 4,53072
1NVCq 318 7,08642
1owF7 217 5,61230
25hLj 297 5,96516
2cMY7 294 9,33884
2j6Xp 282 5,22688
48T7c 295 6,43698
4GbbR 267 6,29238
4euxC 294 4,85180
8tar0 301 6,74914
TRPl 307 6,58721
iHWt 308 5,19721
A0A6J5RFC4 289 5,51511
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12578
1jpTj
2 35,2% 244 3.970E-53
2 phalp2_26465
1XOf0
5 29,8% 315 2.210E-41
3 phalp2_39912
1jXPT
300 36,1% 224 1.078E-38
4 phalp2_5981
6El6s
21 27,6% 300 3.710E-38
5 phalp2_35476
lHwL
1 35,2% 244 3.869E-34
6 phalp2_10644
2AeOK
60 34,1% 217 6.163E-33
7 phalp2_38136
6S3U0
1 35,7% 210 8.381E-33
8 phalp2_5615
4g2xP
40 32,6% 208 6.160E-31
9 phalp2_36157
6TiaL
25 28,8% 208 2.064E-28
10 phalp2_34359
474YH
38 25,1% 302 8.024E-27

Domains

Domains [InterPro]
Representative sequence (used for alignment): 7XMsp (304 AA)
Member sequence: A0A6J7XEF1 (282 AA)
1 304 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183, PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured Caudovirales phage
[NCBI]
2100421 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
LR797265 [NCBI]
CDS location
range 17507 -> 18355
strand +
CDS
ATGACGCAGCGACTTGACGGCATTGACGTTTCCCGCTACCAAGGAACGGTTGATTGGGTTCAAGTCAAAGCAAGTTGTGCCGTGCCAAGTAAATCTCTTGCCGCCTGGAAAGTTACGCAAGGCACGCGCACTCTTGACCCTGCCGCTGGCACCAATCGAAAAAATACTGCAACTGCCGGTATTTATTACCGCTTTGGATACCATTGGCTTTCTGCCAGTACCAACGCACTTTCGCAAGCCAAATGGTTTTTGCAAAACTTTGCGCCGCTTCAACCTGGCGAAGGTGTAATTCTTGACTGCGAAGAAGCAGGAATTACTGTTCCTCAAATTACTACTTGGTGCGCGCATGTTGAAGCTGTAACAGGACGACCAGTTGCAATTTACACCGGCGTGCACGTTGCTGGCGGGGCTATCTGGAATAGCCAATTAATTTTTGGGTCCGGACAAAGAATGCGGTGGCTAGCCGCTTACGTTTCAGAGGCCAAGGCAAGAAAAGCTTGCGAGCCGTATGGCTTTGATGTCTGGCAATGGTCGTCATCCGGCTCTGTACCTGGTGTGGCCGGCCGATGCGACGTTAACATGATTGACCACCCAACTTTATGTGATCTTGCAACCGGGCAAACACCGCCCAAACCACCAGTTCCTCCTACTGAAATTGAGGATTTAGACATGCCAGTTTGCACAAACGCAGAAGTCTTTGGAGGCTTTGCACCGCTGATTGTTAAATGGGTTGTAATGTCCAATGGTTCTCTTCGGCATATTGGTGAGCCAGAGTGGCATGCTCGAGGCTCAGAAGTTGGCACTCGACTTTCAAATGCTGACATCAAAGCTCTTGGTGGAAGTCTTTAG

CDS Source ID
CDS Source
LR797368 [NCBI]
CDS location
range 21548 -> 22396
strand +
CDS
ATGACGCAGCGACTTGACGGCATTGACGTTTCCCGCTACCAAGGAACGGTTGATTGGGTTCAAGTCAAAGCAAGTTGTGCCGTGCCAAGTAAATCTCTTGCCGCCTGGAAAGTTACGCAAGGCACGCGCACTCTTGACCCTGCCGCTGGCACCAATCGAAAAAATACTGCAACTGCCGGTATTTATTACCGCTTTGGATACCATTGGCTTTCTGCCAGTACCAACGCACTTTCGCAAGCCAAATGGTTTTTGCAAAACTTTGCGCCGCTTCAACCTGGCGAAGGTGTAATTCTTGACTGCGAAGAAGCAGGAATTACTGTTCCTCAAATTACTACTTGGTGCGCGCATGTTGAAGCTGTAACAGGACGACCAGTTGCAATTTACACCGGCGTGCACGTTGCTGGCGGGGCTATCTGGAATAGCCAATTAATTTTTGGGTCCGGACAAAGAATGCGGTGGCTAGCCGCTTACGTTTCAGAGGCCAAGGCAAGAAAAGCTTGCGAGCCGTATGGCTTTGATGTCTGGCAATGGTCGTCATCCGGCTCTGTACCTGGTGTGGCCGGCCGATGCGACGTTAACATGATTGACCACCCAACTTTATGTGATCTTGCAACCGGGCAAACACCGCCCAAACCACCAGTTCCTCCTACTGAAATTGAGGATTTAGACATGCCAGTTTGCACAAACGCAGAAGTCTTTGGAGGCTTTGCACCGCTGATTGTTAAATGGGTTGTAATGTCCAATGGTTCTCTTCGGCATATTGGTGAGCCAGAGTGGCATGCTCGAGGCTCAGAAGTTGGCACTCGACTTTCAAATGCTGACATCAAAGCTCTTGGTGGAAGTCTTTAG

CDS Source ID
CDS Source
LR798370 [NCBI]
CDS location
range 29583 -> 30431
strand -
CDS
ATGACGCAGCGACTTGACGGCATTGACGTTTCCCGCTACCAAGGAACGGTTGATTGGGTTCAAGTCAAAGCAAGTTGTGCCGTGCCAAGTAAATCTCTTGCCGCCTGGAAAGTTACGCAAGGCACGCGCACTCTTGACCCTGCCGCTGGCACCAATCGAAAAAATACTGCAACTGCCGGTATTTATTACCGCTTTGGATACCATTGGCTTTCTGCCAGTACCAACGCACTTTCGCAAGCCAAATGGTTTTTGCAAAACTTTGCGCCGCTTCAACCTGGCGAAGGTGTAATTCTTGACTGCGAAGAAGCAGGAATTACTGTTCCTCAAATTACTACTTGGTGCGCGCATGTTGAAGCTGTAACAGGACGACCAGTTGCAATTTACACCGGCGTGCACGTTGCTGGCGGGGCTATCTGGAATAGCCAATTAATTTTTGGGTCCGGACAAAGAATGCGGTGGCTAGCCGCTTACGTTTCAGAGGCCAAGGCAAGAAAAGCTTGCGAGCCGTATGGCTTTGATGTCTGGCAATGGTCGTCATCCGGCTCTGTACCTGGTGTGGCCGGCCGATGCGACGTTAACATGATTGACCACCCAACTTTATGTGATCTTGCAACCGGGCAAACACCGCCCAAACCACCAGTTCCTCCTACTGAAATTGAGGATTTAGACATGCCAGTTTGCACAAACGCAGAAGTCTTTGGAGGCTTTGCACCGCTGATTGTTAAATGGGTTGTAATGTCCAATGGTTCTCTTCGGCATATTGGTGAGCCAGAGTGGCATGCTCGAGGCTCAGAAGTTGGCACTCGACTTTCAAATGCTGACATCAAAGCTCTTGGTGGAAGTCTTTAG

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7XMsp) rather than this protein.
PDB ID
7XMsp
Method AlphaFoldv2
Resolution 88.09
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50