Protein

Protein accession
A0A1D8ETQ2 [UniProt]
Representative
7Q7F4
Source
UniProt (cluster: phalp2_39451)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MDRGIDVSSYQAPDGGLLEGMSFCIIKATEGTGYVNPNRTQWYKVATDRGVTTGAYHFMRGGDPVAQADYFLAHLPGDVDWLALDVEDPSDGLSWAGRVSFILAWTDRVKATGKPTLVYCSRSWANALLGAASSDERARLAQLPLWMADYTQRPGEYSGPWPGGWPIAIHQYTSAPIDKNVLLTDDLQGGDDMPLSTDDLRNIATAVWGGGGAASPMYADRLDGHQEYPETALFSLQQRLGARFGELSGKIDGLAEALKQVQGGNGVDADAIRAAAAEGAKAGVQAMIDTATVNLEAK
Physico‐chemical
properties
protein length:298 AA
molecular weight:31828,1 Da
isoelectric point:4,62
hydropathy:-0,26
Representative Protein Details
Accession
7Q7F4
Protein name
7Q7F4
Sequence length
278 AA
Molecular weight
29894,02190 Da
Isoelectric point
4,44728
Sequence
MDRGIDVSAYQAPDGGLLEGMSFCIIKATEGTGYTNPNRAQWYKVATDRGVTTGAYHFMRGGDPVQQADYFLDNLPGDVGWLALDVEADDDGLEWSGRVGFILAWTDRVKTASAKPTLVYCSRSWANALWEAATSDQRTRLTALPLWLADYTGTPGTYSGPVPDGWPVTMHQFTSTPIDTNALFADILQGGDTMTPELAARLDQIVAWQDANFKALAAQADANRDTLAQWTVDRLKETAASIPNVDTDKIAAQLAGAKLVVQLPANAQLQPAPAGGQA
Other Proteins in cluster: phalp2_39451
Total (incl. this protein): 6 Avg length: 291,3 Avg pI: 4,57

Protein ID Length (AA) pI
7Q7F4 278 4,44728
7AcPG 298 4,62410
7Aem3 298 4,62660
A0A173G9C4 278 4,44728
A0A1D8EU28 298 4,62660
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_27394
4HDMm
9 33,8% 204 5.857E-20
2 phalp2_34559
4UNXW
287 37,7% 180 1.072E-19
3 phalp2_20117
23DDy
70 34,8% 192 2.193E-18
4 phalp2_33827
1mUYf
73 35,3% 198 5.412E-18
5 phalp2_2609
6KgoW
1 31,7% 290 5.987E-17
6 phalp2_5615
4g2xP
40 34,9% 183 8.081E-17
7 phalp2_37341
8v6XY
562 32,7% 183 1.985E-16
8 phalp2_38994
84ePK
348 32,9% 182 3.613E-16
9 phalp2_30706
7cHI2
66 32,4% 188 3.613E-16
10 phalp2_39912
1jXPT
300 32,7% 183 6.570E-16

Domains

Domains [InterPro]
Representative sequence (used for alignment): 7Q7F4 (278 AA)
Member sequence: A0A1D8ETQ2 (298 AA)
1 278 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Propionibacterium phage B22
[NCBI]
1897532 Doucettevirus > Doucettevirus B22
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KX620750 [NCBI]
CDS location
range 18336 -> 19232
strand +
CDS
ATGGATCGTGGTATTGACGTGTCGTCCTATCAGGCCCCCGATGGTGGCCTGCTGGAGGGCATGTCGTTTTGCATCATCAAGGCCACCGAAGGCACCGGCTATGTGAACCCGAACCGTACCCAGTGGTACAAGGTCGCCACGGATCGTGGGGTGACCACCGGCGCCTATCACTTCATGCGTGGCGGCGATCCCGTGGCGCAGGCCGATTACTTCCTCGCGCATCTGCCCGGTGATGTCGATTGGCTCGCCCTTGATGTCGAGGATCCCAGCGACGGCCTGTCATGGGCCGGGCGCGTCAGCTTCATCCTCGCCTGGACCGATCGGGTGAAGGCGACCGGCAAGCCCACCTTGGTCTATTGCTCGCGCTCCTGGGCGAATGCCCTGTTGGGTGCGGCCAGCAGCGACGAGCGGGCACGGCTCGCGCAATTGCCGCTGTGGATGGCTGATTACACCCAACGCCCCGGCGAATATTCGGGGCCGTGGCCGGGCGGCTGGCCGATTGCCATACACCAGTACACCTCGGCCCCGATCGACAAGAACGTCCTGCTCACCGACGATCTCCAAGGAGGAGACGACATGCCCCTGTCAACCGACGACCTTCGAAATATCGCCACTGCCGTGTGGGGTGGCGGCGGCGCTGCAAGCCCAATGTATGCCGACCGGCTCGATGGGCATCAAGAATATCCGGAGACAGCCCTTTTCAGCCTGCAACAGCGGCTGGGTGCCCGCTTCGGTGAGCTGTCGGGCAAGATCGACGGTCTGGCCGAGGCGCTCAAGCAGGTGCAGGGAGGGAACGGCGTGGACGCGGACGCCATTAGGGCTGCTGCTGCCGAAGGAGCCAAGGCTGGCGTGCAGGCGATGATCGACACCGCCACGGTGAATCTGGAGGCCAAGTGA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7Q7F4) rather than this protein.
PDB ID
7Q7F4
Method AlphaFoldv2
Resolution 77.84
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50