Protein

Protein accession
D0R7H8 [UniProt]
Representative
5i2hu
Source
UniProt (cluster: phalp2_9040)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MQQRNSRNAQGIDVSRYQGKIDWKAVKASGISFAFIKASQGKLYRDKTFIGNAQAARAVGVLVGAYHYLDDSAKTPEDARKEAANFVSAINSAGGIAAFDLPPVMDYESNKSGLSKAALTAVARTFLAEVERLTGVRPIVYTYPSFIGNFSGLSDYPLWIARYSATQVPPGASGWSRWDFWQYSDGSAGGTLPSGTRKVAGIAGPVDLNEFDGTADELRTRFRKKTSVPKEEPATVADGSFQINGENVGKALLFDGKTHVPLRVLADALGIPLRWDNAKKVAYLNNRKLQSVQLVEGVAYIQLRPIAESYGAEVSWDSKNRIASLKTKGEK
Physico-chemical properties
protein length: 331 AA
molecular weight: 35949,1 Da
isoelectric point: 9,52
hydropathy: -0,32
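The properties above can be approximated directly from the protein sequence. The following is a minimal sketch, assuming that "hydropathy" is the mean Kyte-Doolittle value (GRAVY) and that the molecular weight is the sum of average residue masses plus one water; the page does not state which method it uses, so the function names and both scales are assumptions, and the results may differ slightly from the listed values.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the page does not name one).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (amino acid minus one water molecule).
MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "Q": 128.1307, "E": 129.1155, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini

# Protein sequence of D0R7H8 as listed on this page.
SEQ = "MQQRNSRNAQGIDVSRYQGKIDWKAVKASGISFAFIKASQGKLYRDKTFIGNAQAARAVGVLVGAYHYLDDSAKTPEDARKEAANFVSAINSAGGIAAFDLPPVMDYESNKSGLSKAALTAVARTFLAEVERLTGVRPIVYTYPSFIGNFSGLSDYPLWIARYSATQVPPGASGWSRWDFWQYSDGSAGGTLPSGTRKVAGIAGPVDLNEFDGTADELRTRFRKKTSVPKEEPATVADGSFQINGENVGKALLFDGKTHVPLRVLADALGIPLRWDNAKKVAYLNNRKLQSVQLVEGVAYIQLRPIAESYGAEVSWDSKNRIASLKTKGEK"


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Approximate average molecular weight in Da."""
    return sum(MASS[aa] for aa in seq) + WATER


if __name__ == "__main__":
    print("length (AA):", len(SEQ))
    print("molecular weight (Da):", round(molecular_weight(SEQ), 1))
    print("hydropathy (GRAVY):", round(gravy(SEQ), 2))
```

The isoelectric point is left out because it depends on the chosen pKa set, which the page does not specify.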
Representative Protein Details
Accession
5i2hu
Protein name
5i2hu
Sequence length
495 AA
Molecular weight
54165,15150 Da
Isoelectric point
8,73310
Sequence
MQERNKANAQGIDVSHHNGMVEWSKVAADQIRFAFVKSSEGQNMKDNRAEENVRGAKSSGLLVGLYHYLVAKNSSDAKQEAQNMATEYNRLGGKDYFELPPVLDYEDNRHNLGPAAITTIARAFLLEVERLTGARPILYTSQSFAEKLSGLSGEYDIWVARYSLNRPEDIGKWSRWRFWQYSDGQKGGYLPGGTRKVSGVSGYVDLNEYDGTYEDLRERYGKGGVTVSNPFEGFRLTSPFGMRKHPITGKQKFHRGVDLVTTPGNGPLYAFAGGTVRHAKDGAPGSGFGNYGITVAIEDNAGRLHVYAHLSAATVKVGQVVAKGDQIGNQGNTGASAGNHLHYEVRKVASPSFGYTATEAGVLEPTQYLKDYYAANLPKTEVDELSATEKQELINLRKENETLRKDVDALTNSKDVLKKGMQEQGNLLKKLVERVDKLESTDVPAWAKEAVDAFANTLAIDGSSPVITDKKVGPVVAKMLVILHRLGLATSKGGK
Other Proteins in cluster: phalp2_9040
Total (incl. this protein): 2; Avg length: 413,0; Avg pI: 9,13

Protein ID Length (AA) pI
5i2hu 495 8,73310
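The cluster averages can be reproduced from the two members: this protein (331 AA, pI 9,52) and the representative 5i2hu (495 AA, pI 8,73310). The sketch below assumes the averages are plain arithmetic means and that the page's decimal commas should be read as decimal points; `parse_decimal_comma` is a hypothetical helper, not part of any database API.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma (e.g. '9,52') to float."""
    return float(text.replace(",", "."))


# (length in AA, pI as printed on the page) for each cluster member.
members = {
    "D0R7H8": (331, "9,52"),
    "5i2hu": (495, "8,73310"),
}

lengths = [length for length, _ in members.values()]
pis = [parse_decimal_comma(pi) for _, pi in members.values()]

avg_length = sum(lengths) / len(lengths)
avg_pi = sum(pis) / len(pis)

print(f"Avg length: {avg_length:.1f}")  # Avg length: 413.0
print(f"Avg pI: {avg_pi:.2f}")          # Avg pI: 9.13
```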
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_38365 (InEX)   1          30,7%         358               2.861E-29
2  phalp2_15502 (7Zuwd)  2          29,7%         356               2.923E-28

Domains

Domains [InterPro]
GH25
PET_M23
Unannotated
Unannotated
Representative sequence (used for alignment): 5i2hu (495 AA)
Member sequence: D0R7H8 (331 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183, PF01551

Taxonomy

       Name                              Taxonomy ID  Lineage
Phage  Paenibacillus phage phiBP [NCBI]  666474       No lineage information
Host   No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
FN538971 [NCBI]
CDS location
range 859 -> 1854
strand +
CDS
ATGCAACAACGAAACAGCCGTAATGCCCAAGGAATAGACGTATCCCGTTATCAAGGAAAGATTGATTGGAAGGCGGTCAAGGCAAGTGGCATTTCCTTTGCCTTTATTAAGGCCAGTCAGGGTAAGTTATACCGCGACAAAACATTCATCGGTAATGCACAGGCTGCACGAGCTGTCGGAGTCCTGGTCGGGGCCTATCACTATTTAGACGATTCTGCGAAAACGCCAGAGGATGCCCGAAAGGAAGCTGCAAACTTTGTGAGCGCCATTAATTCAGCCGGAGGCATCGCGGCCTTCGATCTGCCGCCGGTTATGGATTATGAGTCCAACAAGTCCGGATTAAGCAAAGCGGCGCTTACAGCCGTAGCTAGGACATTTCTGGCGGAAGTTGAGCGGCTTACTGGAGTACGACCAATCGTGTACACATATCCTTCGTTCATCGGTAATTTCAGCGGTTTATCAGATTACCCATTGTGGATTGCCCGGTACAGCGCCACACAGGTTCCGCCTGGCGCATCTGGTTGGTCACGCTGGGATTTCTGGCAGTATAGCGATGGCTCGGCTGGTGGCACATTGCCGTCCGGCACACGTAAGGTGGCTGGCATAGCGGGTCCAGTCGATTTGAATGAATTTGACGGTACGGCAGATGAGCTTCGGACACGGTTTAGGAAGAAGACATCCGTACCCAAAGAAGAGCCTGCCACTGTTGCGGATGGATCATTCCAGATCAACGGAGAGAATGTGGGTAAAGCTCTGCTGTTTGATGGAAAGACTCACGTTCCGTTGCGTGTGCTGGCGGATGCTCTCGGTATCCCTTTACGCTGGGATAATGCCAAAAAGGTCGCATATTTAAACAATCGCAAATTGCAATCTGTGCAGCTTGTGGAGGGTGTCGCTTATATACAACTTCGGCCAATTGCAGAATCTTATGGAGCCGAAGTTTCATGGGATTCTAAAAATAGAATTGCTAGTCTGAAAACGAAAGGAGAAAAGTAA
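The CDS coordinates are consistent with the protein length listed for this entry. As a small sketch, assuming the range is 1-based and inclusive (the usual GenBank convention) and that the CDS ends with a stop codon, the expected protein length follows from simple arithmetic; the function names are illustrative only.

```python
def cds_length(start: int, end: int) -> int:
    """Number of nucleotides in a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by the range, excluding the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    codons = n // 3
    return codons - 1 if has_stop else codons


print(cds_length(859, 1854))      # 996
print(protein_length(859, 1854))  # 331
```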

Gene Ontology

GO ID       Description                                Category            Evidence (source)
GO:0003796  lysozyme activity                          molecular function  None (UniProt)
GO:0009253  peptidoglycan catabolic process            biological process  None (UniProt)
GO:0016052  carbohydrate catabolic process             biological process  None (UniProt)
GO:0016998  cell wall macromolecule catabolic process  biological process  None (UniProt)
GO:0019835  cytolysis                                  biological process  None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi0001bb3851_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
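The confidence bands above map a per-residue pLDDT score to a label. The sketch below shows one way to apply them; because the legend uses strict inequalities, the handling of the exact boundary values 90, 70 and 50 is an assumption here (each is assigned to the lower band), and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Return the model-confidence label for a pLDDT score in [0, 100]."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Illustrative scores only, not taken from the model on this page.
for score in (95.2, 82.0, 61.5, 34.0):
    print(score, plddt_band(score))
```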

The structures below correspond to the cluster representative (5i2hu) rather than this protein.
PDB ID
5i2hu
Method AlphaFoldv2
Resolution 87.97
Chain position -