Protein

Protein accession
I3VYU7 [UniProt]
Representative
f6dF
Source
UniProt (cluster: phalp2_9369)
Protein name
lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIKIIDVSSNNTVTSTHEVKNAGYYGIICKATEGTSYVNPTFADVVNRTKAVGLKSGSYHFARPDVNSNAEQEAEHYYNVMKQFTFDLPPVLDIEVQGNMNKQQLTNWALTFLKRIEKLTGRKPWIYTGDYFYNNCLIPAQLKEYPLWIAAYGSTEPTTPHIMWQYTDSENVPGVGKCDVSLANDDFSIRTGGNSMLQRGSTGDAVKTLQQQLNQLGYNLVVDGVFGANTENAVKDFQSKHGLVVDGIVGPATQAAISNALQQLQPQPQPQPTTTYTIVFDDQNQANLVAQVLGGKVRQIQ
Physico‐chemical
properties
protein length:301 AA
molecular weight:33263,0 Da
isoelectric point:5,74
hydropathy:-0,38
Representative Protein Details
Accession
f6dF
Protein name
f6dF
Sequence length
440 AA
Molecular weight
48294,40550 Da
Isoelectric point
5,57894
Sequence
MSYPITVSFIPNLPQEPYENGVGKYEGVVAHSTATPNATASAERNYETTHFNDAFVHYFVDWTSIVNTANVDYLSWHGGHTANQRYVGVELCETADANLFKQSYDRYTWLLAKILLDKGLPMVHMKTFFTHDDTSKLFHETTHSDPVGYLQSHGISINQLVSDVTAQYNAMKGSAHPMLQMGSTGISVSELQNTLNSKGYNLVVDGSFGNSTLSAVKDFQSKNGLTADGVVGDNTWAKLTAPVPKPVAPAPSNMYRVRIAWEDASSQIGAFTILDSAKTLADQHKDHKVFDESGKVVYDPNPPVPQPATTQIYRVRKDWADSKSQLGAFSNLESAKTIADQNAGYKVFDSTGKQIHPVVVEQPVPTQPTPQAKPEELNGDLPSGAVGECVIKVKDLDIYNSYDANSQKIGVIVEGQKFPVYKVVNDFYLISNGMWISNVS
Other Proteins in cluster: phalp2_9369
Total (incl. this protein): 2 Avg length: 370,5 Avg pI: 5,66

Protein ID Length (AA) pI
f6dF 440 5,57894
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20229
8m9sH
6 44,6% 430 7.167E-101
2 phalp2_22248
7gCZr
4 32,0% 306 8.912E-27
3 phalp2_32070
5P7Lm
156 23,4% 316 1.394E-11

Domains

Domains [InterPro]
Ami2
PG_1
Unannotated
Unannotated
Unannotated
Representative sequence (used for alignment): f6dF (440 AA)
Member sequence: I3VYU7 (301 AA)
1 440 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Thermoanaerobacterium phage THSA-485A
[NCBI]
1126885 No lineage information
Host Thermoanaerobacterium saccharolyticum JW/SL-YS485
[NCBI]
1094508 Firmicutes > Clostridia > Thermoanaerobacterales > Thermoanaerobacterales Family III. Incertae Sedis > Thermoanaerobacterium >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
CP003186 [NCBI]
CDS location
range 5761 -> 6666
strand -
CDS
ATGATTAAAATTATTGATGTATCCAGTAACAATACCGTAACCAGCACACACGAAGTTAAAAATGCTGGTTATTATGGAATCATTTGTAAGGCAACTGAAGGAACAAGCTATGTTAATCCCACTTTCGCTGACGTAGTAAACCGTACTAAAGCGGTTGGGCTTAAGAGTGGCAGTTATCATTTTGCAAGACCAGACGTAAATTCTAACGCCGAGCAAGAAGCAGAACATTACTATAACGTGATGAAACAGTTTACATTTGACCTACCACCAGTCTTAGACATTGAAGTACAAGGCAATATGAATAAACAACAGCTTACTAATTGGGCTTTGACGTTTCTAAAGAGAATAGAGAAGTTGACAGGAAGAAAACCTTGGATATACACAGGCGATTACTTCTACAACAACTGCTTAATTCCAGCACAATTAAAGGAATATCCTTTATGGATAGCAGCTTATGGTAGTACAGAGCCAACGACGCCTCACATCATGTGGCAGTATACAGATAGCGAAAATGTTCCCGGCGTCGGCAAATGCGATGTATCATTAGCAAACGATGATTTTAGTATTAGAACAGGAGGTAATTCTATGTTACAAAGAGGAAGTACAGGTGATGCTGTAAAAACATTACAGCAACAATTAAATCAATTAGGGTACAATTTAGTAGTTGACGGTGTATTTGGTGCAAACACAGAAAACGCAGTAAAAGACTTTCAGAGCAAACATGGTCTTGTAGTAGATGGTATAGTTGGTCCAGCAACACAGGCAGCTATAAGCAATGCTTTACAGCAATTACAACCACAGCCTCAACCGCAACCGACGACTACTTACACAATTGTATTTGACGATCAAAACCAAGCTAACCTTGTGGCACAGGTATTAGGCGGAAAAGTAAGACAAATACAATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016052 carbohydrate catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi000263f830_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (f6dF) rather than this protein.
PDB ID
f6dF
Method AlphaFoldv2
Resolution 85.75
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50