Protein
- Protein accession
- I3VYU7 [UniProt]
- Representative
- f6dF
- Source
- UniProt (cluster: phalp2_9369)
- Protein name
- lysozyme
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MIKIIDVSSNNTVTSTHEVKNAGYYGIICKATEGTSYVNPTFADVVNRTKAVGLKSGSYHFARPDVNSNAEQEAEHYYNVMKQFTFDLPPVLDIEVQGNMNKQQLTNWALTFLKRIEKLTGRKPWIYTGDYFYNNCLIPAQLKEYPLWIAAYGSTEPTTPHIMWQYTDSENVPGVGKCDVSLANDDFSIRTGGNSMLQRGSTGDAVKTLQQQLNQLGYNLVVDGVFGANTENAVKDFQSKHGLVVDGIVGPATQAAISNALQQLQPQPQPQPTTTYTIVFDDQNQANLVAQVLGGKVRQIQ
- Physico‐chemical
properties -
protein length: 301 AA molecular weight: 33263,0 Da isoelectric point: 5,74 hydropathy: -0,38
Representative Protein Details
- Accession
- f6dF
- Protein name
- f6dF
- Sequence length
- 440 AA
- Molecular weight
- 48294,40550 Da
- Isoelectric point
- 5,57894
- Sequence
-
MSYPITVSFIPNLPQEPYENGVGKYEGVVAHSTATPNATASAERNYETTHFNDAFVHYFVDWTSIVNTANVDYLSWHGGHTANQRYVGVELCETADANLFKQSYDRYTWLLAKILLDKGLPMVHMKTFFTHDDTSKLFHETTHSDPVGYLQSHGISINQLVSDVTAQYNAMKGSAHPMLQMGSTGISVSELQNTLNSKGYNLVVDGSFGNSTLSAVKDFQSKNGLTADGVVGDNTWAKLTAPVPKPVAPAPSNMYRVRIAWEDASSQIGAFTILDSAKTLADQHKDHKVFDESGKVVYDPNPPVPQPATTQIYRVRKDWADSKSQLGAFSNLESAKTIADQNAGYKVFDSTGKQIHPVVVEQPVPTQPTPQAKPEELNGDLPSGAVGECVIKVKDLDIYNSYDANSQKIGVIVEGQKFPVYKVVNDFYLISNGMWISNVS
Other Proteins in cluster: phalp2_9369
| Total (incl. this protein): 2 | Avg length: 370,5 | Avg pI: 5,66 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| f6dF | 440 | 5,57894 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_20229
8m9sH
|
6 | 44,6% | 430 | 7.167E-101 |
| 2 |
phalp2_22248
7gCZr
|
4 | 32,0% | 306 | 8.912E-27 |
| 3 |
phalp2_32070
5P7Lm
|
156 | 23,4% | 316 | 1.394E-11 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Thermoanaerobacterium phage THSA-485A [NCBI] |
1126885 | No lineage information |
| Host |
Thermoanaerobacterium saccharolyticum JW/SL-YS485 [NCBI] |
1094508 | Firmicutes > Clostridia > Thermoanaerobacterales > Thermoanaerobacterales Family III. Incertae Sedis > Thermoanaerobacterium > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
CP003186
[NCBI]
CDS location
range 5761 -> 6666
strand -
strand -
CDS
ATGATTAAAATTATTGATGTATCCAGTAACAATACCGTAACCAGCACACACGAAGTTAAAAATGCTGGTTATTATGGAATCATTTGTAAGGCAACTGAAGGAACAAGCTATGTTAATCCCACTTTCGCTGACGTAGTAAACCGTACTAAAGCGGTTGGGCTTAAGAGTGGCAGTTATCATTTTGCAAGACCAGACGTAAATTCTAACGCCGAGCAAGAAGCAGAACATTACTATAACGTGATGAAACAGTTTACATTTGACCTACCACCAGTCTTAGACATTGAAGTACAAGGCAATATGAATAAACAACAGCTTACTAATTGGGCTTTGACGTTTCTAAAGAGAATAGAGAAGTTGACAGGAAGAAAACCTTGGATATACACAGGCGATTACTTCTACAACAACTGCTTAATTCCAGCACAATTAAAGGAATATCCTTTATGGATAGCAGCTTATGGTAGTACAGAGCCAACGACGCCTCACATCATGTGGCAGTATACAGATAGCGAAAATGTTCCCGGCGTCGGCAAATGCGATGTATCATTAGCAAACGATGATTTTAGTATTAGAACAGGAGGTAATTCTATGTTACAAAGAGGAAGTACAGGTGATGCTGTAAAAACATTACAGCAACAATTAAATCAATTAGGGTACAATTTAGTAGTTGACGGTGTATTTGGTGCAAACACAGAAAACGCAGTAAAAGACTTTCAGAGCAAACATGGTCTTGTAGTAGATGGTATAGTTGGTCCAGCAACACAGGCAGCTATAAGCAATGCTTTACAGCAATTACAACCACAGCCTCAACCGCAACCGACGACTACTTACACAATTGTATTTGACGATCAAAACCAAGCTAACCTTGTGGCACAGGTATTAGGCGGAAAAGTAAGACAAATACAATAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0003796 | lysozyme activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
| GO:0016052 | carbohydrate catabolic process | biological process | None (UniProt) |
| GO:0016998 | cell wall macromolecule catabolic process | biological process | None (UniProt) |
Enzymatic activity
| EC Number | Entry Name | Reaction Catalyzed | Classification | Evidence | Source |
|---|---|---|---|---|---|
| 3.2.1.17 | None | Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. |
match to sequence model evidence used in automatic assertion
ECO:ECO:0000256 |
ARBA:ARBA00000632 |
Tertiary structure
PDB ID
upi000263f830_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(f6dF)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50