Protein
- Protein accession
- G3MB93 [UniProt]
- Representative
- 8LXT6
- Source
- UniProt (cluster: phalp2_17838)
- Protein name
- Gp22
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MYKIAICAGHGFNTPGKRTPDGEREWSFNNKLVVAFVNTMNQYENVTVKRYDDPTGKTDVPLITRTNGANAWGANIYISFHHNANTAKWGTWTGVETFYYPGSTTGLNLAKAIHPAVVQSYGLKDRGVKSGNLHIVRETKMPAILIEGGFMDSTIDIKKLRSDAVLKAAGENIAHAVAKYAGLKKKSSTSTPKPPTPTTPNAATYTVVKGDTLYGISQKFGISVDTIKSLNGLKTNTISIGQVLKVKK
- Physico‐chemical
properties -
protein length: 248 AA molecular weight: 26891,4 Da isoelectric point: 9,78 hydropathy: -0,34
Representative Protein Details
- Accession
- 8LXT6
- Protein name
- 8LXT6
- Sequence length
- 556 AA
- Molecular weight
- 60975,38030 Da
- Isoelectric point
- 9,26174
- Sequence
-
MYKIAICAGHGFNTPGKRTPDGEREWSFNNKLVVAFVNTMNQYENVTVKRYDDPTGKTDVPLITRTNGANAWGANIYISFHHNANTAKWGTWTGVETFYYPGSTTGLNLAKAIHPAVVQSYGLKDRGVKSGNLHIVRETKMPAILIEGGFMDSTIDIKKLRSDAVLKAAGENIAHAVAKYAGLKKKSSTSTPKPPTPTTPNAATYTVVKGDTLYGISQKFGISVDTIKSLNGLKTNTISIGQVLKVKKVEDVTADSFYRVFVNGKQVGAFSVKENAYEDGWKQYNSGKTDAYLVEPNGSKYTFNAHQSENPNKKQDETNSTKNPILGKSVVTSSQMVTLVKSKNPNAKNIEEIAEAFLKVGEKYGLRGDIAFCQSIIETGWFRFDGGTAVTPDQHNYCGLGVTSTGIKGAVFATIKDGVTAQIQHLYAYATKSALPAGETLLDPRFQYVTRGIAPNWEDLSNRWAMNANYGTQILNMYNQLLQVPASNTSPEKPNTPKPKEEKKDVLFTHSEINNKQYASVVIDGKGYIAVSDVADLLDLKVVFDKVTKEIIFIKK
Other Proteins in cluster: phalp2_17838
| Total (incl. this protein): 17 | Avg length: 410,2 | Avg pI: 8,15 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 8LXT6 | 556 | 9,26174 |
| 1GGtO | 483 | 7,15781 |
| 1LJpS | 472 | 5,46901 |
| 36Dn5 | 377 | 7,60048 |
| 3NcEn | 690 | 9,14150 |
| 4NP0T | 690 | 6,43567 |
| 4lxDw | 379 | 6,41146 |
| 4mtAe | 379 | 7,59809 |
| 5V7hd | 422 | 8,73329 |
| 7vAas | 466 | 8,70950 |
| Zkdg | 382 | 9,09683 |
| A0A0U4JE28 | 256 | 9,81513 |
| A0A5Q2F3V6 | 267 | 7,02737 |
| A0A6G6XUR0 | 274 | 9,44470 |
| A0A6G9L8D2 | 268 | 7,02737 |
| A0AA52HLC6 | 365 | 9,79302 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_35035
7KLgC
|
2 | 28,0% | 488 | 6.598E-76 |
| 2 |
phalp2_5670
4yIAx
|
6 | 26,3% | 538 | 1.812E-66 |
| 3 |
phalp2_14180
2AXSB
|
54 | 24,5% | 557 | 2.936E-60 |
| 4 |
phalp2_39119
2teHi
|
5 | 25,5% | 509 | 3.320E-59 |
| 5 |
phalp2_2869
netU
|
143 | 25,8% | 557 | 1.318E-51 |
| 6 |
phalp2_29726
1lBT8
|
1 | 25,3% | 497 | 1.517E-36 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Donellivirus gee [NCBI] |
1084719 | Donellivirus > |
| Host |
Bacillus megaterium [NCBI] |
1404 | Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
JN638751
[NCBI]
CDS location
range 19865 -> 20611
strand +
strand +
CDS
ATGTACAAAATAGCTATTTGTGCTGGTCATGGTTTCAACACTCCTGGTAAGAGAACTCCAGATGGTGAAAGAGAATGGAGCTTTAACAATAAGTTAGTTGTTGCTTTCGTTAACACTATGAATCAGTATGAGAATGTTACTGTTAAGCGTTATGATGATCCAACGGGTAAAACTGACGTGCCTTTAATTACTCGTACTAATGGGGCAAACGCTTGGGGAGCAAACATCTACATTTCATTCCACCACAACGCTAATACTGCTAAATGGGGTACTTGGACAGGTGTTGAAACTTTCTACTATCCAGGTTCTACGACTGGATTGAACTTAGCTAAAGCTATTCACCCAGCAGTTGTTCAATCTTATGGATTGAAAGATAGAGGCGTTAAGTCTGGAAATCTTCACATCGTTAGAGAAACAAAAATGCCAGCTATTTTAATCGAAGGTGGATTCATGGATTCTACTATCGATATTAAAAAACTTAGAAGCGACGCTGTATTAAAAGCTGCTGGTGAAAACATTGCTCATGCTGTAGCTAAGTACGCTGGTTTAAAGAAAAAATCTTCTACTTCTACTCCTAAACCGCCAACTCCTACTACTCCAAATGCTGCAACTTACACTGTAGTTAAAGGAGATACTCTGTATGGAATTTCTCAAAAATTTGGAATCAGTGTCGATACTATCAAATCATTAAATGGTTTGAAAACTAATACTATCTCTATTGGGCAGGTATTAAAGGTAAAAAAGTAG
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0008745 | N-acetylmuramoyl-L-alanine amidase activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi00022a82a4_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(8LXT6)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50