Protein
- Protein accession: A0A222ZQH1 [UniProt]
- Representative: 8sEzo
- Source: UniProt (cluster: phalp2_15548)
- Protein name: Lysin A, protease C39 domain
- Lysin probability: 100%
- PhaLP type: endolysin (probability: 99%, predicted by ML model)
- Protein sequence:
MAEKVLPYDRSVVPQETPWDCGPAATQVVLNSRGIIVSESDLIREIGTTVNGTDYVGLITDRSLKRRLPDAKYTSVYIENDPPTAAQRDALWRNVVRSIDAGYGVVVNWVAPPSNYPRGVKGSVSPRYSGGTVYHYVACMGYDDNPALRAVWIADSGFQPQGYWISFDQLASLIPPKGYAYADAAPVGGGPAPSGPDPVDVLSQAMGGRLSRERYAALLPAVAQALAECQCGTVERVAMWCAQIGHESGGLYYMEEIADGSAYEGRTDLGNTQPGDGKRFKGRGPIQITGRSNYTRLSQWAFDKGLVPSPTFFVDDPAQLASDRYGFLGVVWYWTVARPQINSMCDANDLDGVTRAINGGLNGIDDRRARWDRCRAMGAALLAITTTQEDDPLSALSADEQRELLSLLRILASNRFVSRSPLRHLGEGPVETVAGFALNTDGNQHVLLVHDLAKAGDPDALALLREVASAEGNSRYPDRQADAKLAKRLLADVDASKAPAQPSTPTTPTTPSEPSAPTSPVKASCALSAAGCVVADATSGGGCALSTDGTGKCVVAAATEDGK
- Physico-chemical properties: protein length: 563 AA; molecular weight: 59853,4 Da; isoelectric point: 5,15; hydropathy: -0,25
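The hydropathy figure above is not defined on this page. A minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy index over all residues), shows how such a value could be computed from a sequence. The function names `gravy` and `parse_decimal_comma` are hypothetical helpers, and the scale choice is an assumption rather than the database's documented method.

```python
# Kyte-Doolittle hydropathy index per residue.
# ASSUMPTION: the page's "hydropathy" value is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    try:
        total = sum(KYTE_DOOLITTLE[aa] for aa in seq)
    except KeyError as exc:
        # Non-standard residues (X, B, Z, ...) are rejected rather than guessed.
        raise ValueError(f"unsupported residue: {exc.args[0]}") from None
    return total / len(seq)


def parse_decimal_comma(text: str) -> float:
    """Convert a value written with a decimal comma (e.g. '-0,25') to float."""
    return float(text.strip().replace(",", "."))
```

Calling `gravy(...)` on the protein sequence above and comparing the result with `parse_decimal_comma("-0,25")` would be one way to check whether the assumption about the scale holds.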
Representative Protein Details
- Accession: 8sEzo
- Protein name: 8sEzo
- Sequence length: 448 AA
- Molecular weight: 48651,44430 Da
- Isoelectric point: 6,03888
- Sequence:
MGYGTYGNPAVDSFVARKIRPDGFSTFRRDAVVVDPTNAVRTLMDAMGAGVSEQRYQQLFPAVADALRKCGCTSVKRIAMWCSQIGHESGGLRWMEEIADGSQYEGRTDLGNTQIGDGKRYKGRGPIQVTGRSNYTTLSKWAYAQGLVPSPTFFVDNPGQLATDTYGFLGVVWYWTVARPNINSMADSGNLEGVTRAINGGLNGFADRQQRYNNCISMGDRLLQLVASEPKEEGFLMALSAEEQVEIRDKVRQLWGAAFNLVPSKSRYGNPKDLWPSKDFDRNMDGFMYDIITEHDAALGDPAALRRVREAADKGDVIAQHFLEKLTAAPLPATPVSAVVSAPPANNVTCWNCSKHYPDVLPNCPFCGASQSPPSEPPAAIAQAQPVEASVGRHAAERVNPVVNEGLPAVDKSVVDQLTLLGQFNNQLPAEVSTAITQLIPVLKGLVK
Other Proteins in cluster: phalp2_15548
Total (incl. this protein): 71; Avg length: 483,7 AA; Avg pI: 5,65

| Protein ID | Length (AA) | pI |
|---|---|---|
| 8sEzo | 448 | 6,03888 |
| 2qOSX | 495 | 4,74170 |
| 37UOZ | 509 | 5,34482 |
| 4OIKI | 368 | 4,85743 |
| 4dKnP | 481 | 4,92654 |
| 6x5xN | 476 | 5,40490 |
| 75mXc | 485 | 4,86504 |
| 75r5P | 477 | 4,81696 |
| 76XAi | 479 | 6,64848 |
| 79I4I | 487 | 5,31509 |
| 7AvKB | 477 | 6,28374 |
| 7Avoe | 477 | 6,15250 |
| 7AwP9 | 461 | 5,86314 |
| 7Awuh | 477 | 6,15250 |
| 7AxbS | 477 | 6,15250 |
| 7Axln | 477 | 6,03519 |
| 7Axxr | 461 | 5,86314 |
| 7PZ9y | 489 | 5,92708 |
| 7Q0UN | 494 | 6,24845 |
| 7Q1dz | 477 | 6,15250 |
| 7Q1oW | 470 | 6,03439 |
| 7XoZs | 566 | 4,69083 |
| 7aXWI | 477 | 6,15250 |
| 7hOqf | 472 | 5,22921 |
| 7iJBl | 489 | 6,15057 |
| 7jEga | 477 | 6,15154 |
| 7pGjR | 479 | 5,21813 |
| 7pNYj | 472 | 5,16100 |
| 7pOhl | 471 | 5,10161 |
| 7q5Qk | 473 | 4,93012 |
| 7q5li | 482 | 5,47782 |
| 7q7w5 | 488 | 5,05784 |
| 7q805 | 477 | 4,87419 |
| 7q8SH | 481 | 4,88340 |
| 7qdK7 | 476 | 4,68043 |
| 7qfAN | 465 | 5,24251 |
| 7qiKj | 474 | 5,38000 |
| 7qiop | 471 | 4,85197 |
| 7qlmA | 482 | 5,38330 |
| 7r4fM | 465 | 5,46697 |
| 7ubJK | 434 | 4,96269 |
| 7zRXh | 477 | 6,15250 |
| 867Gn | 559 | 4,77160 |
| 87a1T | 580 | 4,75404 |
| 8JrmF | 476 | 5,98790 |
| 8Jrmg | 477 | 6,15250 |
| 8Jrmh | 477 | 6,03405 |
| 8Jrmi | 477 | 6,15159 |
| 8Jrmj | 477 | 6,03672 |
| 8Jrml | 477 | 6,08407 |
| 8Jrmm | 477 | 6,03519 |
| 8Jrmt | 477 | 6,15250 |
| 8Jrmu | 475 | 6,23492 |
| 8Jrna | 461 | 5,97499 |
| 8Jrnj | 477 | 5,92572 |
| 8Jrol | 462 | 6,02473 |
| 8LVSm | 477 | 6,15250 |
| 8M13e | 475 | 6,38844 |
| 8MAYX | 477 | 5,82318 |
| 8MCav | 477 | 6,03342 |
| 8MDmC | 477 | 6,15250 |
| 8MGdK | 477 | 6,15250 |
| 8MKQB | 477 | 6,15057 |
| 8MLma | 477 | 6,03519 |
| 8MLqM | 477 | 6,28374 |
| 8MWgo | 477 | 6,15250 |
| 8MxQF | 477 | 6,28454 |
| 8lGA4 | 584 | 4,79263 |
| A0A0A7S349 | 563 | 5,16987 |
| A0A222ZQI6 | 572 | 5,15145 |
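The table rows use decimal commas, so they need a small conversion step before any averaging. The sketch below parses rows in the format shown above and computes a mean length and mean pI. The helper names `parse_cluster_row` and `summarize` are hypothetical, and the three sample rows are copied from the table purely for illustration, so the result is not the cluster-wide average.

```python
from statistics import mean


def parse_cluster_row(line: str) -> tuple[str, int, float]:
    """Parse one '| ID | length | pI |' row; pI uses a decimal comma."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    if len(cells) != 3:
        raise ValueError(f"expected 3 cells, got {len(cells)}: {line!r}")
    protein_id, length, pi = cells
    return protein_id, int(length), float(pi.replace(",", "."))


def summarize(rows: list[str]) -> dict:
    """Return the row count, mean length and mean pI for the given rows."""
    parsed = [parse_cluster_row(row) for row in rows]
    return {
        "count": len(parsed),
        "avg_length": mean(p[1] for p in parsed),
        "avg_pi": mean(p[2] for p in parsed),
    }


# Three rows taken from the table above, for illustration only.
sample_rows = [
    "| 8sEzo | 448 | 6,03888 |",
    "| 2qOSX | 495 | 4,74170 |",
    "| 37UOZ | 509 | 5,34482 |",
]
summary = summarize(sample_rows)
```

To compare against the reported averages, all table rows would have to be passed in, together with the values for the current protein, since the reported total includes it.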
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_24932 (8Jrom) | 307 | 43,4% | 341 | 1.671E-133 |
| 2 | phalp2_8241 (7x5Zv) | 42 | 46,5% | 292 | 8.032E-89 |
Domains
Domains [InterPro] (figure; axis from 1 to 448 AA, representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Mycobacterium phage Findley [NCBI] | 2015882 | Timquatrovirus > Timquatrovirus findley |
| Host | No host information | | |
Coding sequence (CDS)
- CDS source ID: MF140411 [NCBI]
- CDS location: range 23102 -> 24793, strand +
- CDS:
TTGGCTGAGAAGGTTCTGCCTTACGACCGCTCGGTCGTGCCGCAAGAGACACCGTGGGACTGCGGCCCGGCCGCAACGCAGGTCGTGCTCAACTCGCGCGGCATCATCGTCAGCGAGTCCGACCTGATCCGCGAGATTGGGACGACCGTCAACGGCACCGATTACGTCGGGCTGATTACCGACCGCTCACTCAAGCGGCGCCTGCCCGACGCCAAGTACACCTCGGTGTACATCGAAAACGACCCGCCGACCGCGGCGCAGCGCGACGCCCTGTGGCGCAACGTGGTTCGCTCGATCGACGCCGGGTACGGCGTGGTGGTCAATTGGGTCGCCCCGCCCTCGAACTATCCGCGCGGCGTCAAGGGCTCGGTGAGCCCCCGCTACAGCGGCGGCACCGTGTACCACTACGTCGCTTGCATGGGCTACGACGACAACCCGGCCCTGCGCGCAGTGTGGATCGCCGACAGTGGCTTTCAGCCGCAAGGGTATTGGATCAGCTTCGACCAGCTCGCCTCGCTGATCCCGCCGAAGGGCTACGCCTACGCCGACGCTGCCCCGGTGGGCGGCGGCCCGGCGCCGAGCGGCCCCGACCCGGTCGACGTGCTGTCTCAGGCTATGGGCGGCAGGCTGTCTCGCGAGCGTTACGCCGCGCTGCTGCCCGCGGTGGCCCAGGCCCTCGCCGAGTGCCAGTGCGGCACCGTCGAGCGTGTCGCAATGTGGTGCGCGCAGATCGGGCACGAGTCGGGCGGCCTGTACTACATGGAGGAAATCGCCGACGGCAGCGCCTACGAGGGCCGCACCGACCTGGGCAACACACAGCCGGGCGACGGTAAGCGGTTCAAGGGCCGCGGCCCTATCCAGATCACCGGCCGGTCGAACTACACGCGCCTGTCGCAATGGGCGTTCGACAAGGGCCTCGTGCCGTCGCCGACGTTCTTTGTCGACGACCCGGCGCAATTGGCAAGCGACCGTTACGGATTCCTGGGCGTCGTCTGGTACTGGACGGTCGCCCGGCCGCAGATCAACAGCATGTGCGACGCCAACGACCTTGACGGCGTCACGCGGGCGATCAACGGCGGGCTGAACGGCATCGACGACCGGCGCGCTCGCTGGGACCGCTGCCGCGCAATGGGCGCGGCGTTACTCGCAATCACGACGACACAGGAGGATGACCCGTTGTCTGCCCTATCCGCTGACGAGCAGCGCGAACTGCTCTCGCTGCTGCGCATTCTCGCGAGCAACCGTTTCGTGAGCCGCAGCCCGCTGCGCCACCTCGGCGAGGGGCCGGTCGAAACGGTCGCCGGGTTCGCGCTCAACACCGACGGCAACCAGCACGTGCTGCTCGTGCATGACCTAGCGAAGGCGGGCGACCCTGACGCCCTGGCGCTGCTGCGCGAAGTGGCGAGCGCCGAAGGCAATTCGCGTTACCCCGACCGGCAGGCCGACGCGAAGCTCGCCAAGCGGCTGCTCGCTGACGTCGACGCCAGCAAGGCACCGGCGCAGCCGTCGACGCCGACCACGCCGACCACGCCGAGCGAGCCGAGCGCGCCGACGTCGCCGGTCAAGGCGTCCTGTGCGCTGTCTGCGGCCGGGTGCGTTGTGGCTGACGCGACCTCGGGCGGTGGCTGCGCCCTGTCCACCGACGGCACCGGCAAGTGCGTCGTCGCCGCCGCGACCGAGGATGGGAAGTAA
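The CDS coordinates can be cross-checked against the reported protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention, not stated on this page) and that the CDS includes the terminal stop codon, which matches the trailing TAA in the sequence above. The function names are hypothetical.

```python
def cds_length_nt(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS spanning start..end.

    ASSUMPTION: coordinates are 1-based inclusive; when has_stop is True,
    the final codon is a stop codon and does not contribute a residue.
    """
    n_nt = cds_length_nt(start, end)
    if n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    codons = n_nt // 3
    return codons - 1 if has_stop else codons


# 24793 - 23102 + 1 = 1692 nt = 564 codons = 563 residues plus one stop codon.
protein_length = expected_protein_length(23102, 24793)
```

Under these assumptions the result is 563, which agrees with the protein length listed in the physico-chemical properties.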
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0006508 | proteolysis | biological process | None (UniProt) |
| GO:0008233 | peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
- PDB ID: upi000b9d5e28_model
- Method: AlphaFold3 (non-commercial)
- Resolution: -
- Chain position: -
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
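The confidence bands above can be expressed as a small lookup, for example when colouring residues by per-residue pLDDT. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend leaves the exact boundary values (90, 70, 50) unassigned, the code assumes they fall into the lower band.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    ASSUMPTION: scores exactly equal to 90, 70 or 50 are placed in the
    lower band, since the legend uses strict inequalities throughout.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Example: label a few illustrative (made-up) per-residue scores.
bands = [plddt_band(s) for s in (95.2, 80.0, 61.5, 30.1)]
```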
The structures below correspond to the cluster representative (8sEzo) rather than this protein.