Protein
- Protein accession
- R4JHA2 [UniProt]
- Representative
- 8MRZu
- Source
- UniProt (cluster: phalp2_39755)
- Protein name
- Lysin A
- Lysin probability
- 100%
- PhaLP type
- endolysin (probability: 99%, predicted by ML model)
- Protein sequence
-
MSFRIVNGNTHTEKGWRCCNRDECDIVRIPGLYLTDTAPIRKGAPLTILGAWLYWYDRNVEEIVTPIWGWSLDNDVLGQPGKNNGSNHLSGVAVDVNAPKYPWGTYRMSADKIAKVEEGLRLFEGTVFWGRRWSKPDEMHYDLSYPEGDPRNEAFAKKLLDGYLGIYKPATQVSAAPILAAATGLSEARAAEILPAVRSGLRESECTNVNRIAMWLAQIGHESGSFQYTEEIAKNGRYAPYIGRTWIQITWDYNYRSFSQWAYAFGMVPTPDYFVVNYRELADLKWAGIGPAWYWTVARPDINELSDRRDLNTVTRRINGGTNGLADRQARYNRALAQGDALLQLLHEEDDFLSALTDAEQRELLDLARQQAKYKRKSRSPLHWPHEGEVDTIAGLSWSTDANVHIQLVEKLAVIYGDPVSIALLYAVSNSDDPTNNPELAKRILKRVKPEDITAAQVQIQKWLAAEQKFHAA
- Physico-chemical properties
- protein length: 473 AA; molecular weight: 53475,6 Da; isoelectric point: 6,06; hydropathy: -0,44
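The page does not say how the hydropathy value is computed. A minimal sketch, assuming it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues), could look like the following; the function name `gravy` and the short test fragment are illustrative only.

```python
# Kyte-Doolittle hydropathy values per residue (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY).

    Assumption: the page's "hydropathy" field is a GRAVY score.
    Raises KeyError for non-standard residue letters.
    """
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


# Short N-terminal fragment of the sequence above, for illustration only.
fragment = "MSFRIVNGNTHTEK"
print(len(fragment), round(gravy(fragment), 2))
```

Passing the full 473 AA sequence instead of the fragment would give a value comparable to the one listed, if the GRAVY assumption holds.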
Representative Protein Details
- Accession
- 8MRZu
- Protein name
- 8MRZu
- Sequence length
- 519 AA
- Molecular weight
- 57857,97150 Da
- Isoelectric point
- 5,00606
- Sequence
-
MSFRVVNGNTHTENGWRCCNRDECGIVDIADLFLTETAPLRNGAPLIILGAWLFWYDRNVEEIVSSVWGWSLNNDVLGQPGRNNGSNHLSGTAVDVNAPKYPWGLYRMTDDKIAKVREGLELFQGSIFWGRRWGELGVSKADEMHYQMAWPEGDPRNEAFAEKLRNGYLGIYGTDTGTETPSSPRKPVVPGQGGTFWNDVSQYQVKPIDESYPHKVFSFRTNSGDVTDTLALQNARAAKTLLDSGQLEIVIPYYFFRPGQANCDLHREILETAGLFNHPRTVSMVDVEGDKGSVKGDNSIEINDEVTRMRGWYGSNDRVIGYLNSNADPNLWPTRNGINLVVPQYGRTPGDISSIKDATVRTDAIAHQYTSTATDQSPWIGRSVDANWSPYDLSELLALFGIQKAEGLFMYLTKDEEFAIRDKILGYKSMGNKWPARGIFADSLEGVDDTVGMLLNSDGNIWDVLVILGALSGVSEHISRIERLANGQGPRGKEERFVKIAQELLRFIQAKDEGTEDNA
Other Proteins in cluster: phalp2_39755
Total (incl. this protein): 55 | Avg length: 428,8 | Avg pI: 6,02

| Protein ID | Length (AA) | pI |
|---|---|---|
| 8MRZu | 519 | 5,00606 |
| 4M8tn | 518 | 5,27013 |
| 8MS1A | 522 | 5,02016 |
| A0A2L1IWT7 | 519 | 5,00606 |
| A0A5Q2WFY0 | 472 | 5,94396 |
| A0A411BRR3 | 482 | 6,35144 |
| U5NZY5 | 225 | 6,22401 |
| G8I5V5 | 473 | 6,05508 |
| W8FPK3 | 482 | 6,19576 |
| A0A515MM52 | 473 | 6,24492 |
| A0A1B1PCT5 | 472 | 6,00444 |
| A0A481VSH7 | 226 | 6,52849 |
| V5R928 | 473 | 6,19218 |
| A0A3G2KG67 | 522 | 5,02016 |
| A0A0K1LRS8 | 473 | 6,19218 |
| A0A0K1LTD6 | 473 | 6,05508 |
| A0A0K2CLB2 | 485 | 6,11243 |
| A0A1L6BY60 | 473 | 6,05508 |
| A0A1L6BZ44 | 473 | 6,05508 |
| A0A2L0HK03 | 473 | 6,05508 |
| A0A2P1A164 | 473 | 6,05508 |
| A0A2P1CE32 | 473 | 6,05508 |
| A0A2P1CGN0 | 473 | 6,05508 |
| A0A2P1N3B0 | 473 | 6,05508 |
| A0A2Z5HF97 | 473 | 6,05508 |
| A0A345KKQ5 | 473 | 6,11192 |
| A0A345KPK5 | 473 | 6,05508 |
| A0A385DVK0 | 226 | 6,52849 |
| A0A386KNG9 | 225 | 6,22844 |
| A0A411BND8 | 473 | 6,05508 |
| A0A4D6T9Q1 | 473 | 6,05508 |
| A0A4Y5TXM6 | 473 | 6,05508 |
| A0A514DH66 | 226 | 6,52849 |
| A0A5B8RQY2 | 473 | 6,19218 |
| A0A5Q2WGH6 | 473 | 6,05508 |
| A0A6B9LEW7 | 473 | 6,05508 |
| A0A6B9LI91 | 226 | 6,53020 |
| A0A6M3SWZ9 | 226 | 6,52673 |
| A0A6M3T5F5 | 473 | 6,05508 |
| A0A7G8LEP4 | 473 | 6,05508 |
| A0A7G8LH55 | 473 | 6,11192 |
| A0A7G8LPS2 | 522 | 5,15134 |
| A0A7U0J6D3 | 473 | 5,87979 |
| G1JTQ9 | 473 | 6,05508 |
| A0A8A1VBL3 | 218 | 5,82960 |
| A0A8A1VES4 | 222 | 5,96783 |
| A0A8A1VG37 | 218 | 5,82960 |
| A0A8F3E638 | 473 | 5,93350 |
| A0A976SRZ9 | 473 | 6,24492 |
| A0A9E7QKG2 | 471 | 6,29551 |
| A0AAE7VBH6 | 225 | 6,52832 |
| A0AAU8GKY1 | 471 | 6,29551 |
| A0AAU8GQ87 | 473 | 6,19218 |
| B3VGP8 | 473 | 5,93350 |
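The table above uses decimal commas, which most parsers do not read as numbers by default. As a rough sketch of how the rows could be loaded and summarised, the helper below (a hypothetical `parse_rows`, not part of any PhaLP tooling) converts each row and computes the mean length of a two-row sample.

```python
from statistics import mean


def parse_rows(table_text: str) -> list[tuple[str, int, float]]:
    """Parse '| Protein ID | Length (AA) | pI |' rows with decimal commas."""
    rows = []
    for line in table_text.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 3 or not cells[1].isdigit():
            continue  # skip header, separator, and malformed lines
        protein_id, length, pi = cells
        rows.append((protein_id, int(length), float(pi.replace(",", "."))))
    return rows


# Two rows copied from the table above, as a small sample.
sample = """\
| Protein ID | Length (AA) | pI |
|---|---|---|
| 8MRZu | 519 | 5,00606 |
| U5NZY5 | 225 | 6,22401 |
"""
rows = parse_rows(sample)
print(mean(length for _, length, _ in rows))  # mean length of the sample rows
```

Note that the cluster averages reported on the page include this protein (R4JHA2), which is not itself listed as a row in the table.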
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_30503 5hxHI | 19 | 30,7% | 508 | 1.661E-62 |
| 2 | phalp2_1213 8JrjH | 54 | 30,4% | 371 | 1.185E-27 |
Domains
Domains [InterPro]
[Domain architecture figure: axis from 1 to 519 AA (representative). Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis. Legend: EAD, CBD, Linker, Disordered, Unannotated.]
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Mycobacterium phage PattyP [NCBI] | 1327773 | Fromanvirus > |
| Host | Mycobacterium smegmatis str. MC2 155 [NCBI] | 246196 | Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium > |
Coding sequence (CDS)
- CDS Source ID
- KC661273
- CDS Source
- NCBI
- CDS location
- range 4371 -> 5792, strand +
- CDS
-
GTGAGCTTCCGGATCGTCAACGGGAACACCCACACCGAGAAGGGATGGCGGTGCTGCAACCGGGACGAGTGCGACATCGTCCGCATCCCCGGCCTGTATCTAACCGACACAGCACCCATCCGTAAGGGAGCTCCGCTAACCATTCTCGGGGCGTGGCTGTACTGGTACGACCGCAACGTCGAGGAGATCGTCACCCCGATCTGGGGCTGGTCCCTTGATAACGACGTACTAGGGCAGCCGGGGAAGAACAACGGGTCTAACCACCTGTCCGGGGTAGCTGTGGACGTCAACGCGCCCAAGTACCCCTGGGGCACGTACCGGATGTCTGCGGACAAGATTGCCAAGGTCGAGGAAGGACTTCGGCTGTTCGAAGGGACAGTCTTCTGGGGACGGCGCTGGTCAAAGCCCGACGAGATGCACTACGACCTCTCGTACCCCGAAGGCGACCCTCGAAACGAAGCCTTCGCGAAGAAGCTCCTGGATGGGTACCTCGGGATTTACAAGCCCGCTACCCAGGTGTCCGCAGCCCCCATCCTGGCGGCGGCCACCGGCCTGAGCGAAGCTCGCGCGGCGGAGATCCTGCCCGCGGTCCGCTCGGGCCTCCGGGAATCCGAATGCACGAACGTCAACCGCATCGCGATGTGGCTGGCTCAGATCGGGCATGAGTCCGGGTCATTCCAGTACACCGAGGAGATCGCCAAGAACGGTCGGTACGCGCCGTACATCGGCCGGACGTGGATTCAGATCACCTGGGACTACAACTACCGGTCGTTCTCGCAGTGGGCGTACGCGTTCGGGATGGTTCCGACTCCGGACTACTTCGTCGTGAACTACCGCGAGCTCGCTGACCTGAAGTGGGCGGGCATCGGCCCTGCCTGGTACTGGACGGTCGCCCGCCCGGACATCAACGAGCTGTCCGATCGCCGCGACCTGAACACGGTCACCCGCCGGATCAACGGCGGCACCAACGGCCTCGCGGATCGACAAGCCCGCTACAACCGCGCGCTCGCCCAGGGCGATGCGCTGCTGCAACTACTTCACGAAGAGGACGACTTTTTGTCTGCTCTAACCGACGCTGAACAGCGTGAGTTGCTGGACCTGGCTCGCCAGCAGGCCAAGTACAAGCGCAAGTCCCGCTCGCCGCTGCACTGGCCGCACGAGGGCGAGGTCGATACGATCGCCGGCTTGTCCTGGTCGACGGACGCGAACGTCCATATCCAGCTGGTCGAGAAGCTCGCTGTGATCTACGGCGACCCGGTCTCGATCGCGCTGCTGTACGCGGTGTCGAACTCCGACGATCCGACGAACAACCCCGAGCTGGCGAAGCGCATCTTGAAGCGCGTCAAGCCCGAGGACATCACCGCTGCTCAGGTCCAGATCCAGAAGTGGCTGGCTGCCGAGCAGAAGTTCCATGCCGCTTAA
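The CDS coordinates and the protein length can be cross-checked with a little arithmetic: assuming the range is 1-based and inclusive, 4371 -> 5792 spans 1422 nt, that is 474 codons, or 473 residues plus the stop codon. The sketch below also translates the first codons of the CDS above. It uses the standard codon assignments and assumes the first codon is a start codon read as methionine (the CDS begins with GTG, while the protein begins with M). The names `translate_cds` and `cds_length` are illustrative.

```python
# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nt, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate_cds(cds: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is a start codon and is read as Met,
    even when it is an alternative start such as GTG.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


n_nt = cds_length(4371, 5792)
print(n_nt, n_nt // 3 - 1)  # nucleotides, residues excluding the stop codon
print(translate_cds("GTGAGCTTCCGGATCGTCAACGGG"))  # first 8 codons of the CDS
```

The translated prefix can be compared against the start of the protein sequence listed at the top of the page.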
Gene Ontology
| GO term | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0008233 | peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
- PDB ID
- upi000218e85e_model
- Method
- AlphaFold3 (non-commercial)
- Resolution
- -
- Chain position
- -
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
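The confidence bands listed above can be expressed as a small lookup. This is only a sketch: the legend uses strict inequalities, so the handling of the exact boundary values (90, 70, 50) is an assumption here, with each boundary assigned to the lower band. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band from the legend.

    Assumption: boundary values 90, 70 and 50 fall into the lower band,
    since the legend itself leaves them unassigned.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.2, 81.0, 63.5, 41.7):
    print(score, plddt_band(score))
```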
The structures below correspond to the cluster representative (8MRZu) rather than this protein.