Protein
- Protein accession
- A0A0U4K3I2 [UniProt]
- Representative
- 6Fcrq
- Source
- UniProt (cluster: phalp2_19946)
- Protein name
- Endolysin
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MSKTMTAAQFLAALKAEGVPVRERKGWRTHNRNHKGASGPHNGVMVHHTASGSDGIESFVSNGNSALPGPLCHGLIEKSGRVVLIGWGRANHAGGGDPDVLAAVRAERWPVPKTNEHDGSAGSVDGNAHFVGYECVNQGNGKDPWPAIQLEGIARACAAVCRFHGWTVDSVIRHLDWSDWKVDPRGIDWNKMRARITTILKGKPNATPFASWADGDGGDAEQPEQPQPEQPDADAYPGAAAFGPGKSGLHVTRLGQMLVKRGGSRFYSVGPGPRWGEADRKATEAFQRAQGWSGSDADGIPGGTTWRLLVTGTGKSIPAAAPAKKPKVSLANIIAASRKDPGAPQGKTSYPADVRLVEAALKKLGFLGATYASDGAYGTVTVAAYNAFRRSIGLKGADATGDPGRLSLGTLGSRSGLFTV
- Physico‐chemical
properties -
protein length: 420 AA molecular weight: 44058,0 Da isoelectric point: 9,55 hydropathy: -0,42
Representative Protein Details
- Accession
- 6Fcrq
- Protein name
- 6Fcrq
- Sequence length
- 445 AA
- Molecular weight
- 46445,37780 Da
- Isoelectric point
- 7,19038
- Sequence
-
MPDRWMPGAEIHDIGDHAPTDGGPAKAIAHITWDKNASAGAPQDWVSFDALVNYFTGSGAGAAPHIIWDPFSGRIAQLVPADSRSKSVVDSAGGTRTNRAGSVVIQVEAVFFPYCRKGGQVYPRLVDTPCAGWDRLHAWIASWGVPDMWPMGRPVDFTSHRSESVWESQGGWYAHAHVPENDHQDPGSWPAFSSSPSPAPPVPATTRVTVRVGQTLTAIAAAAGVALAVILGLNPDVARHPDAIRPGDSIVVPAVPGQVPVPSQDPVPAPSAPGGGFPGASTFGPGASNANVTLLGQMLVARGAARFYAVGPGPAWGDADRRATEAFQLAQGWTGSDADGIPGATTWDYLLTGKGHDIPAAAKVASAPAFPGAAKFGPGQSNAYVTQLGQQLVRKGYGRYYTKGPGPTWGKADRLNVQAFQRAQGWRGSGADGIPGPRTWALLFS
Other Proteins in cluster: phalp2_19946
| Total (incl. this protein): 4 | Avg length: 435,0 | Avg pI: 8,67 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 6Fcrq | 445 | 7,19038 |
| 7uicA | 453 | 8,44447 |
| A0A0E3JQE4 | 422 | 9,49395 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_3990
7zjtr
|
195 | 50,7% | 349 | 4.340E-117 |
| 2 |
phalp2_16564
7umlh
|
28 | 35,6% | 451 | 2.417E-73 |
| 3 |
phalp2_19394
2oOR4
|
46 | 32,0% | 321 | 9.517E-27 |
| 4 |
phalp2_8051
5Ickg
|
15 | 26,5% | 463 | 2.181E-14 |
Domains
Domains [InterPro]
1
445 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Streptomyces phage Maih [NCBI] |
1775283 | Woodruffvirus > Woodruffvirus TP1604 |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
KU189325
[NCBI]
CDS location
range 25551 -> 26813
strand +
strand +
CDS
ATGTCGAAGACGATGACCGCCGCGCAGTTCCTCGCGGCACTGAAGGCCGAGGGCGTCCCGGTCCGTGAGCGCAAGGGCTGGCGTACCCACAACCGGAACCACAAGGGCGCGTCCGGCCCGCACAACGGCGTCATGGTGCACCACACCGCCTCGGGCTCGGACGGTATCGAGTCGTTCGTCAGCAACGGCAACAGCGCGCTGCCGGGGCCGCTGTGCCACGGGCTGATCGAGAAGTCCGGGCGCGTGGTCCTGATCGGGTGGGGACGCGCGAACCACGCGGGCGGCGGTGACCCTGACGTCCTGGCCGCTGTCCGTGCCGAGCGCTGGCCGGTGCCGAAGACGAACGAGCACGACGGGTCCGCCGGATCGGTGGACGGCAACGCGCACTTCGTCGGGTACGAGTGCGTCAACCAGGGAAACGGGAAGGACCCGTGGCCGGCCATCCAGCTGGAAGGCATCGCGCGCGCCTGCGCGGCGGTGTGCCGCTTCCACGGCTGGACCGTGGACAGCGTGATTCGTCACCTGGACTGGTCGGACTGGAAGGTGGACCCGCGCGGCATCGACTGGAACAAGATGCGGGCCCGTATCACGACCATCCTGAAGGGGAAGCCGAACGCCACGCCGTTCGCGTCGTGGGCCGATGGCGACGGGGGCGACGCCGAGCAGCCCGAGCAGCCGCAGCCCGAGCAGCCGGACGCTGACGCCTACCCCGGCGCTGCTGCCTTCGGCCCCGGCAAGAGCGGCCTGCACGTGACGCGCCTCGGGCAGATGCTGGTGAAGCGCGGCGGTAGCCGCTTCTACAGCGTGGGTCCCGGTCCCCGGTGGGGTGAGGCGGACCGGAAGGCCACCGAGGCGTTCCAGCGGGCGCAGGGCTGGTCCGGCAGCGACGCGGACGGCATCCCCGGCGGGACCACGTGGCGGCTGCTGGTCACCGGGACCGGCAAGAGCATCCCCGCAGCCGCGCCCGCGAAGAAGCCGAAGGTGTCCCTGGCCAACATCATCGCGGCGTCGCGGAAGGACCCCGGTGCCCCGCAGGGAAAGACGTCCTATCCGGCGGACGTGCGGCTGGTGGAAGCCGCGCTGAAGAAGCTCGGGTTCCTCGGGGCCACGTACGCGTCCGATGGCGCGTACGGAACCGTGACGGTGGCCGCGTACAACGCGTTCCGCCGCAGCATCGGACTGAAGGGCGCGGACGCCACCGGCGACCCCGGCCGGCTGTCCCTCGGGACACTGGGCAGCCGCTCGGGTCTCTTCACCGTGTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0008745 | N-acetylmuramoyl-L-alanine amidase activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0005feb3ef_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(6Fcrq)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50