Protein
- Protein accession
- A0A1P8VVH0 [UniProt]
- Representative
- 4GkQr
- Source
- UniProt (cluster: phalp2_4675)
- Protein name
- Endolysin
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MSIPHNPLAEQVQRRLKARGHDLGTSGPLGDGVDGRAGNLTFAAVLAEIPDIAPVPVPLKPAETALSFAQRQIDLDLLCIAFPANTRAGLEPWVEPTRQACVRWGIDTFREVASFLANINVESAGLTRLSESLNYSVDALIAKFGRHRISVADANRYGRGNGHAADQEALANILYGGPWGAKNLGNTQPGDGWRFRGYGPKQLTGRANQTAFANAIGKPVDEIPAYVRTPEGGMMSAGWFWKSHDLDAKAATPGVEDDRRAINGGTFGLADVERIFDDLIEELLRREKAAA
- Physico‐chemical
properties -
protein length: 291 AA molecular weight: 31362,9 Da isoelectric point: 6,00 hydropathy: -0,29
Representative Protein Details
- Accession
- 4GkQr
- Protein name
- 4GkQr
- Sequence length
- 141 AA
- Molecular weight
- 15440,26500 Da
- Isoelectric point
- 6,27846
- Sequence
-
MTPEIIKSAFPKASDAIIDAILEYAPRYGIDAKQMPMFLAQAGHESGEFTVFCESLNYSADALVKIFSRHRISEADAEKYGRTSGHAANQEMIANLIYGGAWGAKNLGNTQPGDGWMFRGRGIFQLTGRANYVAFVKDSPN
Other Proteins in cluster: phalp2_4675
| Total (incl. this protein): 23 | Avg length: 211,4 | Avg pI: 8,39 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 4GkQr | 141 | 6,27846 |
| 16aWN | 140 | 9,64210 |
| 1KxgC | 217 | 9,38481 |
| 1LBMi | 221 | 9,28617 |
| 2Zueb | 108 | 6,03098 |
| 3fZ6a | 286 | 9,25465 |
| 4NOD1 | 214 | 6,75227 |
| 4NP8L | 216 | 5,79419 |
| 4emA6 | 222 | 8,92599 |
| 4fS2W | 181 | 9,56100 |
| 4g7Jf | 239 | 9,06504 |
| 4o01f | 173 | 10,30780 |
| 56FWf | 214 | 6,75267 |
| 6zjn8 | 214 | 6,75267 |
| 89KzH | 238 | 9,12623 |
| 8iQjP | 241 | 9,06408 |
| A0A023NGE0 | 229 | 6,52821 |
| A0A2R3UA80 | 215 | 9,94014 |
| A0A6J5M118 | 221 | 9,68884 |
| A0A9E7MZJ3 | 213 | 9,49279 |
| A0AAF0I9X9 | 215 | 9,49279 |
| A0AAV2PF34 | 213 | 9,80604 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_26839
3X8d3
|
3 | 52,2% | 111 | 2.876E-39 |
| 2 |
phalp2_5234
8cPFI
|
3482 | 43,2% | 141 | 2.712E-35 |
| 3 |
phalp2_9408
8GIg1
|
520 | 39,4% | 137 | 6.138E-27 |
| 4 |
phalp2_39815
Hj9s
|
55 | 37,8% | 119 | 2.693E-25 |
| 5 |
phalp2_36471
1bFy4
|
16 | 40,2% | 134 | 3.690E-25 |
| 6 |
phalp2_12287
7e8ZJ
|
95 | 36,8% | 122 | 2.215E-23 |
| 7 |
phalp2_29588
83X5u
|
30 | 40,3% | 109 | 2.007E-22 |
| 8 |
phalp2_14082
8g4v0
|
2 | 34,7% | 92 | 4.225E-20 |
| 9 |
phalp2_24890
f0po
|
5 | 36,3% | 143 | 5.241E-16 |
| 10 |
phalp2_31323
2aR1M
|
3 | 32,7% | 116 | 4.225E-14 |
Domains
Domains [InterPro]
1
141 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Erythrobacter phage vB_EliS_R6L [NCBI] |
1913119 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
KY006853
[NCBI]
CDS location
range 35630 -> 36505
strand -
strand -
CDS
ATGTCGATCCCGCACAACCCGCTCGCCGAACAGGTCCAACGCCGGCTCAAGGCCCGGGGGCACGATCTGGGCACAAGCGGCCCGCTCGGCGATGGAGTCGACGGCCGCGCCGGCAATCTGACCTTCGCGGCCGTCCTGGCCGAAATCCCGGACATCGCCCCGGTGCCGGTGCCGCTCAAGCCGGCCGAAACGGCGCTTTCGTTCGCGCAGCGCCAGATCGACCTCGATCTGCTGTGCATCGCGTTCCCGGCCAACACGCGCGCCGGGCTGGAGCCGTGGGTCGAACCAACCCGGCAGGCTTGCGTCCGGTGGGGGATCGACACGTTCCGCGAAGTCGCCAGCTTCCTCGCGAACATCAACGTCGAGAGTGCCGGCCTGACGCGGCTGTCGGAGAGCCTGAACTATTCCGTGGACGCGCTGATCGCCAAGTTCGGCCGCCATCGCATCAGCGTCGCGGACGCCAACCGCTACGGCCGGGGCAATGGCCACGCCGCCGATCAGGAGGCGCTTGCGAATATCCTCTACGGCGGGCCATGGGGCGCGAAGAACCTCGGCAACACGCAACCGGGTGATGGCTGGCGGTTCCGGGGCTACGGCCCCAAGCAGTTGACCGGCCGGGCCAACCAAACGGCGTTCGCCAATGCCATCGGCAAGCCGGTGGACGAGATCCCCGCCTATGTCCGCACGCCCGAGGGCGGCATGATGTCGGCCGGCTGGTTCTGGAAGTCGCACGATCTGGACGCCAAGGCCGCGACGCCGGGCGTCGAGGATGATCGGCGCGCCATCAATGGCGGCACGTTCGGGCTGGCCGATGTCGAGCGCATCTTCGACGATCTGATCGAGGAGTTGCTGCGCCGGGAAAAGGCAGCGGCGTGA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(4GkQr)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50