Protein
- Protein accession
- A0AB74UEX9 [UniProt]
- Representative
- 7sgqF
- Source
- UniProt (cluster: phalp2_26079)
- Protein name
- Tail assembly protein
- Lysin probability
- 100%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MLTKKVKSDIAAHVAACLPEEACGLVVMVGRKQVFVPCPNVFEDPTGVRSRKDAFTISDMAWMDAEDMGDVVRVVHSHPGQRELTPSLGDVNGCNGSGVVWAITNEYGDFIEIDPEDPPLVGRRFVLGITDCYGLVMDWHKKQGVNLHDFRVPYNWWETGENLYMDNWYGAGFRECEENTPGAMVIMQISAPVPNHAGIFLPGNQLLHHIYGSLSSVIPFRSGFFRDNVVKWVRHKDLPGDITEWQ
- Physico-chemical properties
- Protein length: 246 AA; molecular weight: 27574,1 Da; isoelectric point: 5,20; hydropathy: -0,19
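The numeric values in this record use a decimal comma (for example `27574,1`), which most parsers do not read as a number by default. The sketch below shows one way to convert them; the helper name `parse_decimal_comma` is hypothetical, and it assumes the values carry no thousands separators.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma string such as '27574,1' to a float.

    Assumption: the input has no thousands separators, so every comma
    is a decimal mark.
    """
    return float(text.strip().replace(",", "."))


# Values copied from the physico-chemical properties listed above.
properties = {
    "molecular_weight_da": parse_decimal_comma("27574,1"),
    "isoelectric_point": parse_decimal_comma("5,20"),
    "hydropathy": parse_decimal_comma("-0,19"),
}

# Mean residue mass in Da, using the reported length of 246 AA.
mean_residue_mass = properties["molecular_weight_da"] / 246

print(properties)
print(round(mean_residue_mass, 2))
```

The same helper applies to the pI and average-length columns further down the record.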
Representative Protein Details
- Accession
- 7sgqF
- Protein name
- 7sgqF
- Sequence length
- 100 AA
- Molecular weight
- 12032,39720 Da
- Isoelectric point
- 4,31882
- Sequence
-
MQDLSQLDWWLVCNGELHIFPKIQPLIGREFIHGTTDCYSIYKDFYYLAGLDMDEFKRQDYWWEKGENLYLENIEGQGFERLSEDAELQVGDVILMQSWR
Other Proteins in cluster: phalp2_26079
Total (incl. this protein): 62; avg length: 243,5 AA; avg pI: 5,20

| Protein ID | Length (AA) | pI |
|---|---|---|
| 7sgqF | 100 | 4,31882 |
| C9EHB0 | 246 | 5,30100 |
| G9JXH6 | 246 | 5,12178 |
| A0A0M7QBK7 | 246 | 5,21176 |
| K4PAQ4 | 246 | 5,21176 |
| A0A1B2AP02 | 246 | 5,21176 |
| A0A0A7DVB8 | 246 | 5,32481 |
| A0A076YQM8 | 246 | 5,32265 |
| W0LM02 | 239 | 5,07563 |
| A0A1B2APZ9 | 246 | 5,21176 |
| A0A343S156 | 246 | 5,11178 |
| A0A3G8F5Q1 | 246 | 5,11178 |
| A0A482N3U4 | 246 | 5,11178 |
| A0A482N4B2 | 246 | 5,02357 |
| A0A482N4U1 | 246 | 5,21904 |
| A0A4V1E2M6 | 246 | 5,19454 |
| A0A653FW65 | 246 | 5,21176 |
| A0A6B9WNQ5 | 246 | 5,11178 |
| A0A6B9WWB4 | 246 | 5,21176 |
| A0A6B9X3G2 | 246 | 5,21176 |
| A0A6B9XGK5 | 246 | 5,33107 |
| A0A6C0R0D8 | 246 | 5,21176 |
| A0A6G5YHE0 | 245 | 4,96292 |
| A0A6M3YRM6 | 246 | 5,21176 |
| A0A7D7FN11 | 246 | 5,10354 |
| A0A7H0XF12 | 246 | 5,45617 |
| A0A7H0XF25 | 246 | 5,45617 |
| A0A7H0XFG1 | 246 | 5,45617 |
| A0A7L8ZKG3 | 246 | 5,45617 |
| A0A8E5NQT7 | 246 | 5,21176 |
| C0LP50 | 246 | 5,11178 |
| K7P800 | 246 | 5,11178 |
| A0A976R6E2 | 246 | 5,31623 |
| A0A976SPR6 | 246 | 5,21176 |
| A0A9E7LUV7 | 246 | 5,21176 |
| A0A9E7MJF2 | 246 | 5,03397 |
| A0A9E7MKU3 | 246 | 5,02357 |
| A0A9E7RZL4 | 246 | 5,21176 |
| A0AA48U0U9 | 246 | 5,11178 |
| A0AAD1Q7C2 | 246 | 5,21176 |
| A0AAD1Q8K3 | 246 | 5,21176 |
| A0AAE7S309 | 246 | 5,33107 |
| A0AAE7VY11 | 246 | 5,21176 |
| A0AAE7VZI1 | 246 | 5,32481 |
| A0AAE7XQF7 | 246 | 5,21904 |
| A0AAE7XQX1 | 246 | 5,21904 |
| A0AAE8B0T3 | 246 | 5,45617 |
| A0AAE8B6H7 | 246 | 5,21176 |
| A0AAE8B7H9 | 246 | 5,20221 |
| A0AAE8YPW1 | 246 | 5,32481 |
| A0AAE8YU49 | 246 | 5,21176 |
| A0AAE8YV83 | 246 | 5,21176 |
| A0AAE8YWU6 | 246 | 5,21176 |
| A0AAE9CD94 | 246 | 5,21176 |
| A0AAE9WZ21 | 246 | 5,21176 |
| A0AAF0FFU8 | 246 | 5,24569 |
| A0AAX4M4M4 | 246 | 5,21176 |
| A0AAX4NW42 | 246 | 5,21176 |
| A0AAX4QWU9 | 246 | 5,21176 |
| A0AB39JCZ5 | 246 | 5,21176 |
| A0AB74UKL4 | 246 | 5,21176 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_33148 7CcD2 | 99 | 44,7% | 96 | 1.265E-22 |
| 2 | phalp2_23201 4Trrv | 17 | 39,4% | 104 | 2.755E-20 |
| 3 | phalp2_18066 3JA4t | 92 | 33,6% | 95 | 1.199E-14 |
| 4 | phalp2_3626 4QVie | 1 | 32,3% | 71 | 1.597E-10 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Escherichia phage vB_Eco_Lzu_P3 [NCBI] | 3348405 | Dhillonvirus > |
| Host | No host information | | |
Coding sequence (CDS)
CDS Source ID
PQ295796 [NCBI]
CDS location
range 39522 -> 40262
strand -
CDS
ATGTTAACGAAAAAGGTTAAAAGCGATATCGCCGCGCACGTTGCGGCATGTCTGCCGGAAGAAGCCTGTGGGCTGGTCGTCATGGTGGGCCGCAAACAAGTATTTGTCCCGTGCCCGAACGTATTCGAAGACCCTACCGGCGTGCGCTCACGCAAAGACGCGTTCACGATTAGTGATATGGCCTGGATGGATGCGGAGGATATGGGCGACGTCGTGCGCGTAGTCCACTCGCATCCAGGCCAGCGAGAGCTTACCCCATCACTGGGCGACGTTAACGGATGCAACGGCAGCGGCGTAGTCTGGGCCATCACTAACGAATATGGCGACTTTATCGAGATCGACCCTGAAGATCCTCCGCTGGTGGGGCGTCGATTTGTTCTCGGAATTACGGATTGTTACGGCCTCGTCATGGACTGGCACAAAAAGCAGGGTGTAAACCTGCATGACTTCCGCGTGCCGTATAACTGGTGGGAGACAGGCGAGAATCTGTATATGGATAATTGGTACGGAGCGGGCTTCAGGGAGTGTGAGGAAAATACGCCGGGGGCAATGGTAATTATGCAAATCAGCGCGCCAGTACCTAACCATGCCGGGATATTCCTTCCGGGCAACCAACTACTTCACCATATCTACGGCAGTCTGTCGAGCGTGATACCCTTCCGGTCAGGATTTTTCCGCGACAATGTGGTTAAATGGGTACGTCATAAAGACCTACCGGGGGATATCACAGAATGGCAATGA
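As a consistency check, the CDS coordinates and sequence can be compared with the protein record. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention, not stated on this page) and uses the standard codon assignments. The helper names `cds_length` and `translate` are hypothetical. Only the first 33 and last 15 nucleotides of the CDS shown above are translated, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


length_nt = cds_length(39522, 40262)
codons = length_nt // 3  # includes the stop codon

# First 33 and last 15 nucleotides of the CDS as displayed above.
cds_start = "ATGTTAACGAAAAAGGTTAAAAGCGATATCGCC"
cds_end = "ACAGAATGGCAATGA"

print(length_nt, codons)
print(translate(cds_start))
print(translate(cds_end))
```

Under these assumptions the range spans 741 nt, that is 247 codons: 246 residues plus one stop codon, which agrees with the reported protein length. The displayed CDS begins with ATG even though the feature lies on the minus strand, so it appears to be given in coding orientation already and no reverse complement is applied here.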
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (7sgqF) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
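The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. This is a minimal sketch with a hypothetical function name, `plddt_band`. The legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band is an assumption made here.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower
    band, because the legend leaves them unspecified.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Illustrative scores only; they are not taken from this record.
for score in (95.2, 81.0, 63.5, 42.0):
    print(score, plddt_band(score))
```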