Protein
- Protein accession
- G9JXH6 [UniProt]
- Representative
- 7sgqF
- Source
- UniProt (cluster: phalp2_26079)
- Protein name
- Phage tail assembly protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MLTKKVKSDIAAHVAACLPEEACGLVVMVGRKQVFVPCMNVFEDPTGARSRRDAFTISDMAWMEAEDMGDVVRVVHSHPGQRELTPSLGDVNGCNGSGVVWTITNEYGDFIEIDPEDPPLVGRRFVLGITDCYGLVMDWHKKQGVILPDFRVPYNWWETGENLYMDNWYGAGFRECEENTPGAMVIMQISAPVPNHAGIFLPGNQLLHHIYGSLSSVVPFRAGFFRDNVVKWVRHKDLPGDITEWQ
- Physico‐chemical
properties -
protein length: 246 AA molecular weight: 27581,2 Da isoelectric point: 5,12 hydropathy: -0,15
Representative Protein Details
- Accession
- 7sgqF
- Protein name
- 7sgqF
- Sequence length
- 100 AA
- Molecular weight
- 12032,39720 Da
- Isoelectric point
- 4,31882
- Sequence
-
MQDLSQLDWWLVCNGELHIFPKIQPLIGREFIHGTTDCYSIYKDFYYLAGLDMDEFKRQDYWWEKGENLYLENIEGQGFERLSEDAELQVGDVILMQSWR
Other Proteins in cluster: phalp2_26079
| Total (incl. this protein): 62 | Avg length: 243,5 | Avg pI: 5,20 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 7sgqF | 100 | 4,31882 |
| C9EHB0 | 246 | 5,30100 |
| A0A0M7QBK7 | 246 | 5,21176 |
| K4PAQ4 | 246 | 5,21176 |
| A0A1B2AP02 | 246 | 5,21176 |
| A0A0A7DVB8 | 246 | 5,32481 |
| A0A076YQM8 | 246 | 5,32265 |
| W0LM02 | 239 | 5,07563 |
| A0A1B2APZ9 | 246 | 5,21176 |
| A0A343S156 | 246 | 5,11178 |
| A0A3G8F5Q1 | 246 | 5,11178 |
| A0A482N3U4 | 246 | 5,11178 |
| A0A482N4B2 | 246 | 5,02357 |
| A0A482N4U1 | 246 | 5,21904 |
| A0A4V1E2M6 | 246 | 5,19454 |
| A0A653FW65 | 246 | 5,21176 |
| A0A6B9WNQ5 | 246 | 5,11178 |
| A0A6B9WWB4 | 246 | 5,21176 |
| A0A6B9X3G2 | 246 | 5,21176 |
| A0A6B9XGK5 | 246 | 5,33107 |
| A0A6C0R0D8 | 246 | 5,21176 |
| A0A6G5YHE0 | 245 | 4,96292 |
| A0A6M3YRM6 | 246 | 5,21176 |
| A0A7D7FN11 | 246 | 5,10354 |
| A0A7H0XF12 | 246 | 5,45617 |
| A0A7H0XF25 | 246 | 5,45617 |
| A0A7H0XFG1 | 246 | 5,45617 |
| A0A7L8ZKG3 | 246 | 5,45617 |
| A0A8E5NQT7 | 246 | 5,21176 |
| C0LP50 | 246 | 5,11178 |
| K7P800 | 246 | 5,11178 |
| A0A976R6E2 | 246 | 5,31623 |
| A0A976SPR6 | 246 | 5,21176 |
| A0A9E7LUV7 | 246 | 5,21176 |
| A0A9E7MJF2 | 246 | 5,03397 |
| A0A9E7MKU3 | 246 | 5,02357 |
| A0A9E7RZL4 | 246 | 5,21176 |
| A0AA48U0U9 | 246 | 5,11178 |
| A0AAD1Q7C2 | 246 | 5,21176 |
| A0AAD1Q8K3 | 246 | 5,21176 |
| A0AAE7S309 | 246 | 5,33107 |
| A0AAE7VY11 | 246 | 5,21176 |
| A0AAE7VZI1 | 246 | 5,32481 |
| A0AAE7XQF7 | 246 | 5,21904 |
| A0AAE7XQX1 | 246 | 5,21904 |
| A0AAE8B0T3 | 246 | 5,45617 |
| A0AAE8B6H7 | 246 | 5,21176 |
| A0AAE8B7H9 | 246 | 5,20221 |
| A0AAE8YPW1 | 246 | 5,32481 |
| A0AAE8YU49 | 246 | 5,21176 |
| A0AAE8YV83 | 246 | 5,21176 |
| A0AAE8YWU6 | 246 | 5,21176 |
| A0AAE9CD94 | 246 | 5,21176 |
| A0AAE9WZ21 | 246 | 5,21176 |
| A0AAF0FFU8 | 246 | 5,24569 |
| A0AAX4M4M4 | 246 | 5,21176 |
| A0AAX4NW42 | 246 | 5,21176 |
| A0AAX4QWU9 | 246 | 5,21176 |
| A0AB39JCZ5 | 246 | 5,21176 |
| A0AB74UEX9 | 246 | 5,20301 |
| A0AB74UKL4 | 246 | 5,21176 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_33148
7CcD2
|
99 | 44,7% | 96 | 1.265E-22 |
| 2 |
phalp2_23201
4Trrv
|
17 | 39,4% | 104 | 2.755E-20 |
| 3 |
phalp2_18066
3JA4t
|
92 | 33,6% | 95 | 1.199E-14 |
| 4 |
phalp2_3626
4QVie
|
1 | 32,3% | 71 | 1.597E-10 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Shigella phage EP23 [NCBI] |
1109721 | Dhillonvirus > Dhillonvirus EP23 |
| Host |
Shigella sonnei [NCBI] |
624 | Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Shigella > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
JN984867
[NCBI]
CDS location
range 16513 -> 17253
strand +
strand +
CDS
ATGTTAACGAAAAAGGTTAAAAGCGATATTGCCGCGCACGTTGCGGCATGTCTGCCGGAGGAAGCCTGCGGCCTGGTTGTCATGGTGGGCCGCAAACAAGTTTTTGTCCCGTGCATGAACGTATTCGAAGACCCGACCGGAGCGCGATCACGTAGAGACGCGTTTACCATTAGTGATATGGCCTGGATGGAGGCCGAGGATATGGGCGACGTTGTTCGCGTTGTCCACTCACACCCAGGCCAGCGCGAGCTAACCCCGTCACTTGGCGACGTTAACGGGTGCAACGGCAGCGGCGTCGTGTGGACCATTACTAATGAGTACGGCGACTTTATCGAGATTGACCCGGAAGACCCGCCGCTGGTAGGCCGCCGATTTGTTCTCGGAATTACGGATTGCTATGGCCTCGTAATGGACTGGCACAAAAAGCAGGGCGTGATTCTGCCAGACTTCCGCGTGCCGTACAACTGGTGGGAGACTGGCGAAAACCTGTATATGGATAACTGGTACGGAGCAGGCTTCAGGGAATGCGAGGAGAATACACCAGGGGCGATGGTCATCATGCAGATTAGCGCGCCGGTGCCAAACCATGCGGGGATCTTCCTTCCAGGCAACCAGCTACTACACCATATCTACGGCAGCTTGTCGAGCGTCGTCCCCTTCCGGGCAGGATTTTTCCGCGACAATGTGGTTAAATGGGTGCGTCATAAAGACTTACCTGGGGATATTACAGAATGGCAATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0006508 | proteolysis | biological process | None (UniProt) |
| GO:0008234 | cysteine-type peptidase activity | molecular function | None (UniProt) |
| GO:0008235 | metalloexopeptidase activity | molecular function | None (UniProt) |
| GO:0008270 | zinc ion binding | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000240d37f_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(7sgqF)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50