Protein
- Protein accession
- W6E9P7 [UniProt]
- Representative
- 4Trrv
- Source
- UniProt (cluster: phalp2_23201)
- Protein name
- Tail assembly protein
- Lysin probability
- 100%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MFGAEVEKEIMKYCKSHAPNEACGFINSSNEFVPIKNISSEPEKAFKMKKGSVPDDAKALVHSHPGGPFCPSEADMQQQIVTDIPWGICCFNERHQETFWFGDNVPMPPLIGRGFRHGVTDCYALIRDFYRAIHDIVLPEFPRNWEWWENNKTLYEDGFESAGFHSVNINEILPGDVFLATIPKSTTPNHGGIYLGDGLILHHCAARQPYAPDRLSVVDPAVRWMSYVTKVLRHEDDTISRAVGQKVWA
- Physico-chemical properties
- protein length: 249 AA; molecular weight: 28161,6 Da; isoelectric point: 5,75; hydropathy: -0,36
Representative Protein Details
- Accession
- 4Trrv
- Protein name
- 4Trrv
- Sequence length
- 160 AA
- Molecular weight
- 18882,61480 Da
- Isoelectric point
- 8,97395
- Sequence
-
MRYQMQLAIPFVVMCWPLYDVFWWGDQLAPSPLIGRGFRHGVHDCYSLLRDYYVEKFGVRLIDEPRDWNWWDDKQGLDLYRQHFDAAGFRVIDKREATQVGDGLLMSFNYRVPMHGAVVWDNDLILHHPAGMKPVDPTRLSVVAPRSRFIRHVALALRRK
Other Proteins in cluster: phalp2_23201
Total (incl. this protein): 17; avg length: 187,5 AA; avg pI: 6,00

| Protein ID | Length (AA) | pI |
|---|---|---|
| 4Trrv | 160 | 8,97395 |
| 2lmHD | 132 | 6,19297 |
| 4c57h | 141 | 6,47762 |
| 4yYsK | 138 | 6,06054 |
| 5D3ce | 151 | 6,61790 |
| 5hghx | 136 | 4,98901 |
| 6N1oT | 165 | 7,86432 |
| 6REMw | 151 | 5,09621 |
| 8Bzmo | 138 | 4,94536 |
| 8kxXo | 227 | 6,19706 |
| vig1 | 131 | 5,28878 |
| V9QKQ8 | 279 | 5,11837 |
| A0A2I6PHT8 | 244 | 5,82943 |
| A0A481W6P3 | 245 | 5,84893 |
| A0A7G7WXW9 | 251 | 5,11428 |
| A0A975YYJ7 | 250 | 5,57860 |
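
The cluster summary figures can be cross-checked against the member table. The following is a minimal Python sketch, assuming that the 17th member is this protein (W6E9P7, 249 AA, pI 5,75), which the table itself does not list, and that all values use a decimal comma. The helper name `parse_decimal` is hypothetical.

```python
from statistics import mean


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma string such as '8,97395' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) copied from the cluster table.
# The last entry is this protein (W6E9P7); adding it is an assumption
# based on the "incl. this protein" note in the summary line.
members = [
    ("4Trrv", 160, "8,97395"),
    ("2lmHD", 132, "6,19297"),
    ("4c57h", 141, "6,47762"),
    ("4yYsK", 138, "6,06054"),
    ("5D3ce", 151, "6,61790"),
    ("5hghx", 136, "4,98901"),
    ("6N1oT", 165, "7,86432"),
    ("6REMw", 151, "5,09621"),
    ("8Bzmo", 138, "4,94536"),
    ("8kxXo", 227, "6,19706"),
    ("vig1", 131, "5,28878"),
    ("V9QKQ8", 279, "5,11837"),
    ("A0A2I6PHT8", 244, "5,82943"),
    ("A0A481W6P3", 245, "5,84893"),
    ("A0A7G7WXW9", 251, "5,11428"),
    ("A0A975YYJ7", 250, "5,57860"),
    ("W6E9P7", 249, "5,75"),
]

lengths = [length for _, length, _ in members]
pis = [parse_decimal(pi) for _, _, pi in members]

avg_length = round(mean(lengths), 1)  # average length in AA, one decimal
avg_pi = round(mean(pis), 2)          # average pI, two decimals

print(len(members), avg_length, avg_pi)
```

Under these assumptions the computed values agree with the reported total of 17 members, average length of 187,5 AA, and average pI of 6,00.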
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_20055 (1yh76) | 36 | 48,3% | 153 | 3.618E-56 |
| 2 | phalp2_14456 (4GZZ7) | 1941 | 37,1% | 164 | 9.932E-50 |
| 3 | phalp2_10001 (6WaO6) | 89 | 37,5% | 157 | 3.507E-49 |
| 4 | phalp2_27124 (2S85x) | 12283 | 37,1% | 164 | 3.284E-45 |
| 5 | phalp2_16601 (8Ajl) | 621 | 30,4% | 161 | 2.712E-43 |
| 6 | phalp2_30311 (4w8xT) | 423 | 41,8% | 129 | 3.067E-41 |
| 7 | phalp2_33889 (1NGRy) | 1 | 67,6% | 105 | 2.685E-30 |
| 8 | phalp2_8226 (7u1LL) | 5 | 35,3% | 130 | 6.239E-29 |
| 9 | phalp2_18066 (3JA4t) | 92 | 30,6% | 147 | 9.545E-27 |
| 10 | phalp2_15655 (2TlsE) | 4 | 33,3% | 135 | 5.670E-25 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Rhizobium phage vB_RglS_P106B [NCBI] | 1458697 | Rigallicvirus > Rigallicvirus P106B |
| Host | Rhizobium gallicum [NCBI] | 56730 | Proteobacteria > Alphaproteobacteria > Rhizobiales > Rhizobiaceae > Rhizobium > |
Coding sequence (CDS)
- CDS Source ID
- KF977490 [NCBI]
- CDS location
- range 22862 -> 23611, strand +
- CDS
-
ATGTTCGGTGCAGAAGTCGAAAAAGAAATAATGAAATATTGTAAGTCGCACGCTCCAAACGAAGCATGCGGCTTTATCAATTCCAGCAACGAATTCGTGCCGATCAAGAATATTTCGAGCGAACCGGAAAAAGCCTTTAAGATGAAAAAAGGCAGCGTTCCGGACGATGCTAAAGCGTTGGTGCACTCGCATCCAGGCGGGCCGTTTTGCCCGTCCGAAGCGGACATGCAACAGCAAATCGTAACCGATATCCCTTGGGGAATTTGCTGCTTTAATGAGCGTCATCAAGAAACGTTTTGGTTTGGTGATAATGTTCCGATGCCGCCACTAATCGGACGCGGTTTTAGACACGGTGTAACAGATTGTTATGCGTTGATCCGCGATTTTTACCGTGCAATTCATGATATTGTTTTGCCTGAATTTCCGCGAAATTGGGAATGGTGGGAAAACAACAAAACGCTTTATGAGGACGGCTTTGAAAGCGCTGGTTTCCACAGTGTCAACATCAACGAAATATTGCCGGGTGATGTTTTCTTAGCTACTATCCCGAAATCCACCACGCCCAACCACGGCGGGATTTACTTAGGTGACGGTTTGATTTTACATCATTGCGCCGCACGTCAACCTTACGCACCAGACCGTTTAAGTGTTGTTGATCCTGCTGTTCGATGGATGAGCTACGTAACGAAAGTTTTAAGACATGAAGACGATACAATTAGCCGGGCGGTTGGCCAAAAGGTTTGGGCGTGA
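
The CDS coordinates can be checked against the reported protein length. The sketch below is a minimal Python example, assuming the range is 1-based and inclusive (the usual GenBank convention, not stated on this page) and that the final codon is a stop codon. The function name `cds_summary` is hypothetical.

```python
def cds_summary(start: int, end: int) -> dict:
    """Summarise a CDS from 1-based, inclusive coordinates (assumed convention)."""
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nt_length // 3
    return {
        "nt_length": nt_length,
        "codons": codons,
        # Assumes the last codon is a stop codon and is not translated.
        "aa_length": codons - 1,
    }


summary = cds_summary(22862, 23611)
print(summary)
```

Under these assumptions the range spans 750 nucleotides, that is 250 codons including the stop codon, which is consistent with the protein length of 249 AA listed above.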
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0006508 | proteolysis | biological process | None (UniProt) |
| GO:0008234 | cysteine-type peptidase activity | molecular function | None (UniProt) |
| GO:0008235 | metalloexopeptidase activity | molecular function | None (UniProt) |
| GO:0008270 | zinc ion binding | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (4Trrv) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
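
The confidence bands in the legend can be expressed as a small lookup function. This is a minimal Python sketch, not the viewer's own implementation. The legend uses strict inequalities and does not say where the boundary values 90, 70 and 50 belong, so assigning them to the lower band is an assumption, as is the function name `plddt_band`.

```python
def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0 to 100) to a confidence band."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    # Boundary values (90, 70, 50) fall into the lower band: an assumption,
    # since the legend only gives strict inequalities.
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.2, 80.0, 61.5, 32.0):
    print(value, plddt_band(value))
```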