Protein
- Protein accession
- A0A3Q8HYG9 [UniProt]
- Representative
- 5tXTi
- Source
- UniProt (cluster: phalp2_28907)
- Protein name
- Tail protein
- Lysin probability
- 100%
- PhaLP type
- VAL. Probability: 99% (predicted by ML model)
- Protein sequence
-
MSAKVTVTFYTIKGTYPVVARTTSGSVPRNSTSQFNNGLLAFETQNDSSQDMPSFTIQLTDDYDWSTLLVPNDYVRIDVSYYSNIFSTEAKKVVKTTLACGLISNINRGIDSQSNSRIYTVTCQGVAKIIYNMNLTTFSELTSTLPAYVLLPDDAKKGIRFGNRSSGDIIEQVFNRFITGNNGFTDYAFNNSNLSVPMSNILKLSIIKNSDEAMQTMAYNRFSNYNGTILQMIGDIAAKPFNEIYWTHEDGVATFNYRPTPFDQERWEALERISLSPSDIISEQVSITDTDQYSIFKLLAYSGLGSETYSGGWSGHLAPLTNTQLIRRYGYKMLEVQVDYFNGDTKNQDDDSTGEANQNLSSWAKKYSSKAKSICSKLNASDMYPYLMTIVQLEHGSNSDPINAASHGGTNINGETASLTFGAKFLKSMNGKASDESHKVTDKLALVQAYNFGKGYIDYLSDKNSSSMSLSLNMGYSAKIAKQKGNSSLARIPYSTAVSKKYGKNYLYRNGANFYYGYEAKTYLGGSNDSSSFIIQNSTSPSTEGTTESEARKHYPLYDNLEDMLAYAMGNKTSSTATIARQYGGEAEYQKVLSILRGKPSRSSFYNQVKSLSFPISKVKADAIYTNYKEGKGSVGTRVRREAYLSIVAPTQRITNSKISESYTYLKTLKSMKAHPKKAALQLMEVSNYSIGSKQAYEIIKKFIANKGSISSAEYSAILKKYAFNDTESGVNPLTGNGSINSVPYLFVKYSEKLYNWFADNSKFHSGTITINGTSGIEVGKRLLVKDDKDGVYWEYYIESVSHNWSFQSGWTTAIGVTRGLPLSSESDDRRFTYPKSFWGSYEEFKGGYFGEYDLATAESLYANSDKDDDSDGGSGSGTAEKALNYALDLEKKKGSSSVYDQGYHGSNPFNMGTVRGDCSQLVYFAFKKAGVDISSGGSWTTWNIAKSSKLKTVSKEGGNKSSAYKKLKKGDIVFFNTEGSDSHMAIYAGDDTLVGFQSAPNMLSTFKLQSNSYWSSAFRGHICRLK
- Physico-chemical properties
- protein length: 1027 AA; molecular weight: 113648,4 Da; isoelectric point: 8,81; hydropathy: -0,49
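The hydropathy figure above can be approximated from the sequence alone. The page does not say how it was computed; the sketch below assumes it is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues). The function name `gravy` is illustrative and not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy index per amino acid (one-letter codes).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence.

    Assumption: the database's "hydropathy" field is a GRAVY score.
    Raises KeyError on non-standard residues (e.g. X, B, Z).
    """
    if not sequence:
        raise ValueError("sequence must not be empty")
    residues = sequence.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


if __name__ == "__main__":
    # First four residues of the protein sequence above.
    print(round(gravy("MSAK"), 2))
```

Passing the full 1027-residue sequence to `gravy` should give a value comparable to the reported -0,49 if the GRAVY assumption holds. A negative score indicates a net hydrophilic protein.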
Representative Protein Details
- Accession
- 5tXTi
- Protein name
- 5tXTi
- Sequence length
- 860 AA
- Molecular weight
- 94344,93180 Da
- Isoelectric point
- 5,13781
- Sequence
-
MRNLYTLKANMTITFYTVDGSYPVVSRAVDNQVSPGSHQAFYNGLISFQTKNDTSEDIPTATIVLTDDYDWSSILVPNDYVVITAGYRGDMPYSDKNTTVNSTLYCGLVTDIMKTGSYGDNNANRTFTITVQGMAKVLQSMNLSTFSEITSTLNGYQMLPDDEKTGIAFSGKSSAVLIDEVLKKFILNNNEYTKYLFQDENGQQFSLESLLQTQLQPNTDEAFASNSNNQFMNYNGTILQMVKDMAVRPFNELYWTHEQGVATLHYRPTPFEQSEWTALEEIDLSSDNIISEEVRANDAEQSSIFKLLATDDAGSQLTTTGFSGSIYPLTNRALIQRYGYKTMEVQTPYFSGESNSDDSSYGGTSEAGKGKTESEAQLHYPSYGNIQDYFALARGVNGTDDYDVPPEHGGNHTYDSLLSSLKGGASQSAFVSQATDTGFISETQANNLYAKYKSSGNGTLNKQAYLSIVAPNYNPTSTNVSYSSTYLKSVSKMKDDPEKAAMELMQESKYTLGSKQAYDLVQAAIASKGNVSKATYQKILGEGYNAKQDGVDILSADGSGNQDAVSLLFQMYTQKLFNWYADNSKFHSGTITVQGTLGIENGKRLVVYDNKEKVYWEYYIESVGHNFSYTQGWTTQIGVTRGLALPNQGDTTRRYGFPYSFWGTYEQFVGGYFGEQSMADAVAAADSASGGSDDTSGGSVSGGGDDKFSSGGKTATKAVELASGLSKKAGAGSYNQSYHTQDPFNMSSPKGDCSSLVYFAYKHAGVNLSNGGWTTWKLAKSDKLKTIGSSGSNKDDVYKKMHVGDLIFFKTTAKDDSHVGLVMSDGIIAWNTKKGVSTFSLKSSNYWWNAWRGHCMRLKE
Other Proteins in cluster: phalp2_28907
Total (incl. this protein): 21 | Avg length: 982,1 | Avg pI: 7,13

| Protein ID | Length (AA) | pI |
|---|---|---|
| 5tXTi | 860 | 5,13781 |
| 28x8d | 1062 | 8,72046 |
| 3xPYl | 1081 | 7,49191 |
| 5N1CD | 1035 | 4,90699 |
| 5oeuJ | 880 | 4,97679 |
| 5osb7 | 1026 | 4,94882 |
| 5osdM | 1021 | 4,92126 |
| 7AoOz | 1027 | 8,48367 |
| 7zOqP | 1028 | 8,69061 |
| 7zOwx | 1030 | 8,85636 |
| 81LrB | 1056 | 4,64581 |
| 8LlRD | 1038 | 8,38058 |
| 8LmMN | 1035 | 8,87970 |
| 8LseK | 1080 | 6,04013 |
| A0A3S7UP23 | 1030 | 8,85636 |
| A0A3S6QAA7 | 1035 | 8,87970 |
| A0A4Y5FGK4 | 1080 | 6,04013 |
| Q5ULI0 | 1062 | 8,76939 |
| A0A6G5XWY1 | 404 | 7,76412 |
| A0A6G5YFL1 | 728 | 5,54160 |
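
The cluster summary (21 members, average length 982,1, average pI 7,13) can be reproduced from the table rows plus this protein's own values (1027 AA, pI 8,81). The sketch below is a minimal check, not the database's own code; the helper names are hypothetical, and the only real wrinkle is converting the page's decimal commas before averaging.

```python
from statistics import mean

# Rows copied from the cluster table above: ID, length (AA), pI (decimal comma).
CLUSTER_ROWS = """
5tXTi 860 5,13781
28x8d 1062 8,72046
3xPYl 1081 7,49191
5N1CD 1035 4,90699
5oeuJ 880 4,97679
5osb7 1026 4,94882
5osdM 1021 4,92126
7AoOz 1027 8,48367
7zOqP 1028 8,69061
7zOwx 1030 8,85636
81LrB 1056 4,64581
8LlRD 1038 8,38058
8LmMN 1035 8,87970
8LseK 1080 6,04013
A0A3S7UP23 1030 8,85636
A0A3S6QAA7 1035 8,87970
A0A4Y5FGK4 1080 6,04013
Q5ULI0 1062 8,76939
A0A6G5XWY1 404 7,76412
A0A6G5YFL1 728 5,54160
"""

# The protein described on this page is counted in the total but not listed.
THIS_PROTEIN = "A0A3Q8HYG9 1027 8,81"


def parse_decimal_comma(text: str) -> float:
    """Convert a number written with a decimal comma, e.g. '8,81' -> 8.81."""
    return float(text.replace(",", "."))


def load_members():
    """Return (id, length, pI) tuples for all cluster members."""
    members = []
    for line in CLUSTER_ROWS.strip().splitlines() + [THIS_PROTEIN]:
        protein_id, length, pi = line.split()
        members.append((protein_id, int(length), parse_decimal_comma(pi)))
    return members


members = load_members()
avg_length = mean(m[1] for m in members)
avg_pi = mean(m[2] for m in members)

if __name__ == "__main__":
    print(len(members), round(avg_length, 1), round(avg_pi, 2))
```

With these inputs the averages round to 982.1 and 7.13, matching the summary line, which indicates the reported averages include this protein.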
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_11331 (7jPy7) | 2 | 36,9 | 603 | 6.051E-169 |
| 2 | phalp2_37189 (1QZtJ) | 98 | 28,0 | 884 | 6.530E-111 |
| 3 | phalp2_11026 (4Sv8d) | 15 | 26,4 | 717 | 1.444E-65 |
| 4 | phalp2_33582 (8LHfr) | 135 | 25,4 | 550 | 3.944E-46 |
| 5 | phalp2_1764 (2LOtm) | 25 | 19,8 | 848 | 6.720E-36 |
| 6 | phalp2_5245 (7YnjZ) | 6 | 21,2 | 904 | 1.641E-32 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Lactobacillus phage Bromius [NCBI] | 2315485 | Herelleviridae > Harbinvirus > Harbinvirus bromius |
| Host | No host information | | |
Coding sequence (CDS)
CDS Source ID
MH809531 [NCBI]
CDS location
range 22876 -> 25959
strand +
CDS
ATGTCAGCAAAAGTTACTGTAACGTTCTATACAATTAAAGGTACTTATCCAGTTGTAGCTCGTACTACCAGTGGTTCAGTACCCCGTAATTCAACGTCACAGTTTAATAATGGGCTACTGGCATTTGAAACACAGAATGATAGCTCACAAGATATGCCGTCATTTACGATTCAGTTAACAGATGATTATGATTGGTCTACACTTTTAGTTCCTAATGACTATGTTAGGATTGATGTCAGTTACTATAGCAATATATTTAGTACCGAAGCTAAGAAAGTAGTTAAGACTACTTTAGCCTGTGGACTAATATCAAATATTAACCGTGGTATTGACTCACAATCTAATAGTCGTATTTATACTGTTACGTGCCAAGGCGTTGCCAAGATTATATATAACATGAATCTAACCACATTTTCAGAGCTGACCTCAACACTTCCTGCGTATGTACTATTACCAGATGACGCTAAGAAAGGTATTAGATTTGGCAATCGTTCTTCTGGCGATATTATTGAACAAGTGTTTAATCGTTTCATTACAGGTAACAATGGATTTACAGATTACGCTTTTAATAACTCTAATTTGTCAGTCCCAATGAGTAATATTCTTAAGCTATCAATAATTAAGAACTCAGATGAAGCCATGCAAACAATGGCCTATAATAGATTTTCTAATTATAACGGAACAATTCTTCAAATGATTGGTGATATTGCTGCTAAGCCTTTTAACGAAATTTATTGGACGCATGAGGATGGCGTAGCCACTTTCAACTATCGACCAACTCCATTTGACCAAGAACGTTGGGAAGCTCTAGAAAGAATTAGTCTATCTCCATCTGATATTATATCAGAGCAGGTAAGTATTACAGATACTGACCAATACTCTATTTTTAAACTTTTGGCCTACTCTGGATTAGGTTCTGAGACTTATTCTGGTGGTTGGTCTGGTCATTTGGCTCCTTTAACCAATACACAGTTGATAAGACGTTATGGGTATAAGATGCTTGAAGTACAAGTGGATTATTTTAATGGTGACACTAAGAACCAAGATGATGATTCAACGGGTGAAGCAAACCAAAATTTGTCATCATGGGCTAAAAAATATTCTAGTAAAGCAAAAAGTATTTGTAGTAAATTAAATGCCTCTGATATGTATCCCTATCTAATGACAATTGTTCAATTGGAACATGGTAGTAATTCAGACCCCATTAATGCTGCTAGCCATGGTGGAACTAACATTAACGGGGAGACTGCTAGCTTAACATTTGGTGCGAAATTTTTAAAGAGTATGAACGGTAAGGCATCAGATGAGTCACACAAGGTTACGGACAAACTTGCCTTAGTACAAGCATATAACTTTGGAAAAGGGTATATTGATTACTTATCTGATAAGAACTCTAGCTCGATGTCACTATCATTAAATATGGGCTACTCTGCTAAGATAGCTAAGCAAAAGGGTAACTCTTCACTAGCTAGGATTCCATATAGCACTGCTGTATCAAAGAAATACGGAAAGAATTATTTATACAGAAATGGTGCTAACTTCTACTATGGTTACGAAGCAAAGACCTACCTAGGTGGGTCTAATGACAGTTCTAGTTTTATAATACAGAACTCAACTTCCCCCTCTACGGAGGGAACTACTGAATCAGAGGCAAGGAAGCATTATCCTCTATATGATAACCTTGAAGACATGTTAGCATATGCAATGGGAAATAAAACTAGTAGTACCGCCACAATTGCTAGACAGTATGGTGGGGAAGCAGAGTACCAGAAAGTATTATCTATTCTTAGAGGAAAGCCTTCTCGTTCTAGTTTTTATAATCAAGTAAAGTCCTTGTCTTTCCCAATCAGTAAGGTTAAGGCTGATGCAATATATACAAACTATAAGGAAGGTAAAGGTTCTGTTGGAACACGGGTAAGAAGAGAAGCTTATTTAAGCATTGTTGCACCAACTCAGAGAATAACAAATTCAAAAATATCTGAGAGCTATACGTATTTAAA
AACACTTAAGAGTATGAAAGCACACCCAAAGAAGGCTGCCTTACAGTTGATGGAAGTGTCTAATTATTCTATAGGTAGCAAACAAGCATATGAGATTATCAAAAAATTCATTGCCAACAAAGGTAGCATATCTTCTGCTGAATATAGTGCTATACTTAAGAAGTATGCTTTCAATGATACAGAATCTGGAGTTAACCCTCTTACTGGTAATGGCTCTATCAATTCAGTTCCTTATCTATTTGTAAAATATAGTGAGAAACTGTATAACTGGTTTGCTGATAATAGCAAGTTTCACTCAGGGACGATAACCATTAATGGTACTTCTGGCATTGAAGTTGGCAAGAGATTATTAGTAAAAGATGACAAAGATGGTGTTTATTGGGAATACTATATTGAGTCAGTGTCCCATAACTGGTCTTTTCAATCTGGTTGGACGACTGCTATTGGTGTTACCAGAGGGCTACCATTGTCTTCTGAAAGTGATGATAGACGTTTCACATATCCCAAGAGTTTCTGGGGCTCTTATGAAGAATTTAAAGGTGGATACTTTGGGGAATATGATTTAGCAACCGCAGAATCTTTGTATGCCAACAGCGACAAAGATGACGATTCTGATGGAGGAAGCGGAAGTGGAACTGCTGAGAAAGCTTTAAATTATGCTTTAGACCTAGAAAAGAAGAAAGGTAGCAGTTCTGTTTATGACCAAGGATACCATGGCTCTAATCCATTTAACATGGGTACTGTTCGTGGCGACTGTTCACAATTGGTTTACTTCGCCTTTAAAAAGGCAGGTGTAGACATCTCAAGTGGTGGTAGCTGGACTACTTGGAACATTGCAAAAAGTAGTAAGTTGAAAACAGTAAGTAAAGAGGGCGGAAATAAATCAAGTGCCTATAAGAAATTAAAGAAGGGCGATATTGTCTTCTTTAATACAGAAGGCTCTGATAGCCACATGGCTATCTATGCTGGCGATGATACCCTTGTAGGTTTCCAAAGTGCCCCAAACATGTTATCAACATTCAAATTACAGTCTAACAGTTATTGGTCGTCTGCGTTTAGAGGTCATATTTGTAGACTTAAATAA
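The CDS coordinates can be cross-checked against the protein length: an inclusive range of 22876 to 25959 spans 3084 nucleotides, or 1028 codons, which is 1027 residues plus one stop codon. The sketch below does that arithmetic and translates the first and last codons of the CDS. It assumes the standard codon assignments (the record does not state a translation table), and the function names are illustrative.

```python
# Standard genetic code in NCBI "TCAG" order (assumed; not stated in the record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Residue count implied by the range, excluding the terminal stop codon."""
    n_nt = cds_length(start, end)
    if n_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_nt // 3 - 1


def translate(cds: str) -> str:
    """Translate a plus-strand DNA string codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(cds_length(22876, 25959))               # 3084
    print(expected_protein_length(22876, 25959))  # 1027
    print(translate("ATGTCAGCAAAAGTT"))           # MSAKV (start of the CDS)
    print(translate("CTTAAATAA"))                 # LK*   (end of the CDS)
```

Both translated fragments agree with the protein sequence listed above, which begins with MSAKV and ends with RLK.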
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0008234 | cysteine-type peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (5tXTi) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
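
The confidence bands above translate directly into a small lookup. The sketch below is illustrative only: the function name is hypothetical, and because the legend uses strict inequalities, the handling of the exact boundary values (90, 70, 50) is an assumption; here they fall into the lower band.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the legend's confidence label.

    Assumption: boundary values 90, 70 and 50 are assigned to the lower band,
    since the legend does not say which band they belong to.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_band(score))
```

Applied to the per-residue scores of a predicted model, this gives a quick summary of how much of the structure falls into each band.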