Protein
- Protein accession
- A0A7S5RGQ1 [UniProt]
- Representative
- 7w9xW
- Source
- UniProt (cluster: phalp2_36235)
- Protein name
- Putative internal virion protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MADNNDELATARQFLYGRTNKPQSAIDGLRGDFATKLSRMMQDPDAPVGLGLYSGYRSPERQAELWAGALKKYGSPEAARKWVAPPGHSMHNEGLAADMGYNGQSLQHAPKEVVDWVHSNADRYGLKFPLGNEAWHIEDSSTRGGKPTSVSDQAFAAGGFQQSGGTYQPVTNPPVPDWVKDSAPNDTFQTTPNVAPPTWGAVGSSILHSMPTYQVAQWVGNGSAAPDMPWLNNPDLGKFGKTLDGLNDDEQKWVIGTALSPQHAVELRSQLDQQKVDAGTATSSPTAFAGMLVANVLDPVSLAASAATGPAVVGASNVLRASTAAGVLTRMAEAGTIGIADATLASVIQAQVDPNFGVEDYVMNGLTGMALGAGIGTLTRGAELDGVSAEATAAFGRRARDFIAQTAEDRGYKFASPEARDAFTGAGKSAGAAENIASAAELFPDRRGFLERMADRLGSWSSNENLLMNKTGTAVRGIYEKLMPNLSGTGGRELRASEDAWSFMKRSTEVDHANMEKTFSQHFDGWLKESGMENVSFLKRDHLEEEFSQRVYYAMAHPDVANGSEAIEKMADAYRSAFGSKLDEAKALGVSWAQDVPKDDYYVPFKVSKRAYYEVTAKIGRDGLIDVIKQAFLRAQPDLYKAERAKKPGTKGRKLAADVRERKVSRLAERYLKMIEDTNIDNNGAERMTGLAGDSARALRDMLTEEGVGLEEIEDIISAFGWSPKEGPSNFRRRAVMERDEPFIPSKYASHPNARDYAVSLRQITDQDIRSVYSSYSRSVNGHLALAKAGFKSEADARAQIKDATNFRTTIEKGPEGKQNVFEQDMTTASEELNYFVDRILGRPAFPGVSKKTRLALSLVKNLGFIRFMQQSGISQLGDMPKILLRYGIGATWRQSRMRDFLDVFTYRDTEANALTREIQASTGIGIKTKNAKMYALHEDINIGGELFDEDLTTKMLAKANAVSESASRLTATVSLLNPITDTLQLWAARAASQRLIDIATGRKVISRKWMNEIGINERDLADMREIAKRMTLEGDGKIVRWNSDKQRAAGMDHADAYDRFLALVRRETQMAILETAPNAIKRSMSGPMMGLFWQLKSYMMNALLANTAKNIKLGPAYMSMSLVATSVWSAAIYAGQQYSVSLGMPGEDRKKFLEDRLSLRGILSAGFQRSADSSILPMIIDTTISGVDVFTGEDHRLFSNTRNSGLGSNITDSIPALRMVQDMGKLAKNVAAAAARTDQRFDQTEGRQMRDFIPLLRTYGLLNATNALIATMPPDTDGAIVKAK
- Physico‐chemical
properties -
protein length: 1285 AA molecular weight: 140301,8 Da isoelectric point: 6,82 hydropathy: -0,40
Representative Protein Details
- Accession
- 7w9xW
- Protein name
- 7w9xW
- Sequence length
- 943 AA
- Molecular weight
- 101335,61320 Da
- Isoelectric point
- 5,06387
- Sequence
-
MARIRSLQSEGRAGSGVPSAVTGGGGFAAKLADVAGSLSTRIYDLAQGAAQRAGLLAGTQQVEAPASGQYATGDPYAAKGYLQQHTDKGQEAIHGLDDNFSVKLANLFQAAPDNIRSGLGIYSGYRSNEHQAKLYSAALAKYGSEAEARKWVAPPGHSQHNKGMAADVSYNGVSLSKAPKEVTDWIHQNAAAYGLKFPLSNENWHIEDDSTRGGKGTTIDPRPLALRRDGTVFGEAYDRAASSAYLWRVQSGLSRELFQAQQDHPDDPYAFSAAKEEIRTRYLNDPALSDPLLREAFQKGFEQTSDGYQRQTAIAYGAKLKAEEQSAFASGIDAMGVNIERQAYTYGANPDGDKVVGDLVTSSQRSIDAAIDAGTVTPAAGETLKQDIAMRASQARIQGTYDALPTPESKQEFAAGLVDRWKEQDPALGAFNDAFMSDIQSLQRTLLSNAASLTTAQKQANATRKAQLSTLMDDDIASVLASGKGLDPSAGLSMAELEQYFTPAEIAKFRDERNLSLDIHDATSGMDAMTADDIAALVEDMKPEPGQNGYADQQKIYDAASKKAAAILKARETDPLGQAAAAGIVEIQPIDSTDSGTITQSLVARSQAARIAGGVLGTEMPLFTKAEVDALKVQGKSTDPALFGAVMQQMDFLANSDGLLSVKQTFGAPMMEDLQVWQSKMRYATQAEAQDWLKQHADPQWQERVKPLVSSGEAKAREVPFQDIVSELDPNWIADIGAPIDDASKRAMQNDYTMLVGQFYSRIGDIDAAQKQAIEAMKTVWGRTDALGGRGGRLMAYPPEKFYPAVASNQRYLKTEMQDLAKAAGVEIDQLSLVSDAKTEAAADRHEAPGYLISIVDPQTGFDELLTDDAGRPLRHFFDPEAARKDAMAKAEQDRSDANGMAPMTPAERKAARQQGGTAAPLEINIPDNKPKPSFGRSRSPGH
Other Proteins in cluster: phalp2_36235
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_21335
1NS0y
|
26 | 45,1% | 685 | 5.722E-176 |
| 2 |
phalp2_3986
7yjZk
|
1 | 26,4% | 833 | 1.987E-67 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Rhizobium phage RHph_I20 [NCBI] |
2509730 | Autographiviridae > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MN988539
[NCBI]
CDS location
range 8501 -> 12358
strand +
strand +
CDS
GTGGCAGACAACAACGACGAGCTGGCAACGGCTCGGCAATTCCTCTATGGCCGCACGAACAAGCCGCAGTCGGCGATCGACGGTTTGCGCGGCGACTTCGCTACCAAGCTCTCCCGCATGATGCAGGACCCGGACGCACCAGTCGGCCTCGGCCTCTATTCAGGCTATCGCAGCCCCGAGCGCCAAGCCGAGCTTTGGGCGGGCGCCCTGAAGAAGTACGGCTCGCCTGAAGCTGCTCGCAAGTGGGTCGCGCCGCCCGGTCACTCCATGCATAACGAGGGCCTGGCTGCGGATATGGGCTACAACGGCCAGTCCCTCCAGCACGCTCCGAAGGAGGTCGTGGATTGGGTCCACAGCAACGCCGATCGCTATGGCCTGAAGTTCCCGCTTGGGAATGAGGCTTGGCACATCGAAGACAGCTCGACGCGCGGAGGCAAGCCGACCAGCGTCAGCGATCAGGCATTCGCGGCTGGAGGCTTTCAGCAGTCCGGTGGAACATACCAGCCGGTGACAAACCCGCCGGTTCCGGATTGGGTCAAGGACTCGGCGCCCAATGACACATTCCAAACCACGCCGAACGTCGCGCCGCCGACTTGGGGCGCAGTGGGCTCTTCTATCCTCCACAGTATGCCGACTTATCAAGTGGCGCAGTGGGTCGGGAATGGCTCCGCCGCACCTGATATGCCATGGCTCAACAACCCCGACCTTGGCAAATTCGGCAAGACCCTGGACGGTCTCAACGATGATGAGCAGAAATGGGTGATCGGGACTGCCCTGTCGCCGCAGCACGCGGTTGAACTGCGCAGCCAGCTAGACCAGCAGAAGGTCGATGCAGGAACGGCCACGTCTTCACCGACGGCCTTCGCGGGCATGCTCGTGGCGAACGTACTCGACCCGGTCTCTCTCGCCGCCAGTGCTGCAACAGGCCCGGCCGTGGTGGGGGCTTCTAACGTTCTGCGCGCTTCGACGGCGGCGGGCGTCCTCACGAGAATGGCGGAGGCCGGAACGATCGGGATAGCCGATGCGACCCTCGCTTCAGTTATCCAGGCGCAGGTAGACCCCAACTTCGGCGTTGAGGACTACGTCATGAACGGGCTCACCGGCATGGCGTTGGGCGCTGGCATAGGCACCTTGACCCGTGGCGCCGAGTTGGACGGTGTTTCGGCCGAAGCCACCGCAGCCTTCGGGCGCCGGGCTCGCGACTTCATCGCACAAACGGCAGAAGACCGCGGATATAAGTTCGCCTCCCCCGAAGCGCGTGACGCCTTCACCGGTGCTGGCAAGTCCGCCGGTGCCGCCGAGAACATCGCGAGCGCCGCCGAACTCTTCCCCGATCGGCGCGGCTTCCTGGAGCGCATGGCCGACCGCCTCGGCTCTTGGTCATCCAATGAGAACCTTCTCATGAACAAGACCGGGACTGCGGTGCGCGGCATCTATGAGAAGCTTATGCCAAATCTCTCCGGCACCGGAGGCCGCGAGCTGCGCGCCAGCGAAGACGCCTGGAGCTTCATGAAGCGGTCGACCGAAGTGGACCACGCGAACATGGAGAAGACCTTCTCTCAGCACTTCGACGGCTGGCTGAAGGAAAGCGGTATGGAAAACGTCAGCTTCCTCAAGCGAGACCATCTCGAAGAGGAGTTCTCACAGCGCGTCTATTACGCCATGGCCCACCCGGACGTGGCTAACGGCTCGGAGGCGATCGAGAAAATGGCCGACGCCTACCGATCGGCTTTCGGAAGCAAGCTGGACGAGGCAAAAGCGCTCGGCGTCTCGTGGGCGCAGGACGTTCCGAAGGACGATTACTATGTCCCGTTCAAGGTTTCGAAGCGCGCCTATTACGAGGTGACGGCGAAGATCGGCCGCGATGGCCTGATCGACGTCATCAAGCAAGCGTTCTTGCGTGCGCAGCCGGACCTTTACAAGGCCGAGCGCGCGAAGAAGCCGGGCACCAAGGGCCGCAAGCTGGCCGCGGACGTGCGCGAGCGGAAGGTCTCCAGACTGGCCGAGCGCTATCTGAAGATGATCGAAGACACGAACATCGACAACAACGGCGCCGAGCGCATGACCGGCTTGGCTGGCGACAGCGCCCGCGCACTGCGGGACATGCTCACGGAAGAAGGCGTCGGGCTGGAGGAGATCGAAGATATCATCTCCGCCTTTGGCTGGTCGCCGAAGGAAGGACCGAGCAACTTCCGGCGCCGGGCTGTGATGGAGCGGGACGAGCCCTTCATTCCGTCCAAGTATGCATCCCATCCGAATGCTCGGGACTATGCCGTCTCTCTCCGGCAGATAACGGACCAAGACATCCGCTCGGTCTACAGCTCATATAGCCGATCCGTGAACGGGCATCTCGCCCTGGCTAAGGCTGGCTTCAAGTCCGAGGCCGACGCCCGCGCACAGATCAAGGACGCGACCAACTTCCGGACCACCATTGAGAAGGGACCGGAGGGCAAGCAGAACGTCTTTGAGCAGGACATGACCACGGCCAGCGAAGAGCTGAACTACTTCGTGGACCGCATCCTCGGGCGCCCGGCATTCCCAGGTGTCAGCAAGAAGACCCGCCTCGCCCTGTCGCTGGTCAAGAACCTCGGCTTCATCCGCTTCATGCAGCAGTCGGGCATATCTCAGCTCGGTGACATGCCGAAGATTCTGCTCAGGTATGGCATCGGCGCGACGTGGCGGCAGTCGAGAATGCGGGACTTCCTTGACGTGTTCACCTATCGGGACACCGAAGCGAACGCCCTCACCCGCGAAATCCAGGCATCAACCGGCATTGGAATCAAGACCAAGAATGCGAAGATGTACGCGCTGCACGAGGACATCAACATCGGCGGCGAGCTGTTCGATGAAGATCTAACCACGAAGATGCTGGCCAAGGCGAACGCCGTTTCGGAAAGCGCCTCGCGCCTCACAGCCACCGTCTCGCTCCTCAATCCGATCACTGACACGCTTCAGCTTTGGGCGGCGCGCGCAGCCAGCCAACGGCTCATCGACATCGCCACCGGCCGCAAGGTGATCAGTCGGAAGTGGATGAACGAGATCGGCATCAACGAGCGTGACCTCGCCGACATGCGCGAGATCGCCAAGCGGATGACGCTTGAGGGCGATGGCAAGATTGTCCGCTGGAATAGCGACAAGCAGCGTGCGGCCGGGATGGACCACGCCGATGCCTACGACCGGTTCCTCGCGCTCGTGCGCCGGGAAACGCAGATGGCCATTCTCGAGACCGCGCCGAACGCCATCAAGCGGTCGATGTCCGGCCCGATGATGGGTCTCTTCTGGCAGCTCAAGAGCTACATGATGAACGCCCTCCTCGCCAACACGGCGAAGAACATCAAGCTCGGCCCTGCCTACATGTCCATGTCCCTTGTTGCCACGTCCGTATGGTCGGCGGCGATCTACGCGGGGCAGCAATATTCGGTCTCGCTTGGGATGCCGGGCGAGGATCGCAAGAAATTCCTCGAAGATCGTCTTTCGCTTCGGGGTATCCTGTCGGCGGGCTTCCAACGCTCGGCCGATAGCTCCATCCTCCCCATGATTATCGACACGACCATCTCCGGCGTGGACGTGTTTACGGGTGAGGACCACCGGCTATTCAGCAACACACGCAACTCCGGCCTCGGCTCCAACATCACCGACAGCATTCCCGCGCTGCGGATGGTGCAGGACATGGGCAAGCTGGCGAAGAACGTAGCGGCTGCGGCTGCACGGACTGACCAGCGTTTCGACCAGACGGAAGGGAGGCAGATGCGGGACTTCATTCCGCTTCTCCGCACATACGGCTTGCTGAACGCCACCAACGCGCTGATCGCCACCATGCCGCCGGACACTGACGGGGCGATCGTCAAGGCAAAGTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0006508 | proteolysis | biological process | None (UniProt) |
| GO:0008233 | peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(7w9xW)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50