Protein

Protein accession
A0A7S5RGQ1 [UniProt]
Representative
7w9xW
Source
UniProt (cluster: phalp2_36235)
Protein name
Putative internal virion protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MADNNDELATARQFLYGRTNKPQSAIDGLRGDFATKLSRMMQDPDAPVGLGLYSGYRSPERQAELWAGALKKYGSPEAARKWVAPPGHSMHNEGLAADMGYNGQSLQHAPKEVVDWVHSNADRYGLKFPLGNEAWHIEDSSTRGGKPTSVSDQAFAAGGFQQSGGTYQPVTNPPVPDWVKDSAPNDTFQTTPNVAPPTWGAVGSSILHSMPTYQVAQWVGNGSAAPDMPWLNNPDLGKFGKTLDGLNDDEQKWVIGTALSPQHAVELRSQLDQQKVDAGTATSSPTAFAGMLVANVLDPVSLAASAATGPAVVGASNVLRASTAAGVLTRMAEAGTIGIADATLASVIQAQVDPNFGVEDYVMNGLTGMALGAGIGTLTRGAELDGVSAEATAAFGRRARDFIAQTAEDRGYKFASPEARDAFTGAGKSAGAAENIASAAELFPDRRGFLERMADRLGSWSSNENLLMNKTGTAVRGIYEKLMPNLSGTGGRELRASEDAWSFMKRSTEVDHANMEKTFSQHFDGWLKESGMENVSFLKRDHLEEEFSQRVYYAMAHPDVANGSEAIEKMADAYRSAFGSKLDEAKALGVSWAQDVPKDDYYVPFKVSKRAYYEVTAKIGRDGLIDVIKQAFLRAQPDLYKAERAKKPGTKGRKLAADVRERKVSRLAERYLKMIEDTNIDNNGAERMTGLAGDSARALRDMLTEEGVGLEEIEDIISAFGWSPKEGPSNFRRRAVMERDEPFIPSKYASHPNARDYAVSLRQITDQDIRSVYSSYSRSVNGHLALAKAGFKSEADARAQIKDATNFRTTIEKGPEGKQNVFEQDMTTASEELNYFVDRILGRPAFPGVSKKTRLALSLVKNLGFIRFMQQSGISQLGDMPKILLRYGIGATWRQSRMRDFLDVFTYRDTEANALTREIQASTGIGIKTKNAKMYALHEDINIGGELFDEDLTTKMLAKANAVSESASRLTATVSLLNPITDTLQLWAARAASQRLIDIATGRKVISRKWMNEIGINERDLADMREIAKRMTLEGDGKIVRWNSDKQRAAGMDHADAYDRFLALVRRETQMAILETAPNAIKRSMSGPMMGLFWQLKSYMMNALLANTAKNIKLGPAYMSMSLVATSVWSAAIYAGQQYSVSLGMPGEDRKKFLEDRLSLRGILSAGFQRSADSSILPMIIDTTISGVDVFTGEDHRLFSNTRNSGLGSNITDSIPALRMVQDMGKLAKNVAAAAARTDQRFDQTEGRQMRDFIPLLRTYGLLNATNALIATMPPDTDGAIVKAK
Physico‐chemical
properties
protein length:1285 AA
molecular weight:140301,8 Da
isoelectric point:6,82
hydropathy:-0,40
Representative Protein Details
Accession
7w9xW
Protein name
7w9xW
Sequence length
943 AA
Molecular weight
101335,61320 Da
Isoelectric point
5,06387
Sequence
MARIRSLQSEGRAGSGVPSAVTGGGGFAAKLADVAGSLSTRIYDLAQGAAQRAGLLAGTQQVEAPASGQYATGDPYAAKGYLQQHTDKGQEAIHGLDDNFSVKLANLFQAAPDNIRSGLGIYSGYRSNEHQAKLYSAALAKYGSEAEARKWVAPPGHSQHNKGMAADVSYNGVSLSKAPKEVTDWIHQNAAAYGLKFPLSNENWHIEDDSTRGGKGTTIDPRPLALRRDGTVFGEAYDRAASSAYLWRVQSGLSRELFQAQQDHPDDPYAFSAAKEEIRTRYLNDPALSDPLLREAFQKGFEQTSDGYQRQTAIAYGAKLKAEEQSAFASGIDAMGVNIERQAYTYGANPDGDKVVGDLVTSSQRSIDAAIDAGTVTPAAGETLKQDIAMRASQARIQGTYDALPTPESKQEFAAGLVDRWKEQDPALGAFNDAFMSDIQSLQRTLLSNAASLTTAQKQANATRKAQLSTLMDDDIASVLASGKGLDPSAGLSMAELEQYFTPAEIAKFRDERNLSLDIHDATSGMDAMTADDIAALVEDMKPEPGQNGYADQQKIYDAASKKAAAILKARETDPLGQAAAAGIVEIQPIDSTDSGTITQSLVARSQAARIAGGVLGTEMPLFTKAEVDALKVQGKSTDPALFGAVMQQMDFLANSDGLLSVKQTFGAPMMEDLQVWQSKMRYATQAEAQDWLKQHADPQWQERVKPLVSSGEAKAREVPFQDIVSELDPNWIADIGAPIDDASKRAMQNDYTMLVGQFYSRIGDIDAAQKQAIEAMKTVWGRTDALGGRGGRLMAYPPEKFYPAVASNQRYLKTEMQDLAKAAGVEIDQLSLVSDAKTEAAADRHEAPGYLISIVDPQTGFDELLTDDAGRPLRHFFDPEAARKDAMAKAEQDRSDANGMAPMTPAERKAARQQGGTAAPLEINIPDNKPKPSFGRSRSPGH
Other Proteins in cluster: phalp2_36235
Total (incl. this protein): 5 Avg length: 996,2 Avg pI: 5,60

Protein ID Length (AA) pI
7w9xW 943 5,06387
2mEvs 942 5,50192
7otzH 990 5,37318
7rMbS 821 5,24893
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21335
1NS0y
26 45,1% 685 5.722E-176
2 phalp2_3986
7yjZk
1 26,4% 833 1.987E-67

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhizobium phage RHph_I20
[NCBI]
2509730 Autographiviridae >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MN988539 [NCBI]
CDS location
range 8501 -> 12358
strand +
CDS
GTGGCAGACAACAACGACGAGCTGGCAACGGCTCGGCAATTCCTCTATGGCCGCACGAACAAGCCGCAGTCGGCGATCGACGGTTTGCGCGGCGACTTCGCTACCAAGCTCTCCCGCATGATGCAGGACCCGGACGCACCAGTCGGCCTCGGCCTCTATTCAGGCTATCGCAGCCCCGAGCGCCAAGCCGAGCTTTGGGCGGGCGCCCTGAAGAAGTACGGCTCGCCTGAAGCTGCTCGCAAGTGGGTCGCGCCGCCCGGTCACTCCATGCATAACGAGGGCCTGGCTGCGGATATGGGCTACAACGGCCAGTCCCTCCAGCACGCTCCGAAGGAGGTCGTGGATTGGGTCCACAGCAACGCCGATCGCTATGGCCTGAAGTTCCCGCTTGGGAATGAGGCTTGGCACATCGAAGACAGCTCGACGCGCGGAGGCAAGCCGACCAGCGTCAGCGATCAGGCATTCGCGGCTGGAGGCTTTCAGCAGTCCGGTGGAACATACCAGCCGGTGACAAACCCGCCGGTTCCGGATTGGGTCAAGGACTCGGCGCCCAATGACACATTCCAAACCACGCCGAACGTCGCGCCGCCGACTTGGGGCGCAGTGGGCTCTTCTATCCTCCACAGTATGCCGACTTATCAAGTGGCGCAGTGGGTCGGGAATGGCTCCGCCGCACCTGATATGCCATGGCTCAACAACCCCGACCTTGGCAAATTCGGCAAGACCCTGGACGGTCTCAACGATGATGAGCAGAAATGGGTGATCGGGACTGCCCTGTCGCCGCAGCACGCGGTTGAACTGCGCAGCCAGCTAGACCAGCAGAAGGTCGATGCAGGAACGGCCACGTCTTCACCGACGGCCTTCGCGGGCATGCTCGTGGCGAACGTACTCGACCCGGTCTCTCTCGCCGCCAGTGCTGCAACAGGCCCGGCCGTGGTGGGGGCTTCTAACGTTCTGCGCGCTTCGACGGCGGCGGGCGTCCTCACGAGAATGGCGGAGGCCGGAACGATCGGGATAGCCGATGCGACCCTCGCTTCAGTTATCCAGGCGCAGGTAGACCCCAACTTCGGCGTTGAGGACTACGTCATGAACGGGCTCACCGGCATGGCGTTGGGCGCTGGCATAGGCACCTTGACCCGTGGCGCCGAGTTGGACGGTGTTTCGGCCGAAGCCACCGCAGCCTTCGGGCGCCGGGCTCGCGACTTCATCGCACAAACGGCAGAAGACCGCGGATATAAGTTCGCCTCCCCCGAAGCGCGTGACGCCTTCACCGGTGCTGGCAAGTCCGCCGGTGCCGCCGAGAACATCGCGAGCGCCGCCGAACTCTTCCCCGATCGGCGCGGCTTCCTGGAGCGCATGGCCGACCGCCTCGGCTCTTGGTCATCCAATGAGAACCTTCTCATGAACAAGACCGGGACTGCGGTGCGCGGCATCTATGAGAAGCTTATGCCAAATCTCTCCGGCACCGGAGGCCGCGAGCTGCGCGCCAGCGAAGACGCCTGGAGCTTCATGAAGCGGTCGACCGAAGTGGACCACGCGAACATGGAGAAGACCTTCTCTCAGCACTTCGACGGCTGGCTGAAGGAAAGCGGTATGGAAAACGTCAGCTTCCTCAAGCGAGACCATCTCGAAGAGGAGTTCTCACAGCGCGTCTATTACGCCATGGCCCACCCGGACGTGGCTAACGGCTCGGAGGCGATCGAGAAAATGGCCGACGCCTACCGATCGGCTTTCGGAAGCAAGCTGGACGAGGCAAAAGCGCTCGGCGTCTCGTGGGCGCAGGACGTTCCGAAGGACGATTACTATGTCCCGTTCAAGGTTTCGAAGCGCGCCTATTACGAGGTGACGGCGAAGATCGGCCGCGATGGCCTGATCGACGTCATCAAGCAAGCGTTCTTGCGTGCGCAGCCGGACCTTTACAAGGCCGAGCGCGCGAAGAAGCCGGGCACCAAGGGCCGCAAGCTGGCCGCGGACGTGCGCGAGCGGAAGGTCTCCAGACTGGCCGAGCGCTATCTGAAGATGATCGAAGACACGAACATCGACAACAACGGCGCCGAGCGCATGACCGGCTTGGCTGGCGACAGCGCCCGCGCACTGCGGGACATGCTCACGGAAGAAGGCGTCGGGCTGGAGGAGATCGAAGATATCATCTCCGCCTTTGGCTGGTCGCCGAAGGAAGGACCGAGCAACTTCCGGCGCCGGGCTGTGATGGAGCGGGACGAGCCCTTCATTCCGTCCAAGTATGCATCCCATCCGAATGCTCGGGACTATGCCGTCTCTCTCCGGCAGATAACGGACCAAGACATCCGCTCGGTCTACAGCTCATATAGCCGATCCGTGAACGGGCATCTCGCCCTGGCTAAGGCTGGCTTCAAGTCCGAGGCCGACGCCCGCGCACAGATCAAGGACGCGACCAACTTCCGGACCACCATTGAGAAGGGACCGGAGGGCAAGCAGAACGTCTTTGAGCAGGACATGACCACGGCCAGCGAAGAGCTGAACTACTTCGTGGACCGCATCCTCGGGCGCCCGGCATTCCCAGGTGTCAGCAAGAAGACCCGCCTCGCCCTGTCGCTGGTCAAGAACCTCGGCTTCATCCGCTTCATGCAGCAGTCGGGCATATCTCAGCTCGGTGACATGCCGAAGATTCTGCTCAGGTATGGCATCGGCGCGACGTGGCGGCAGTCGAGAATGCGGGACTTCCTTGACGTGTTCACCTATCGGGACACCGAAGCGAACGCCCTCACCCGCGAAATCCAGGCATCAACCGGCATTGGAATCAAGACCAAGAATGCGAAGATGTACGCGCTGCACGAGGACATCAACATCGGCGGCGAGCTGTTCGATGAAGATCTAACCACGAAGATGCTGGCCAAGGCGAACGCCGTTTCGGAAAGCGCCTCGCGCCTCACAGCCACCGTCTCGCTCCTCAATCCGATCACTGACACGCTTCAGCTTTGGGCGGCGCGCGCAGCCAGCCAACGGCTCATCGACATCGCCACCGGCCGCAAGGTGATCAGTCGGAAGTGGATGAACGAGATCGGCATCAACGAGCGTGACCTCGCCGACATGCGCGAGATCGCCAAGCGGATGACGCTTGAGGGCGATGGCAAGATTGTCCGCTGGAATAGCGACAAGCAGCGTGCGGCCGGGATGGACCACGCCGATGCCTACGACCGGTTCCTCGCGCTCGTGCGCCGGGAAACGCAGATGGCCATTCTCGAGACCGCGCCGAACGCCATCAAGCGGTCGATGTCCGGCCCGATGATGGGTCTCTTCTGGCAGCTCAAGAGCTACATGATGAACGCCCTCCTCGCCAACACGGCGAAGAACATCAAGCTCGGCCCTGCCTACATGTCCATGTCCCTTGTTGCCACGTCCGTATGGTCGGCGGCGATCTACGCGGGGCAGCAATATTCGGTCTCGCTTGGGATGCCGGGCGAGGATCGCAAGAAATTCCTCGAAGATCGTCTTTCGCTTCGGGGTATCCTGTCGGCGGGCTTCCAACGCTCGGCCGATAGCTCCATCCTCCCCATGATTATCGACACGACCATCTCCGGCGTGGACGTGTTTACGGGTGAGGACCACCGGCTATTCAGCAACACACGCAACTCCGGCCTCGGCTCCAACATCACCGACAGCATTCCCGCGCTGCGGATGGTGCAGGACATGGGCAAGCTGGCGAAGAACGTAGCGGCTGCGGCTGCACGGACTGACCAGCGTTTCGACCAGACGGAAGGGAGGCAGATGCGGGACTTCATTCCGCTTCTCCGCACATACGGCTTGCTGAACGCCACCAACGCGCTGATCGCCACCATGCCGCCGGACACTGACGGGGCGATCGTCAAGGCAAAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0006508 proteolysis biological process None (UniProt)
GO:0008233 peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7w9xW) rather than this protein.
PDB ID
7w9xW
Method AlphaFoldv2
Resolution 80.53
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50