Protein

UniProt accession
Q8W6J4 [UniProt]
Protein name
Uncharacterized protein orf33
PhaLP type
VAL

evidence: ML prediction

probability: 99 % (predicted by ML model)

Protein sequence
MEPARMPIVIGLDSDVEWSNPLNLSSNAKSGFGLDGTSATETKSSPAASPRDWSDIVDTVIAEAAGEGDEGMAAVASVIKNRAGVRGKSAAEVVREPDQFTGYRKPGDAARQAMQDPEMRRRAEAVVASVFSGERPDTTGGADHYHATSVSPDWASKMPRTTRVGNHLFYNSKPGAARPEAETKALGLTGDEPRRSEAEEPFDALLGREPPPADSSFLLSKLQAGKPSSYIESMKPELQTGLTAMFNEAPDQVKSGLDILSGARSPERQAQIIAENAKKYGLDRKAWLADVAEYGPEEAGRRWRPQFKASGMSKNIGAPGGSRHQHGDAADLGWNGGNFSGAPKEVREWVHANASRYGLSFPMGHEPWHIETAGARGGHSHDEDGGLRPFNAEERRPNSDGSYSTEISTTWQLPDGKWVNVPSLWMGKDGPKQFNADDEAGILGSMQRFEEQNGQKFPRFDSEQEAVAAAKTRSAAGGAGAGGGIVIDENFTSDNPGLMAQAGQLREAGLQRQAQEQAAAEQQRIAARTGTDNQAVAARQQELNADRAGRYQAIDESELPAWQKKWEEENRSSGVMGDTGRILKSGVVGLGQSITSLADTVFRKLPGGEKFLEASDAIDRWALGETFDSKMDTAQDRARASVTPEQQAADAKNWWDTENGWFGPAWRDPRSYLRGVGESAPGTVITMLPGGILARGKYLSMIAAGVEQRVAAAAAAKVATVAGAVSEGLLGGADSTRNVRERIAQIPREQLAQSEAVRILVEGGMAEGDAIKALTEDAASQAFLISGVATGMFGGLGDRALARIIADGVGGGVARRVLAGTTRGAVAEGVFEELPQGVAQTISDNAAIQRVDPSQSLTEGVEEAAAAGVATGGVMGGGMGGVGGAASRRQSAPAGIDADPAAAATQDAPRPSGPLGRAVQHGQQRAEDRAVSTLPDVTTAQPTAASTPADDRPEVGATVRVDAEGLEPILGRIEAYEGDEAIVFDSGTGELYQVPIGNVTKTADSIPAQAAALRQKQAEPPVQDVLPEESSDEALRPLEPRASEITTEIPPAKEQKPVTEMRPTRPQPGGRVIVEGENGDRFPARIETYVEDGTEAVVVTDEGKPYQVPVENLSVSGLTPDQVEQQELERNPPIEREPGDAGPNSRRLGERTVVLPDEKHAHLYDLAREQVIAKKLGGTSQVDMSKVMPAERKRLADEFGVSVEDLASMADDYRYRVERAAKEAKSTLPVKMHSVNERLLKQRQAARSKQEPESVAAKDEAGQWWDVELTAPARKRVLEQAGVKRNERVMWGSLTPNIQKKLVAVRDAERGAGTAEQVSAPAQTFKTAKGSVYQVHADGTTTRDKAARNDPGHEGDSGRKPRTAKTVYVDANAAMLSAAGLSGLGPKGARVAIKDGKATLVTWNKAGNGWGTTDDSRDIPIHAEPAVGRYPLELWDKADDVPGYEAYSRMHAGNPITEMSGAERASDVASTDPAPVAAVDDAAHEAATSPSNALPEPTQAQKEAGNYKVGRIRLGGLDISIENPAGSERKGTSPSGKPWSVKMKSHYGYIRGTVGRDKDHIDIFVKPGTEVLDDSAPIFVVDQRDPARGRFDEHKVMAGYASEDEARAAYLDNYTSGWKGLGAISPTTLGEFKVWLGSGKTTEPFAPKWFGARDKAEAHIAKEKLGATHEVVANGKRFEIREKTKTVKPDAQQKPSQIMRENMREPAAEPETGPRALFRSKEWKAAWNGTSAEKFASPHPDSPVAYQMAKGWRDAKAGLEPQRRPVDGTKYRSDGHNPIDDYLMGYVAAREGKGREIRATNAEAVFGRYLLESVDTPAAWEVEAEKALTRVPDSHPMKRKWAANVANGLKSESDRAGVLRQMQGAAPATTVTPRRDYTTTIISLRQGLESWAETGGNTLIEGKLELAERLQALSEEEWAKVEPFFHRDMEPGVTLEPEKGMAAIDEALSGAAPKPAKRTAASAKPKLPVTANTVFTEDAAEKARALLRRKLSGSTLNSGIDPEILQAGVTLAGYHIEKGARTFVAYAQAMLSDLGEDVRPYLKSWYMGVKYDPRATQFDGMSSAAEVEQLMSPTLPSMKEARAMNLQNWITLGREHWKQHLPNRYRELKQAGMLEPALKEAAEQTYREVSQLEESGFQADEAWQMVRETYLLLPSEGTTQTQPRSETADRMMQAASSGTRTVEIR
Physico‐chemical
properties
protein length:2185 AA
molecular weight:235029,00000 Da
isoelectric point:5,44599
aromaticity:0,05721
hydropathy:-0,61181

Domains

Domains [InterPro]
Protein sequence: Q8W6J4
1 2185
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Sinorhizobium phage PBC5
[NCBI]
179237 No lineage information
Host Sinorhizobium meliloti
[NCBI]
382 Bacteria > Proteobacteria > Alphaproteobacteria > Rhizobiales > Rhizobiaceae > Sinorhizobium

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
AAL49565.1 [NCBI]
Genbank nucleotide accession
AF448724 [NCBI]
CDS location
range 26155 -> 32712
strand -
CDS
ATGGAGCCCGCACGCATGCCTATCGTTATTGGCCTCGACAGTGATGTCGAGTGGAGCAACCCCCTCAACCTTTCCAGCAACGCCAAGTCCGGTTTCGGGCTGGATGGCACGAGCGCAACGGAGACGAAAAGCTCTCCCGCTGCCTCGCCCCGCGATTGGAGCGACATCGTCGACACCGTGATCGCCGAGGCTGCCGGCGAAGGCGACGAGGGCATGGCTGCCGTTGCCAGCGTCATCAAGAACCGTGCTGGTGTGCGCGGTAAGAGCGCGGCCGAGGTGGTGCGAGAGCCCGACCAGTTCACCGGCTACAGGAAGCCGGGCGATGCCGCTAGGCAGGCCATGCAGGATCCGGAGATGCGGCGCCGCGCAGAGGCCGTTGTCGCCAGCGTGTTCTCTGGCGAGCGTCCCGACACGACCGGGGGCGCTGACCACTACCACGCTACCAGCGTTTCGCCCGATTGGGCCTCGAAGATGCCGCGCACGACGCGCGTCGGCAATCACCTGTTCTACAATTCCAAGCCCGGGGCGGCCCGGCCCGAGGCGGAGACCAAGGCACTCGGCCTCACCGGCGACGAGCCGAGGCGCAGCGAGGCCGAAGAGCCATTTGACGCCTTGCTCGGCCGTGAGCCGCCGCCGGCAGACAGCTCATTCCTGTTGAGCAAGCTACAGGCAGGCAAGCCATCGTCCTACATCGAGAGCATGAAGCCGGAACTGCAAACCGGCCTTACTGCCATGTTCAACGAGGCCCCGGATCAAGTGAAGAGCGGGCTCGACATCTTGTCCGGAGCGCGAAGCCCAGAGCGTCAGGCGCAGATCATCGCCGAGAATGCCAAGAAGTACGGTCTCGATCGTAAGGCTTGGCTCGCCGACGTTGCCGAATACGGGCCCGAAGAGGCCGGCAGGCGTTGGCGCCCCCAGTTCAAGGCATCGGGGATGTCGAAGAACATCGGCGCTCCTGGTGGTTCGCGTCACCAGCACGGGGATGCGGCCGATCTCGGATGGAACGGCGGCAACTTCTCCGGCGCGCCGAAGGAAGTGCGCGAGTGGGTGCATGCCAACGCCAGCCGTTATGGTCTGTCTTTCCCGATGGGCCACGAACCTTGGCATATCGAGACCGCGGGCGCGCGTGGCGGCCATTCTCACGACGAGGACGGCGGACTGCGACCATTCAATGCAGAAGAGCGCCGTCCCAACAGCGACGGCAGCTATTCGACCGAGATCAGCACGACATGGCAGTTGCCGGACGGTAAGTGGGTCAATGTTCCCTCGCTCTGGATGGGGAAGGACGGCCCGAAGCAGTTCAATGCAGATGACGAGGCGGGCATCCTGGGATCGATGCAACGCTTTGAAGAGCAGAACGGGCAGAAGTTCCCCCGTTTCGACAGCGAACAAGAAGCCGTCGCCGCCGCAAAAACGCGCAGTGCGGCCGGCGGCGCTGGCGCTGGCGGGGGCATCGTCATCGACGAGAATTTCACGTCGGATAACCCGGGCCTGATGGCGCAGGCCGGGCAGCTCCGCGAGGCAGGCCTGCAGCGCCAGGCGCAAGAACAAGCAGCAGCCGAGCAGCAGCGCATTGCCGCAAGGACCGGCACGGACAATCAGGCCGTCGCAGCGCGTCAGCAGGAGTTGAACGCCGACCGCGCCGGTCGCTACCAAGCGATCGACGAGAGCGAACTGCCGGCGTGGCAGAAGAAGTGGGAAGAGGAAAACCGCTCGTCTGGCGTCATGGGCGACACCGGACGCATCCTGAAATCCGGCGTGGTCGGATTGGGCCAGTCAATCACATCGCTTGCCGATACCGTTTTCCGCAAGTTACCGGGCGGCGAGAAGTTCCTCGAAGCGTCCGACGCCATCGACCGTTGGGCACTGGGCGAGACGTTCGACAGCAAGATGGATACGGCGCAGGACCGCGCCCGTGCTTCCGTCACTCCCGAGCAGCAGGCGGCCGACGCCAAGAATTGGTGGGACACCGAGAACGGCTGGTTTGGCCCGGCGTGGCGCGATCCGCGCAGCTATCTCCGTGGCGTCGGGGAATCGGCCCCGGGCACTGTCATCACCATGCTTCCGGGCGGCATCCTCGCTCGCGGCAAGTATCTGAGCATGATCGCCGCTGGAGTAGAGCAGCGCGTCGCAGCGGCAGCGGCAGCGAAGGTCGCCACGGTCGCAGGCGCAGTTTCCGAGGGCCTGCTCGGCGGCGCCGATTCGACCCGCAACGTTCGGGAGCGCATTGCGCAGATCCCGCGCGAGCAGCTGGCCCAATCGGAAGCCGTCCGTATCCTGGTCGAGGGCGGCATGGCCGAGGGCGACGCCATCAAGGCTCTGACGGAGGATGCAGCATCGCAGGCCTTCCTGATCTCCGGCGTCGCTACAGGCATGTTCGGCGGCTTGGGCGACCGTGCCTTGGCCCGCATCATCGCCGATGGTGTCGGCGGTGGCGTCGCTCGGCGCGTGCTGGCTGGTACCACACGCGGAGCCGTTGCAGAGGGCGTCTTCGAAGAATTGCCGCAGGGCGTTGCGCAGACGATTTCCGACAACGCAGCTATCCAGCGCGTCGACCCTTCGCAATCGCTGACCGAAGGCGTCGAGGAAGCCGCAGCGGCCGGTGTCGCGACCGGCGGCGTGATGGGCGGCGGCATGGGTGGTGTTGGTGGCGCAGCATCGCGTCGCCAGAGTGCACCGGCCGGGATCGATGCGGACCCGGCTGCCGCCGCTACGCAGGACGCCCCTCGCCCGTCCGGACCGTTGGGCCGCGCCGTCCAGCATGGCCAGCAGCGCGCCGAAGATCGCGCCGTATCCACCCTGCCTGATGTCACGACGGCACAGCCCACAGCAGCTTCCACGCCTGCAGATGATCGCCCAGAAGTTGGCGCGACTGTGCGTGTGGACGCCGAGGGATTGGAGCCGATCCTTGGGCGCATCGAGGCGTACGAAGGCGATGAGGCCATCGTTTTCGACAGCGGTACCGGCGAACTCTACCAGGTGCCGATCGGCAATGTGACGAAGACGGCGGACTCGATCCCGGCTCAGGCGGCGGCGTTGCGCCAGAAGCAGGCAGAGCCTCCCGTGCAGGACGTTCTGCCCGAGGAATCGAGCGACGAGGCCTTGAGGCCGCTTGAGCCCCGTGCATCTGAGATCACGACGGAAATCCCGCCTGCGAAAGAGCAGAAGCCGGTTACCGAGATGCGCCCGACGCGCCCGCAGCCAGGCGGCCGTGTCATCGTCGAGGGGGAGAATGGCGACCGGTTCCCGGCGCGCATCGAGACCTATGTTGAGGACGGTACCGAGGCCGTTGTCGTCACCGACGAGGGCAAGCCCTATCAGGTGCCGGTCGAAAACCTCTCGGTCAGCGGCCTGACGCCAGATCAGGTCGAGCAGCAGGAGCTGGAACGCAATCCGCCGATCGAGCGCGAGCCCGGCGATGCAGGCCCGAACAGCCGCCGTCTTGGTGAGCGCACCGTAGTGCTGCCGGACGAGAAGCACGCGCATCTCTACGACCTCGCCCGCGAGCAGGTCATTGCCAAGAAGCTTGGCGGCACGTCTCAGGTCGATATGTCGAAGGTTATGCCGGCAGAGCGCAAGCGTCTGGCTGACGAGTTCGGCGTGTCCGTCGAGGACCTCGCCAGCATGGCGGATGACTATCGGTACCGCGTCGAGCGCGCTGCTAAGGAAGCGAAATCGACGCTGCCGGTGAAGATGCATTCGGTCAACGAGCGCTTGCTCAAGCAGCGCCAGGCGGCACGCTCAAAGCAGGAGCCGGAATCGGTCGCAGCGAAGGACGAGGCCGGCCAGTGGTGGGACGTTGAGCTCACTGCTCCGGCTCGCAAGCGCGTGCTCGAACAGGCTGGCGTCAAGCGCAATGAGCGCGTGATGTGGGGCAGTCTCACGCCCAACATCCAGAAGAAGCTGGTCGCGGTCAGAGATGCGGAGCGCGGCGCGGGAACTGCAGAGCAGGTTTCGGCGCCTGCCCAGACCTTTAAAACCGCGAAGGGCAGCGTCTACCAGGTGCATGCCGATGGCACCACGACGCGCGACAAGGCGGCCCGCAACGATCCGGGCCACGAGGGCGACAGCGGGCGGAAGCCGCGAACCGCCAAAACGGTCTATGTCGATGCTAACGCCGCCATGTTGAGCGCGGCAGGCCTCAGTGGTCTCGGTCCGAAGGGCGCCCGCGTTGCCATCAAGGATGGTAAGGCCACGCTCGTGACGTGGAACAAGGCCGGCAACGGCTGGGGAACGACAGACGACAGCCGAGACATCCCCATCCATGCCGAGCCCGCTGTTGGGCGGTATCCGTTGGAGCTTTGGGACAAGGCCGACGATGTTCCGGGCTACGAGGCCTACTCCCGCATGCATGCTGGCAATCCGATTACGGAGATGTCCGGCGCCGAACGCGCAAGCGATGTAGCTTCCACCGATCCGGCACCAGTCGCAGCAGTCGATGACGCGGCTCACGAAGCCGCGACTTCGCCGTCTAACGCCCTGCCCGAGCCCACTCAGGCACAGAAGGAAGCCGGCAACTACAAGGTCGGCCGCATCCGCCTTGGCGGGTTGGATATCTCGATCGAGAACCCGGCTGGCTCCGAGCGCAAAGGCACGAGCCCATCCGGCAAGCCGTGGTCCGTCAAGATGAAGAGCCATTATGGCTACATTCGAGGCACGGTCGGTCGCGACAAGGACCATATCGACATCTTCGTGAAGCCCGGTACCGAGGTGCTCGACGATTCCGCCCCGATCTTTGTCGTCGATCAGCGTGACCCCGCACGTGGCCGCTTCGACGAGCACAAGGTAATGGCCGGCTACGCCAGCGAGGACGAAGCGCGCGCCGCCTACCTCGACAACTACACCAGCGGCTGGAAGGGGCTCGGCGCCATCAGTCCGACGACGTTGGGCGAGTTCAAAGTGTGGCTGGGTTCGGGCAAGACGACAGAGCCCTTCGCGCCGAAGTGGTTCGGTGCCCGCGATAAGGCCGAGGCGCACATCGCCAAGGAGAAGCTCGGCGCCACGCATGAGGTGGTCGCGAACGGCAAGCGGTTCGAGATCCGCGAGAAAACAAAGACGGTAAAGCCGGACGCGCAGCAAAAACCCTCGCAAATTATGCGCGAAAATATGCGTGAGCCGGCAGCCGAGCCCGAAACAGGTCCGCGCGCTCTGTTCCGCTCCAAGGAATGGAAGGCAGCATGGAACGGCACTTCGGCGGAGAAGTTCGCATCGCCGCATCCCGATAGCCCCGTTGCCTATCAGATGGCCAAGGGATGGCGTGACGCAAAGGCCGGGCTTGAACCTCAGCGCAGGCCTGTAGACGGCACGAAGTACCGCAGCGACGGCCACAACCCTATCGACGATTACCTGATGGGTTATGTCGCCGCACGCGAAGGCAAGGGGCGGGAAATCCGCGCCACCAACGCGGAAGCCGTGTTCGGCCGCTATCTGCTTGAAAGTGTCGACACGCCGGCTGCGTGGGAAGTTGAGGCAGAGAAGGCGCTTACCCGCGTTCCCGATAGCCACCCGATGAAAAGGAAGTGGGCGGCCAACGTCGCCAACGGGCTCAAGAGCGAGAGCGACCGCGCTGGTGTGCTGCGCCAGATGCAGGGAGCCGCACCGGCCACAACGGTAACGCCGCGTCGCGATTACACCACCACCATCATTAGCCTGCGCCAAGGACTCGAGTCGTGGGCAGAGACCGGCGGCAACACCCTCATCGAGGGTAAGCTTGAACTGGCTGAACGGCTGCAGGCGCTCTCCGAAGAGGAATGGGCGAAGGTCGAACCCTTCTTCCATCGCGACATGGAGCCCGGCGTCACCCTTGAGCCTGAAAAGGGCATGGCGGCGATCGACGAGGCACTCAGCGGAGCGGCGCCAAAGCCAGCCAAGCGCACGGCCGCGTCTGCCAAGCCGAAGCTCCCGGTCACGGCAAACACCGTGTTCACGGAAGATGCTGCAGAAAAGGCCCGCGCCCTGCTGCGTCGCAAGCTGTCCGGTAGCACGCTCAACAGCGGCATCGATCCGGAAATTCTGCAGGCAGGTGTCACCCTCGCCGGCTACCATATCGAGAAGGGTGCCCGTACCTTCGTCGCCTATGCCCAAGCGATGTTGTCCGATCTCGGCGAGGATGTGCGTCCGTACCTGAAATCGTGGTATATGGGTGTGAAGTACGACCCCCGCGCCACGCAATTCGACGGTATGTCGAGCGCTGCCGAGGTCGAGCAATTGATGTCGCCGACATTGCCGTCGATGAAGGAAGCGCGAGCGATGAACCTGCAGAACTGGATCACTCTGGGTCGCGAGCACTGGAAGCAGCACCTGCCGAACAGGTATCGCGAGCTGAAACAGGCCGGGATGCTCGAACCGGCGCTGAAAGAAGCCGCGGAACAGACGTACCGCGAAGTGAGCCAGCTCGAAGAGAGCGGGTTTCAGGCGGACGAGGCGTGGCAGATGGTCAGGGAGACCTACTTGCTGCTGCCGAGCGAGGGGACGACGCAGACGCAGCCCCGCAGCGAGACCGCCGACAGAATGATGCAGGCCGCCAGCAGCGGGACGAGAACGGTCGAGATCAGGTAG

Gene Ontology

Description Category Evidence (source)
GO:0006508 proteolysis Biological process Inferred from Electronic Annotation (InterPro)
GO:0008233 peptidase activity Molecular function Inferred from Electronic Annotation (InterPro)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.