Protein
- UniProt accession
- Q8W6J4 [UniProt]
- Protein name
- Uncharacterized protein orf33
- PhaLP type
-
VAL
evidence: ML prediction
probability: 99 % (predicted by ML model)
- Protein sequence
-
MEPARMPIVIGLDSDVEWSNPLNLSSNAKSGFGLDGTSATETKSSPAASPRDWSDIVDTVIAEAAGEGDEGMAAVASVIKNRAGVRGKSAAEVVREPDQFTGYRKPGDAARQAMQDPEMRRRAEAVVASVFSGERPDTTGGADHYHATSVSPDWASKMPRTTRVGNHLFYNSKPGAARPEAETKALGLTGDEPRRSEAEEPFDALLGREPPPADSSFLLSKLQAGKPSSYIESMKPELQTGLTAMFNEAPDQVKSGLDILSGARSPERQAQIIAENAKKYGLDRKAWLADVAEYGPEEAGRRWRPQFKASGMSKNIGAPGGSRHQHGDAADLGWNGGNFSGAPKEVREWVHANASRYGLSFPMGHEPWHIETAGARGGHSHDEDGGLRPFNAEERRPNSDGSYSTEISTTWQLPDGKWVNVPSLWMGKDGPKQFNADDEAGILGSMQRFEEQNGQKFPRFDSEQEAVAAAKTRSAAGGAGAGGGIVIDENFTSDNPGLMAQAGQLREAGLQRQAQEQAAAEQQRIAARTGTDNQAVAARQQELNADRAGRYQAIDESELPAWQKKWEEENRSSGVMGDTGRILKSGVVGLGQSITSLADTVFRKLPGGEKFLEASDAIDRWALGETFDSKMDTAQDRARASVTPEQQAADAKNWWDTENGWFGPAWRDPRSYLRGVGESAPGTVITMLPGGILARGKYLSMIAAGVEQRVAAAAAAKVATVAGAVSEGLLGGADSTRNVRERIAQIPREQLAQSEAVRILVEGGMAEGDAIKALTEDAASQAFLISGVATGMFGGLGDRALARIIADGVGGGVARRVLAGTTRGAVAEGVFEELPQGVAQTISDNAAIQRVDPSQSLTEGVEEAAAAGVATGGVMGGGMGGVGGAASRRQSAPAGIDADPAAAATQDAPRPSGPLGRAVQHGQQRAEDRAVSTLPDVTTAQPTAASTPADDRPEVGATVRVDAEGLEPILGRIEAYEGDEAIVFDSGTGELYQVPIGNVTKTADSIPAQAAALRQKQAEPPVQDVLPEESSDEALRPLEPRASEITTEIPPAKEQKPVTEMRPTRPQPGGRVIVEGENGDRFPARIETYVEDGTEAVVVTDEGKPYQVPVENLSVSGLTPDQVEQQELERNPPIEREPGDAGPNSRRLGERTVVLPDEKHAHLYDLAREQVIAKKLGGTSQVDMSKVMPAERKRLADEFGVSVEDLASMADDYRYRVERAAKEAKSTLPVKMHSVNERLLKQRQAARSKQEPESVAAKDEAGQWWDVELTAPARKRVLEQAGVKRNERVMWGSLTPNIQKKLVAVRDAERGAGTAEQVSAPAQTFKTAKGSVYQVHADGTTTRDKAARNDPGHEGDSGRKPRTAKTVYVDANAAMLSAAGLSGLGPKGARVAIKDGKATLVTWNKAGNGWGTTDDSRDIPIHAEPAVGRYPLELWDKADDVPGYEAYSRMHAGNPITEMSGAERASDVASTDPAPVAAVDDAAHEAATSPSNALPEPTQAQKEAGNYKVGRIRLGGLDISIENPAGSERKGTSPSGKPWSVKMKSHYGYIRGTVGRDKDHIDIFVKPGTEVLDDSAPIFVVDQRDPARGRFDEHKVMAGYASEDEARAAYLDNYTSGWKGLGAISPTTLGEFKVWLGSGKTTEPFAPKWFGARDKAEAHIAKEKLGATHEVVANGKRFEIREKTKTVKPDAQQKPSQIMRENMREPAAEPETGPRALFRSKEWKAAWNGTSAEKFASPHPDSPVAYQMAKGWRDAKAGLEPQRRPVDGTKYRSDGHNPIDDYLMGYVAAREGKGREIRATNAEAVFGRYLLESVDTPAAWEVEAEKALTRVPDSHPMKRKWAANVANGLKSESDRAGVLRQMQGAAPATTVTPRRDYTTTIISLRQGLESWAETGGNTLIEGKLELAERLQALSEEEWAKVEPFFHRDMEPGVTLEPEKGMAAIDEALSGAAPKPAKRTAASAKPKLPVTANTVFTEDAAEKARALLRRKLSGSTLNSGIDPEILQAGVTLAGYHIEKGARTFVAYAQAMLSDLGEDVRPYLKSWYMGVKYDPRATQFDGMSSAAEVEQLMSPTLPSMKEARAMNLQNWITLGREHWKQHLPNRYRELKQAGMLEPALKEAAEQTYREVSQLEESGFQADEAWQMVRETYLLLPSEGTTQTQPRSETADRMMQAASSGTRTVEIR
- Physico‐chemical
properties -
protein length: 2185 AA molecular weight: 235029,00000 Da isoelectric point: 5,44599 aromaticity: 0,05721 hydropathy: -0,61181
Domains
Domains [InterPro]
Taxonomy
Name | Taxonomy ID | Lineage | |
---|---|---|---|
Phage |
Sinorhizobium phage PBC5 [NCBI] |
179237 | No lineage information |
Host |
Sinorhizobium meliloti [NCBI] |
382 | Bacteria > Proteobacteria > Alphaproteobacteria > Rhizobiales > Rhizobiaceae > Sinorhizobium |
Coding sequence (CDS)
Coding sequence (CDS)
Genbank protein accession
AAL49565.1
[NCBI]
Genbank nucleotide accession
AF448724
[NCBI]
CDS location
range 26155 -> 32712
strand -
strand -
CDS
ATGGAGCCCGCACGCATGCCTATCGTTATTGGCCTCGACAGTGATGTCGAGTGGAGCAACCCCCTCAACCTTTCCAGCAACGCCAAGTCCGGTTTCGGGCTGGATGGCACGAGCGCAACGGAGACGAAAAGCTCTCCCGCTGCCTCGCCCCGCGATTGGAGCGACATCGTCGACACCGTGATCGCCGAGGCTGCCGGCGAAGGCGACGAGGGCATGGCTGCCGTTGCCAGCGTCATCAAGAACCGTGCTGGTGTGCGCGGTAAGAGCGCGGCCGAGGTGGTGCGAGAGCCCGACCAGTTCACCGGCTACAGGAAGCCGGGCGATGCCGCTAGGCAGGCCATGCAGGATCCGGAGATGCGGCGCCGCGCAGAGGCCGTTGTCGCCAGCGTGTTCTCTGGCGAGCGTCCCGACACGACCGGGGGCGCTGACCACTACCACGCTACCAGCGTTTCGCCCGATTGGGCCTCGAAGATGCCGCGCACGACGCGCGTCGGCAATCACCTGTTCTACAATTCCAAGCCCGGGGCGGCCCGGCCCGAGGCGGAGACCAAGGCACTCGGCCTCACCGGCGACGAGCCGAGGCGCAGCGAGGCCGAAGAGCCATTTGACGCCTTGCTCGGCCGTGAGCCGCCGCCGGCAGACAGCTCATTCCTGTTGAGCAAGCTACAGGCAGGCAAGCCATCGTCCTACATCGAGAGCATGAAGCCGGAACTGCAAACCGGCCTTACTGCCATGTTCAACGAGGCCCCGGATCAAGTGAAGAGCGGGCTCGACATCTTGTCCGGAGCGCGAAGCCCAGAGCGTCAGGCGCAGATCATCGCCGAGAATGCCAAGAAGTACGGTCTCGATCGTAAGGCTTGGCTCGCCGACGTTGCCGAATACGGGCCCGAAGAGGCCGGCAGGCGTTGGCGCCCCCAGTTCAAGGCATCGGGGATGTCGAAGAACATCGGCGCTCCTGGTGGTTCGCGTCACCAGCACGGGGATGCGGCCGATCTCGGATGGAACGGCGGCAACTTCTCCGGCGCGCCGAAGGAAGTGCGCGAGTGGGTGCATGCCAACGCCAGCCGTTATGGTCTGTCTTTCCCGATGGGCCACGAACCTTGGCATATCGAGACCGCGGGCGCGCGTGGCGGCCATTCTCACGACGAGGACGGCGGACTGCGACCATTCAATGCAGAAGAGCGCCGTCCCAACAGCGACGGCAGCTATTCGACCGAGATCAGCACGACATGGCAGTTGCCGGACGGTAAGTGGGTCAATGTTCCCTCGCTCTGGATGGGGAAGGACGGCCCGAAGCAGTTCAATGCAGATGACGAGGCGGGCATCCTGGGATCGATGCAACGCTTTGAAGAGCAGAACGGGCAGAAGTTCCCCCGTTTCGACAGCGAACAAGAAGCCGTCGCCGCCGCAAAAACGCGCAGTGCGGCCGGCGGCGCTGGCGCTGGCGGGGGCATCGTCATCGACGAGAATTTCACGTCGGATAACCCGGGCCTGATGGCGCAGGCCGGGCAGCTCCGCGAGGCAGGCCTGCAGCGCCAGGCGCAAGAACAAGCAGCAGCCGAGCAGCAGCGCATTGCCGCAAGGACCGGCACGGACAATCAGGCCGTCGCAGCGCGTCAGCAGGAGTTGAACGCCGACCGCGCCGGTCGCTACCAAGCGATCGACGAGAGCGAACTGCCGGCGTGGCAGAAGAAGTGGGAAGAGGAAAACCGCTCGTCTGGCGTCATGGGCGACACCGGACGCATCCTGAAATCCGGCGTGGTCGGATTGGGCCAGTCAATCACATCGCTTGCCGATACCGTTTTCCGCAAGTTACCGGGCGGCGAGAAGTTCCTCGAAGCGTCCGACGCCATCGACCGTTGGGCACTGGGCGAGACGTTCGACAGCAAGATGGATACGGCGCAGGACCGCGCCCGTGCTTCCGTCACTCCCGAGCAGCAGGCGGCCGACGCCAAGAATTGGTGGGACACCGAGAACGGCTGGTTTGGCCCGGCGTGGCGCGATCCGCGCAGCTATCTCCGTGGCGTCGGGGAATCGGCCCCGGGCACTGTCATCACCATGCTTCCGGGCGGCATCCTCGCTCGCGGCAAGTATCTGAGCATGATCGCCGCTGGAGTAGAGCAGCGCGTCGCAGCGGCAGCGGCAGCGAAGGTCGCCACGGTCGCAGGCGCAGTTTCCGAGGGCCTGCTCGGCGGCGCCGATTCGACCCGCAACGTTCGGGAGCGCATTGCGCAGATCCCGCGCGAGCAGCTGGCCCAATCGGAAGCCGTCCGTATCCTGGTCGAGGGCGGCATGGCCGAGGGCGACGCCATCAAGGCTCTGACGGAGGATGCAGCATCGCAGGCCTTCCTGATCTCCGGCGTCGCTACAGGCATGTTCGGCGGCTTGGGCGACCGTGCCTTGGCCCGCATCATCGCCGATGGTGTCGGCGGTGGCGTCGCTCGGCGCGTGCTGGCTGGTACCACACGCGGAGCCGTTGCAGAGGGCGTCTTCGAAGAATTGCCGCAGGGCGTTGCGCAGACGATTTCCGACAACGCAGCTATCCAGCGCGTCGACCCTTCGCAATCGCTGACCGAAGGCGTCGAGGAAGCCGCAGCGGCCGGTGTCGCGACCGGCGGCGTGATGGGCGGCGGCATGGGTGGTGTTGGTGGCGCAGCATCGCGTCGCCAGAGTGCACCGGCCGGGATCGATGCGGACCCGGCTGCCGCCGCTACGCAGGACGCCCCTCGCCCGTCCGGACCGTTGGGCCGCGCCGTCCAGCATGGCCAGCAGCGCGCCGAAGATCGCGCCGTATCCACCCTGCCTGATGTCACGACGGCACAGCCCACAGCAGCTTCCACGCCTGCAGATGATCGCCCAGAAGTTGGCGCGACTGTGCGTGTGGACGCCGAGGGATTGGAGCCGATCCTTGGGCGCATCGAGGCGTACGAAGGCGATGAGGCCATCGTTTTCGACAGCGGTACCGGCGAACTCTACCAGGTGCCGATCGGCAATGTGACGAAGACGGCGGACTCGATCCCGGCTCAGGCGGCGGCGTTGCGCCAGAAGCAGGCAGAGCCTCCCGTGCAGGACGTTCTGCCCGAGGAATCGAGCGACGAGGCCTTGAGGCCGCTTGAGCCCCGTGCATCTGAGATCACGACGGAAATCCCGCCTGCGAAAGAGCAGAAGCCGGTTACCGAGATGCGCCCGACGCGCCCGCAGCCAGGCGGCCGTGTCATCGTCGAGGGGGAGAATGGCGACCGGTTCCCGGCGCGCATCGAGACCTATGTTGAGGACGGTACCGAGGCCGTTGTCGTCACCGACGAGGGCAAGCCCTATCAGGTGCCGGTCGAAAACCTCTCGGTCAGCGGCCTGACGCCAGATCAGGTCGAGCAGCAGGAGCTGGAACGCAATCCGCCGATCGAGCGCGAGCCCGGCGATGCAGGCCCGAACAGCCGCCGTCTTGGTGAGCGCACCGTAGTGCTGCCGGACGAGAAGCACGCGCATCTCTACGACCTCGCCCGCGAGCAGGTCATTGCCAAGAAGCTTGGCGGCACGTCTCAGGTCGATATGTCGAAGGTTATGCCGGCAGAGCGCAAGCGTCTGGCTGACGAGTTCGGCGTGTCCGTCGAGGACCTCGCCAGCATGGCGGATGACTATCGGTACCGCGTCGAGCGCGCTGCTAAGGAAGCGAAATCGACGCTGCCGGTGAAGATGCATTCGGTCAACGAGCGCTTGCTCAAGCAGCGCCAGGCGGCACGCTCAAAGCAGGAGCCGGAATCGGTCGCAGCGAAGGACGAGGCCGGCCAGTGGTGGGACGTTGAGCTCACTGCTCCGGCTCGCAAGCGCGTGCTCGAACAGGCTGGCGTCAAGCGCAATGAGCGCGTGATGTGGGGCAGTCTCACGCCCAACATCCAGAAGAAGCTGGTCGCGGTCAGAGATGCGGAGCGCGGCGCGGGAACTGCAGAGCAGGTTTCGGCGCCTGCCCAGACCTTTAAAACCGCGAAGGGCAGCGTCTACCAGGTGCATGCCGATGGCACCACGACGCGCGACAAGGCGGCCCGCAACGATCCGGGCCACGAGGGCGACAGCGGGCGGAAGCCGCGAACCGCCAAAACGGTCTATGTCGATGCTAACGCCGCCATGTTGAGCGCGGCAGGCCTCAGTGGTCTCGGTCCGAAGGGCGCCCGCGTTGCCATCAAGGATGGTAAGGCCACGCTCGTGACGTGGAACAAGGCCGGCAACGGCTGGGGAACGACAGACGACAGCCGAGACATCCCCATCCATGCCGAGCCCGCTGTTGGGCGGTATCCGTTGGAGCTTTGGGACAAGGCCGACGATGTTCCGGGCTACGAGGCCTACTCCCGCATGCATGCTGGCAATCCGATTACGGAGATGTCCGGCGCCGAACGCGCAAGCGATGTAGCTTCCACCGATCCGGCACCAGTCGCAGCAGTCGATGACGCGGCTCACGAAGCCGCGACTTCGCCGTCTAACGCCCTGCCCGAGCCCACTCAGGCACAGAAGGAAGCCGGCAACTACAAGGTCGGCCGCATCCGCCTTGGCGGGTTGGATATCTCGATCGAGAACCCGGCTGGCTCCGAGCGCAAAGGCACGAGCCCATCCGGCAAGCCGTGGTCCGTCAAGATGAAGAGCCATTATGGCTACATTCGAGGCACGGTCGGTCGCGACAAGGACCATATCGACATCTTCGTGAAGCCCGGTACCGAGGTGCTCGACGATTCCGCCCCGATCTTTGTCGTCGATCAGCGTGACCCCGCACGTGGCCGCTTCGACGAGCACAAGGTAATGGCCGGCTACGCCAGCGAGGACGAAGCGCGCGCCGCCTACCTCGACAACTACACCAGCGGCTGGAAGGGGCTCGGCGCCATCAGTCCGACGACGTTGGGCGAGTTCAAAGTGTGGCTGGGTTCGGGCAAGACGACAGAGCCCTTCGCGCCGAAGTGGTTCGGTGCCCGCGATAAGGCCGAGGCGCACATCGCCAAGGAGAAGCTCGGCGCCACGCATGAGGTGGTCGCGAACGGCAAGCGGTTCGAGATCCGCGAGAAAACAAAGACGGTAAAGCCGGACGCGCAGCAAAAACCCTCGCAAATTATGCGCGAAAATATGCGTGAGCCGGCAGCCGAGCCCGAAACAGGTCCGCGCGCTCTGTTCCGCTCCAAGGAATGGAAGGCAGCATGGAACGGCACTTCGGCGGAGAAGTTCGCATCGCCGCATCCCGATAGCCCCGTTGCCTATCAGATGGCCAAGGGATGGCGTGACGCAAAGGCCGGGCTTGAACCTCAGCGCAGGCCTGTAGACGGCACGAAGTACCGCAGCGACGGCCACAACCCTATCGACGATTACCTGATGGGTTATGTCGCCGCACGCGAAGGCAAGGGGCGGGAAATCCGCGCCACCAACGCGGAAGCCGTGTTCGGCCGCTATCTGCTTGAAAGTGTCGACACGCCGGCTGCGTGGGAAGTTGAGGCAGAGAAGGCGCTTACCCGCGTTCCCGATAGCCACCCGATGAAAAGGAAGTGGGCGGCCAACGTCGCCAACGGGCTCAAGAGCGAGAGCGACCGCGCTGGTGTGCTGCGCCAGATGCAGGGAGCCGCACCGGCCACAACGGTAACGCCGCGTCGCGATTACACCACCACCATCATTAGCCTGCGCCAAGGACTCGAGTCGTGGGCAGAGACCGGCGGCAACACCCTCATCGAGGGTAAGCTTGAACTGGCTGAACGGCTGCAGGCGCTCTCCGAAGAGGAATGGGCGAAGGTCGAACCCTTCTTCCATCGCGACATGGAGCCCGGCGTCACCCTTGAGCCTGAAAAGGGCATGGCGGCGATCGACGAGGCACTCAGCGGAGCGGCGCCAAAGCCAGCCAAGCGCACGGCCGCGTCTGCCAAGCCGAAGCTCCCGGTCACGGCAAACACCGTGTTCACGGAAGATGCTGCAGAAAAGGCCCGCGCCCTGCTGCGTCGCAAGCTGTCCGGTAGCACGCTCAACAGCGGCATCGATCCGGAAATTCTGCAGGCAGGTGTCACCCTCGCCGGCTACCATATCGAGAAGGGTGCCCGTACCTTCGTCGCCTATGCCCAAGCGATGTTGTCCGATCTCGGCGAGGATGTGCGTCCGTACCTGAAATCGTGGTATATGGGTGTGAAGTACGACCCCCGCGCCACGCAATTCGACGGTATGTCGAGCGCTGCCGAGGTCGAGCAATTGATGTCGCCGACATTGCCGTCGATGAAGGAAGCGCGAGCGATGAACCTGCAGAACTGGATCACTCTGGGTCGCGAGCACTGGAAGCAGCACCTGCCGAACAGGTATCGCGAGCTGAAACAGGCCGGGATGCTCGAACCGGCGCTGAAAGAAGCCGCGGAACAGACGTACCGCGAAGTGAGCCAGCTCGAAGAGAGCGGGTTTCAGGCGGACGAGGCGTGGCAGATGGTCAGGGAGACCTACTTGCTGCTGCCGAGCGAGGGGACGACGCAGACGCAGCCCCGCAGCGAGACCGCCGACAGAATGATGCAGGCCGCCAGCAGCGGGACGAGAACGGTCGAGATCAGGTAG
Gene Ontology
Description | Category | Evidence (source) | |
---|---|---|---|
GO:0006508 | proteolysis | Biological process | Inferred from Electronic Annotation (InterPro) |
GO:0008233 | peptidase activity | Molecular function | Inferred from Electronic Annotation (InterPro) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available.