Protein

UniProt accession
A8WA08 [UniProt]
Protein name
Tapemeasure protein
PhaLP type
VAL

evidence: GO annotation

probability: 99 % (predicted by ML model)

Protein sequence
MPDIGYYTLPVIPSFRGTTNRLQTDLNRQFGKAGTDAGKTLTTAASKQIEKDRAIEKATATKAKAVDNLTKAMDKAASAAGRVRTAEANLVQVRKSGNEARIVAAEERLAEARRKVAAETRNVDKANREVSSSTKRLQEAMKQTVDAGNSGGGAGQAGIAGLMLFGKGGAAGLSSLSSSAGRTAGIALRAGIVGAVGVGIGAALVAPIAAAFKAFNWGAEVGLPLERTLNTLRGVTEASGAQMAAAGNEARRLGSDINLSGVTASDAAAAMTELAKGGLTLDQAMKATRGTLQLATAGQLDAAEAAKYQTAAMNTFQLGAEKATHVADLLAKAANASSAEVSDIGMALQQGGAVAAGFGLSLEETVSTLAAFSKMGINGSDAGTMLKTSLQAITDQGNPAQAAIEQLGLTLYDSNGKFVGYTNMMNQVAEASTHMTQEQFQAATAVLFGSDAMRASMIAAKGGPELFNKTAEEMSKVDGAAAAMAGAQMHGLPGVIEGLDNTMDGLKLSVYDAGNAIVTAMGQEALGSLDSFADMVARNQPTIIGFFTNVGTFAVEGATQLVMFAAEGARALAQFVNVIGDVHGGMLRAGAALKRLTGDTAGADKWDAEADAAFGYADGIYAVADKLDGTVNKLRGFKDRLADAGKQAQNASKLTVALGTAIAEVPDGKDIMIRENTPETIENLRALGIQVEETPTGLKMTATTDEAEDIVNDWREQQGIKPVEVPIKPTVNDADMQALLQQYPMLGGAGVTAPGTNTTPPPGASVAELLIPRPRALGGLFSGIQPLPDDAKIQQPVPGGVVQWAEAGDAEAFIPINGSQRSKDIWVATGRALGILQSFANGGLGDAGGALPYTQALRELMFRQFPALKDIGTYRAPDGFNEHSSGRAADVMIPNYNTPQGLALGNQVASFALSLPGTERVMWQHRTWYPDGRSNWVEERGSDTANHMDHVHVFANDIAAQARGGGVPNLGGPYGAGLNTGAATQTVPDWDAIAQAESGGNWAINTGNGYYGGLQFDQPTWDAYKPAGAPARADMAPRETQIAAAENLVRDRGANAPKAWPNTWKTKDVPATGLSTGTRTAGIDPETGERGFYTPDPKKIREADQKVADAEQRIREADAKVAEEEAALRELAPDAKESQRLSAQRQLDAAKADADKARREADDARADLAEAQRGDFKRGSGVSGGDGASQFGEVGSILSSFMQDTFGLGDLLPDPSQLGIVKLLGAIMGIKYTPQGAGFPWQTGYAGGNGTPFSGNPFAGSQAITDPLAAATSMLPFGMVPDAAQMAGGGQTLPNGLPVPAMPPEGVHAGGGAAPGPVQQDNSVNVTVNGYSQTDVVNGVRREMQWAPRVNTYTPPGMG
Physico‐chemical
properties
protein length:1359 AA
molecular weight:140672,00000 Da
isoelectric point:5,09331
aromaticity:0,05519
hydropathy:-0,23525

Domains

Domains [InterPro]
Protein sequence: A8WA08
1 1359
Legend: Pfam SMART CDD TIGRFAM HAMAP SUPFAM PRINTS Gene3D PANTHER Other

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage Giles
[NCBI]
480808 Gilesvirus > Gilesvirus giles
Host Mycobacterium smegmatis
[NCBI]
1772 Bacteria > Actinobacteria > Actinobacteria > Corynebacteriales > Mycobacteriaceae > Mycobacterium

Coding sequence (CDS)

Coding sequence (CDS)
Genbank protein accession
ABW88415.1 [NCBI]
Genbank nucleotide accession
EU203571 [NCBI]
CDS location
range 12222 -> 16301
strand +
CDS
GTGCCTGATATTGGCTACTACACATTGCCGGTGATCCCGTCATTCCGCGGGACGACCAACCGGTTGCAAACCGACCTCAACCGGCAGTTCGGTAAGGCGGGAACCGACGCCGGTAAGACACTCACCACGGCGGCGTCGAAGCAGATCGAAAAGGATCGCGCCATCGAGAAGGCCACCGCCACAAAGGCGAAGGCCGTCGACAACCTCACGAAGGCGATGGACAAAGCCGCCTCCGCGGCGGGCCGGGTGCGCACCGCGGAGGCCAACCTCGTCCAGGTGCGGAAGTCGGGCAACGAAGCCCGCATTGTGGCCGCGGAGGAACGGTTGGCGGAGGCGCGCCGGAAGGTGGCCGCGGAGACGCGCAACGTCGACAAGGCCAACCGCGAAGTGTCGTCGTCGACGAAACGCCTCCAGGAGGCGATGAAGCAGACGGTCGACGCGGGCAACAGCGGCGGCGGCGCGGGCCAGGCGGGGATCGCCGGACTGATGTTGTTCGGTAAGGGCGGCGCGGCGGGGTTGTCGTCGCTGTCGTCGTCGGCGGGCCGCACCGCGGGGATCGCGTTGCGCGCCGGGATCGTCGGCGCGGTCGGCGTCGGCATCGGGGCCGCGTTGGTCGCGCCTATCGCGGCGGCGTTTAAGGCGTTCAACTGGGGCGCGGAGGTCGGTCTACCGCTCGAGCGGACCCTGAACACCCTTCGGGGCGTTACGGAGGCGTCCGGGGCGCAGATGGCCGCCGCGGGCAACGAGGCGCGCCGGTTGGGTTCGGACATCAACCTATCGGGCGTCACGGCGTCCGACGCCGCCGCGGCGATGACGGAGTTGGCGAAGGGCGGGTTGACGCTCGATCAGGCGATGAAGGCGACGCGCGGCACGTTGCAGTTGGCCACGGCCGGGCAGTTGGACGCCGCGGAGGCCGCGAAGTATCAGACGGCCGCCATGAATACGTTCCAGTTGGGTGCGGAAAAGGCAACGCACGTCGCCGACCTGTTGGCGAAGGCGGCCAACGCGTCGTCGGCCGAAGTGTCCGACATCGGCATGGCGTTACAGCAGGGCGGCGCGGTGGCCGCCGGGTTCGGGTTGTCGCTCGAAGAGACGGTGTCGACGTTGGCGGCGTTCTCCAAGATGGGCATCAACGGATCCGACGCCGGAACGATGCTCAAAACGTCGTTGCAGGCGATCACCGATCAGGGGAACCCGGCGCAGGCGGCCATCGAACAGTTGGGCCTCACCCTGTACGACAGCAACGGCAAATTCGTCGGCTACACGAACATGATGAACCAGGTGGCCGAAGCGTCAACCCACATGACGCAGGAACAATTTCAAGCCGCGACGGCGGTGTTGTTCGGGTCGGATGCCATGCGCGCGTCCATGATTGCCGCGAAGGGCGGCCCCGAACTGTTCAACAAGACCGCCGAAGAGATGTCGAAGGTTGACGGCGCGGCGGCGGCGATGGCCGGGGCGCAGATGCACGGGTTGCCCGGCGTCATCGAGGGCCTCGACAACACGATGGACGGGTTGAAGCTGTCGGTGTACGACGCCGGTAACGCCATCGTGACGGCGATGGGCCAGGAGGCTCTCGGTTCGTTGGATTCGTTCGCCGACATGGTCGCCCGGAACCAACCGACCATCATCGGATTCTTCACCAACGTCGGCACTTTCGCCGTTGAGGGCGCGACGCAGTTGGTGATGTTCGCGGCCGAAGGGGCGCGGGCGTTGGCGCAGTTCGTGAACGTCATCGGCGATGTGCACGGCGGGATGTTGCGCGCCGGTGCGGCGTTGAAGCGGTTGACCGGCGACACCGCGGGCGCGGACAAATGGGACGCCGAAGCTGACGCGGCGTTCGGCTACGCCGACGGGATTTACGCGGTGGCCGACAAACTCGACGGCACCGTGAACAAACTACGCGGGTTCAAAGACCGGCTCGCCGACGCCGGGAAACAGGCACAGAACGCGTCGAAACTCACCGTGGCGTTGGGCACCGCTATCGCCGAAGTTCCCGACGGTAAGGACATCATGATCCGGGAGAACACCCCGGAGACGATAGAAAACCTGCGCGCGTTGGGGATCCAGGTCGAAGAGACGCCGACCGGGTTGAAGATGACGGCGACGACCGACGAAGCCGAAGACATCGTTAACGATTGGCGCGAACAACAGGGCATCAAACCCGTTGAGGTACCGATCAAACCGACGGTCAACGACGCCGACATGCAGGCACTGTTGCAGCAGTACCCGATGTTAGGCGGGGCCGGTGTCACCGCGCCGGGAACCAACACGACGCCACCGCCGGGGGCGTCGGTGGCCGAACTGTTGATCCCGCGGCCGCGCGCGTTGGGCGGCCTGTTCTCCGGGATCCAACCGTTGCCCGACGACGCGAAGATTCAACAACCCGTCCCCGGCGGTGTGGTGCAGTGGGCCGAAGCCGGTGACGCCGAAGCGTTTATCCCGATCAACGGGTCGCAACGGTCGAAGGACATCTGGGTTGCCACCGGCCGGGCGTTGGGGATCCTCCAATCGTTCGCTAACGGCGGGTTGGGCGACGCCGGGGGCGCGCTGCCGTATACGCAGGCGTTGCGCGAGCTGATGTTCCGACAGTTCCCGGCGTTGAAGGACATCGGCACCTACCGCGCGCCGGACGGGTTCAACGAGCATTCGTCGGGCCGCGCGGCCGATGTGATGATCCCGAACTACAACACCCCGCAGGGCCTCGCGTTGGGCAACCAGGTGGCGTCGTTCGCGTTGTCGCTGCCGGGCACCGAACGCGTGATGTGGCAACACCGCACGTGGTATCCGGACGGCCGCAGCAATTGGGTCGAGGAACGCGGATCCGACACCGCGAACCACATGGACCATGTGCACGTCTTCGCAAACGACATCGCGGCGCAGGCGCGCGGCGGCGGGGTGCCCAACCTCGGCGGGCCGTACGGTGCCGGGTTGAACACCGGCGCGGCGACGCAGACGGTGCCCGATTGGGATGCCATCGCGCAGGCCGAATCCGGCGGCAACTGGGCGATCAACACCGGCAACGGGTACTACGGCGGGTTGCAGTTCGACCAACCGACCTGGGACGCCTACAAACCGGCGGGCGCACCGGCCCGGGCCGACATGGCCCCGCGGGAGACACAGATCGCCGCGGCGGAGAACCTCGTGAGGGATCGCGGGGCCAACGCGCCGAAGGCGTGGCCGAACACGTGGAAGACGAAGGACGTTCCGGCCACCGGGTTGTCGACGGGCACGCGCACCGCAGGCATCGACCCGGAGACCGGCGAGCGCGGGTTCTACACCCCGGACCCGAAGAAAATCCGGGAGGCCGACCAAAAGGTGGCCGACGCCGAACAGCGGATCCGGGAGGCCGACGCGAAGGTGGCCGAAGAAGAGGCCGCCCTACGTGAGCTCGCACCCGACGCGAAAGAGTCGCAACGCCTGTCGGCGCAACGGCAGTTGGACGCGGCGAAGGCCGACGCCGACAAGGCGCGCCGGGAGGCCGACGACGCCCGCGCCGATTTGGCCGAAGCGCAACGCGGCGACTTCAAACGCGGATCCGGGGTGTCGGGTGGTGACGGCGCGTCGCAGTTCGGCGAAGTCGGGTCCATCCTGTCGTCGTTCATGCAAGACACGTTCGGGTTGGGTGACCTACTGCCGGACCCGTCGCAGTTGGGCATCGTGAAGTTGCTCGGCGCAATCATGGGCATCAAATACACCCCGCAGGGTGCGGGGTTCCCGTGGCAGACGGGTTACGCCGGAGGCAACGGCACCCCGTTCTCGGGCAACCCGTTCGCCGGTTCGCAGGCGATCACCGACCCGTTGGCCGCGGCAACGTCGATGCTGCCGTTCGGGATGGTGCCCGATGCCGCTCAGATGGCCGGAGGCGGCCAAACGCTGCCGAACGGTCTACCGGTACCGGCCATGCCGCCGGAGGGCGTGCACGCGGGCGGTGGGGCCGCGCCCGGCCCCGTGCAACAGGACAACTCGGTGAACGTCACCGTCAACGGTTACAGCCAAACCGATGTCGTGAACGGGGTGCGCCGCGAAATGCAGTGGGCACCACGGGTGAACACGTACACACCGCCGGGTATGGGATAG

Gene Ontology

Description Category Evidence (source)
GO:0016787 hydrolase activity Molecular function Inferred from Electronic Annotation (InterPro)
GO:0098003 viral tail assembly Biological process Inferred from Electronic Annotation (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.