Protein

UniProt accession
A9Q1W5 [UniProt]
Protein name
Putative tail fiber
PhaLP type
VAL

evidence: ML prediction

probability: 99 % (predicted by ML model)

Protein sequence
MANQKDLMKLSVNELIALGSQSGLTFHAGMKKSHMVQQLSASAASGWLDTNAELMGGSFEDDSLITESLGDSSMISDAAHIAQVLSSAGYTEAFHAAMNGPTHHVEAVHAYMERLGVNTDDVWMHMPKPNPNLPQGSFNMLNAYMRDTLAGHQDIMPELPGHYTGDIMGEYSTNRGDIAKSMGYLAHMYVDRQQYDDPDRYARDVYRVAKRLESELPQNFREVAAVSAMNVGNKSGPRVSYMDSLPQLGSESIVGDIQHPRQPLNASGLPLGSMGSGIKAEYSLSASLSGSPGWSDASKSLYQDVSGAVKSAAKVYAGGSERGMNAYRTQSSDRDLILDSASRYTDLADARSGYDNLKKDIGDDPRYSGASIRGVLENAHQYNEAEISKTFDPAERLRNTGPTELSTSNEFTANLNEPTSWNSARDRRESIAIANLDSASVGQHSSVKYHDALEQGTQEWLDFRKQYDITGSTIGDYLGHNPATNNSPIHTMGEKIGLTVRKDSPRARENFERGHRLEAWARPRVGERYGIEITETGAITNDDYPGMMYSPDGLIGDDALWEHKAPNNFKDLETTPNYMDQMQLGMHLSGRSRTLFTQTVGEESRSQWVEADPTWFERNKNKIISSQARMNAGREFMESSDLEGKDLVNETRKVMSGDGIWGYQTRDHREGEGYTAGKRGMAKYSAAAGTAADPFIGSHSPYNPEASRSGYQPNFVMHEQNFPATTGNGDTGNDSMALSVKKGILAAQEENKQKGIGADADFDGKADSMGWNQERFDAANGGGSGGGGGRGGYFTSGGNYFDDYGRMGGSLAAGIAGGSIGSATNGVMQALMATPAGRMAAVGIGAIQIGNEAAEYMNDFIGNSLDAGVMNPNEYSSMSQGLEMLGLNSQQAARMNQTTHSAYNTMLNGDPSAAVNIVRGSRGLLTIGDIRSTGGDPVALARIMQERGKERGWSQARIAGAAQMAGLDGMARAFDRTEYSHERAGSVVESGRNSDFAEGMAQSEMLQVERAQLLPGYNVPQSVLSHGAALFEAGSTAAGAANSGYSQARQVAANVYDFIAGEESGGKEYNKDGTRVTSPTGARGIMQVLPSTARDPGYGIKPSDGSPEDDARVGREYYDAMYKRFGGDHEKAMAAYTDGAGTVDKAVDKFGMDWLSAVPAQAQKRVKAFREWSKSSQSLEEGATGFTRNGMSYGQTQTVVNVKIDAKVNNQVASATVAVPGGQTVTQQMNMNNGAQQRR
Physico-chemical properties
protein length: 1239 AA
molecular weight: 133361.00000 Da
isoelectric point: 5.31197
aromaticity: 0.07345
hydropathy: -0.61525
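
The record does not say how the aromaticity and hydropathy values are computed. A minimal sketch, assuming they follow the common definitions (aromaticity as the fraction of F, W and Y residues; hydropathy as the mean Kyte-Doolittle value per residue, often called GRAVY), might look like this. The function names are illustrative and not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in a protein sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def hydropathy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (assumed definition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Toy input: the first ten residues of the protein sequence in this record.
toy = "MANQKDLMKL"
print(aromaticity(toy), hydropathy(toy))
```

Running both functions on the full 1239-residue sequence would show whether the reported 0.07345 and -0.61525 follow these definitions. A mismatch would point to a different scale or method.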

Domains

Domains [InterPro]
[Domain architecture viewer for A9Q1W5, residues 1 to 1239. Legend: Pfam, SMART, CDD, TIGRFAM, HAMAP, SUPFAM, PRINTS, Gene3D, PANTHER, Other.]

Taxonomy

      | Name | Taxonomy ID | Lineage
Phage | Escherichia phage phiEcoM-GJ1 [NCBI] | 451705 | Chaseviridae > Carltongylesvirus > Carltongylesvirus GJ1
Host  | Escherichia coli [NCBI] | 562 | Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Coding sequence (CDS)

Genbank protein accession
ABR68774.1 [NCBI]
Genbank nucleotide accession
EF460875 [NCBI]
CDS location
range 38231 -> 41950
strand +
CDS
ATGGCTAATCAAAAAGATTTAATGAAGTTGAGCGTTAATGAGTTAATTGCATTAGGCTCTCAAAGCGGTCTGACTTTCCATGCTGGTATGAAGAAGTCGCACATGGTTCAACAACTTAGTGCGAGCGCTGCTTCGGGATGGCTGGATACAAATGCAGAATTAATGGGTGGGTCGTTTGAAGATGACAGTTTAATTACTGAATCTCTTGGCGACTCTTCCATGATTTCAGACGCTGCCCACATTGCACAAGTGCTTTCAAGTGCTGGCTACACAGAAGCATTTCATGCAGCAATGAATGGCCCAACACACCATGTAGAAGCTGTTCACGCATACATGGAAAGGCTTGGTGTCAACACAGATGATGTGTGGATGCACATGCCAAAGCCAAATCCGAACTTACCGCAAGGTAGCTTCAACATGCTTAATGCGTACATGCGTGATACGCTGGCAGGTCATCAAGATATAATGCCGGAACTTCCTGGTCATTACACTGGTGACATTATGGGTGAATACTCTACAAACCGTGGAGATATTGCCAAATCTATGGGCTATCTTGCTCATATGTATGTTGACCGTCAACAATATGACGACCCAGACCGCTATGCACGAGATGTTTATCGTGTAGCCAAACGTTTAGAATCAGAATTGCCGCAAAACTTCCGTGAAGTTGCTGCTGTATCTGCAATGAATGTAGGTAATAAGTCGGGGCCAAGAGTTTCATACATGGATTCTCTCCCTCAACTTGGTTCCGAATCTATTGTTGGTGATATTCAACATCCTCGCCAACCACTTAACGCATCTGGCCTACCTTTAGGCTCTATGGGTTCTGGCATTAAAGCTGAATATTCTTTATCTGCATCATTATCTGGCTCCCCAGGTTGGTCTGATGCAAGTAAATCTTTATATCAAGATGTATCTGGCGCAGTTAAATCTGCTGCTAAGGTATATGCTGGTGGTTCAGAACGAGGGATGAACGCATATCGCACTCAATCTTCTGACCGAGATTTGATTCTTGATTCTGCATCTCGTTATACAGACCTTGCAGATGCTCGTTCTGGTTATGACAACCTAAAGAAAGATATTGGTGATGACCCTCGTTACTCTGGTGCATCTATTCGTGGCGTACTGGAAAATGCTCACCAATATAATGAAGCTGAAATCAGTAAGACTTTCGACCCTGCTGAACGTTTAAGAAATACTGGACCTACTGAACTTAGTACATCCAATGAGTTTACGGCAAATCTAAATGAACCGACAAGTTGGAACTCTGCAAGGGATAGAAGAGAGTCTATTGCTATTGCAAATCTCGATAGTGCTAGCGTTGGACAGCATAGTTCGGTTAAGTACCACGATGCGCTCGAACAAGGTACGCAGGAGTGGCTTGATTTTCGTAAGCAGTATGATATTACTGGCTCTACTATTGGTGACTATCTGGGCCACAACCCCGCAACCAATAATAGCCCAATACATACAATGGGCGAAAAGATTGGCCTCACAGTAAGAAAGGATTCCCCACGAGCGCGCGAGAACTTTGAGCGTGGACATAGATTAGAGGCGTGGGCCAGACCCAGGGTAGGTGAACGATATGGGATTGAAATAACTGAAACTGGTGCAATCACAAACGACGACTATCCTGGCATGATGTACTCGCCTGATGGGCTAATTGGTGATGATGCTTTGTGGGAACATAAAGCTCCAAATAACTTTAAAGATTTGGAAACAACTCCAAACTACATGGACCAGATGCAACTTGGTATGCATTTGAGTGGCCGTAGTCGCACACTGTTTACCCAAACTGTTGGCGAAGAGTCCAGAAGTCAGTGGGTTGAAGCCGACCCAACGTGGTTTGAACGTAACAAGAACAAGATTATATCCTCTCAAGCACGCATGAATGCTGGACGCGAGTTTATGGAAAGCTCCGACCTTGAGGGAAAAGACCTTGTTAATGAAACCCGCAAAGTTATGTCTGGTGATGGAATTTGGGGCTACCAGACTCGTGA
CCACAGGGAAGGTGAGGGATATACTGCTGGCAAGCGCGGGATGGCTAAATATAGTGCTGCTGCTGGCACTGCTGCTGACCCGTTTATTGGCTCCCATTCTCCCTACAATCCAGAGGCATCTCGTTCAGGCTACCAACCAAACTTTGTAATGCACGAGCAAAACTTTCCAGCAACCACAGGAAATGGTGATACTGGAAATGACTCGATGGCATTGTCTGTTAAGAAAGGTATCCTTGCTGCTCAGGAAGAGAATAAGCAAAAGGGTATTGGTGCAGACGCAGACTTTGATGGCAAAGCTGATTCAATGGGTTGGAATCAGGAACGATTTGATGCTGCCAATGGTGGTGGAAGTGGTGGCGGCGGTGGTCGTGGCGGCTACTTCACAAGTGGTGGCAACTACTTCGATGACTACGGTCGTATGGGTGGTTCACTTGCTGCTGGCATTGCTGGTGGCAGTATTGGTTCGGCAACCAACGGAGTTATGCAAGCATTGATGGCAACTCCTGCCGGACGTATGGCTGCTGTAGGCATTGGTGCTATTCAGATTGGCAATGAAGCTGCTGAATACATGAATGACTTTATCGGCAACTCGCTTGATGCTGGTGTTATGAATCCTAATGAATATTCTTCCATGTCGCAAGGCTTGGAGATGTTAGGACTCAACTCACAACAAGCGGCACGTATGAATCAAACCACACATAGTGCCTACAACACCATGCTTAACGGCGACCCCAGCGCCGCTGTGAACATCGTTCGCGGCAGTAGGGGATTGCTCACCATAGGTGATATTCGCTCGACTGGCGGCGACCCTGTTGCCCTCGCTCGCATTATGCAGGAAAGAGGCAAGGAACGTGGCTGGAGTCAGGCCCGTATCGCTGGTGCTGCGCAGATGGCTGGGCTGGATGGTATGGCTCGTGCCTTCGACCGCACGGAATACAGCCATGAGCGAGCAGGTTCGGTGGTAGAAAGTGGTAGAAACTCTGACTTTGCCGAAGGTATGGCTCAATCAGAAATGTTGCAGGTGGAGCGCGCACAGCTTCTGCCAGGGTATAACGTGCCACAAAGTGTGCTATCTCATGGTGCTGCACTGTTCGAAGCTGGAAGCACTGCTGCTGGTGCTGCTAACTCTGGATACAGCCAAGCCCGACAAGTTGCTGCAAACGTTTATGATTTCATTGCTGGTGAAGAGTCTGGTGGCAAGGAATACAACAAGGATGGTACACGAGTTACAAGCCCGACTGGTGCTCGTGGAATCATGCAGGTTCTTCCTTCTACTGCTCGTGACCCAGGTTACGGAATCAAACCTTCTGATGGAAGTCCTGAAGATGATGCTCGTGTCGGTCGTGAATACTACGATGCGATGTATAAACGATTCGGTGGCGACCATGAGAAAGCAATGGCTGCTTACACGGATGGTGCTGGAACTGTTGACAAGGCTGTCGATAAGTTTGGAATGGATTGGCTCAGTGCTGTTCCGGCTCAGGCTCAGAAACGTGTTAAAGCATTCCGTGAATGGTCCAAATCTTCTCAATCTTTGGAAGAAGGTGCTACAGGGTTTACTCGCAATGGAATGTCCTACGGTCAAACCCAAACTGTTGTCAATGTTAAGATTGATGCTAAGGTCAACAACCAGGTTGCTTCTGCTACAGTTGCAGTTCCTGGTGGCCAGACTGTAACTCAACAAATGAACATGAACAACGGTGCACAACAAAGACGTTAA
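
The CDS coordinates can be cross-checked against the protein length given in this record. An inclusive range of 38231 to 41950 spans 3720 nucleotides, or 1240 codons, which matches 1239 amino acids plus one stop codon (the CDS ends in TAA). A short sketch of that arithmetic, assuming 1-based inclusive coordinates and a CDS that includes the stop codon; the function names are illustrative.

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of an inclusive, 1-based coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Number of amino acids encoded, assuming the range includes the stop codon."""
    n = cds_length(start, end)
    if n % 3 != 0:
        raise ValueError(f"CDS length {n} is not a multiple of 3")
    return n // 3 - 1  # subtract the terminal stop codon


print(cds_length(38231, 41950))               # 3720
print(expected_protein_length(38231, 41950))  # 1239
```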

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.