Protein

Protein accession
G9FHI5 [UniProt]
Representative
76KKK
Source
UniProt (cluster: phalp2_33438)
Protein name
Tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MATITSLNFAIRSSWDGTALNKARDDLMALQAQMRTISGASLHFTVDADTEEAKAKIAELRAQARDIQMGIDLNDGQLATAEAKLDILARDREVDVRVNDHDIANKMDRISDSFENASVSAVDFGNSSNRAGFMAKVGIAAFAAGLVVLPGLINLVGASLTGAMGGGIIAVGALALKENEQMKAGWEALWTEMKSTAQSAAQPMLQPFLDTMSMAKTRIAELKPELTDLFASAAPAIQPLTSGLMDLVTNAMPGLTQALEQSGPMVQGFADGLGSLGTHVSNMFAGMADGAAGFGRLWKITFDQVGLMMEQFGVAAGKMSETGTDLWDRLLTGFNQFIGGFLDGLVDFTAAVDNGVGDALSVFGLLGDVMRGTLGPLGELVAAISNALMPVLDGTVQALTPVIGALMSGLAPAINSLVPLSNALQPLLVMVGQSLTQAINQLAPLLPPIVDALVSALIPAFQALIPALVPVIEQIAVGLKPLLEMLPGVINTLSPIVVGLAIAFGQITQWLSPLIPHLMQMYVAWKLISTAMAIGRGIMIAYTAVRTALTVATIAHTAAVWANNIAMSANPIGLIIIAIGALVAAIVWLATKTQFFQTVWEALKVAMAAVWNWMKDAWDATVGFLAMRWEQFSGTIVNAWNATWNVIKVAAQAVWGFLTNAWNQFLTVLKVTWEVVSGVIKAAWDVWWFAIRASVAIIWAWLEVAWGALWGTVRRVWDSFVAIFQPLWDGLWNGVKVFVEGVWGGIQILWDGLWKAVTAGWNAFMGWLRPIWESFWNTVKQVGESIWNAVKLAWEFLWDQVKQKWDWWVNLFNSVWQPFWNGIKAAAETVWNGITSAWNAFWNGIKAMWENFSNAIRAGWEAFWNWVRDFASGVWDAITSKWGEFKQKFEEVFSTMVDKAREIWDKIREVFAKPINFVIRIWNDHVAGKFGLPALDEIGGFATGGRVDGKGTGTSDDIPARLSRGEHVWTAKEVDAAGGHGNVEAMRRNTLNGVAHYASGGPVEWMIGQQQKFAPALQVTSAQRDSNDYHGQGKAVDFSNGGDAGTPEMMAFANWIADTWGANTLELIHSPFGRNIKDGNSVGDGMGFYGAGTMAQHRNHVHWAVDRPLNEDEGGQSLLGKIWGGVRKGIAWTFEKLTNPILGAIPDPFLPGVGSPFAGFPKAAATKVRDAFLDKVRGAEGASGGTASGDIGGVIPDGDRLGIINEALRITNTPPPSSIEAWQRGMNTLITRESGWNAGAINNWDSNAAAGNASRGLAQVIPTTFEAHKAPGYNNIDAPVDNVAASINYIKSRYGTIENVQQANANMAPQGYRTGTNSAAAGWHLVGEDGPEMVNFRGGEQVKTFDDIIRALKDSTSGQSKELESKLTAEIRTLVEKIGTEVNTSGARFAQSVEQAIERVLATAGMQLNLSMPVPQNAADAAAYAQEVANQLLPQLEMMIRQRIGTR
Physico‐chemical
properties
protein length:1447 AA
molecular weight:156081,7 Da
isoelectric point:5,23
hydropathy:0,12
Representative Protein Details
Accession
76KKK
Protein name
76KKK
Sequence length
2106 AA
Molecular weight
224023,18950 Da
Isoelectric point
5,77378
Sequence
MATVRSSLEFSITAAYLGKPAMEAARKDFLETRTELQKLADEAINIRVKLSGLDDDIAEIKALTSTPQDVHLHGVADTKTAEGQLNETARFRPVTMKAVAETADAVRDLDQAAHDRTVIIRYRPENAREAATAADDAMRDRTNVIGTEVQGVSEAAAAFDGLNQKRAAEFVATPENVSEANAELNELAKPRVSPIVAKIEADAASQKRMSDLIAQKNRMNYAIEEVRRKRYVEIDEGLGQAKRDVNERYPDRRRNPQSARAAFNEKSGLESEAKNAKNSARMDELRQKNVVQQWADQQGIDERATTKKTKAGIVQDARDQSAAEKAHARKQSQAQKAMESAATAKAKIDKTAEDRTATIRVQAQAGDAKAAIDDVAKRRKLQIDLDLKNAKDELDKAFPGKRPMEIQTMITSAKDNLDFAARNKKLDIDQRSAGAKRDIDRAVATRQVAVYRGENRDQVDADASRTISTIKNTAATGISNAQFEHSAQQREAVITAKADTAEARAQLDEVAKARTARVDTDSSGLDGIKSAADKATKSVKDLASAMALVSLGSAALGAVMTTGLAAGVIAAGGAAIAVNKRLAASHKEAMDAAQLQAETAKRDLAQAYRDVADTAIESSQRIAAAQHEVQMADRNEEDARRSLTEAYRDARQELEDLQLQLEQAPINQRSADIGVARAYQNLTQLGKRQDVTPLDYEDAFNQIDEAKAHQDEVRVKNQQLQDNAAVAAKKGVEGNDKVIKGKEALADADYQQKVSQQDLVNTQRETAEAQLKAAESVKVALQEQARANSELAKATKDANSAMSQMSAMFADLAAPLQGPLHNAMQSLKQQFMDMKPVIQQGFQSAADYVQPFTDALTGLMLGPVPGVVTALQNSQPAVTGFRDGMRSLGTDIGTMFSQMSAGSAGFGDLWRSLGDQVGKFLIQIGTFIGQYAGPASQALSTLLQGVNDLAAGFLNGLGPGMKDLPVIASAIASVMKDVGQIIGILVQSSFPAIIAIAPIIKDLATGFAAVMNWLQPMLPILAPLAAGWFLLDAALDLNPFVLIAGAIAGVAAGIGYLATQTKVFQDIWNKMPKGVQEFGAAVVNDTKKAGGAVASFATKKDSNGQTGLQKTGSAIMHAGSSVGGAVVSAGKSLGKTFAPELQDMKATFGGIWSSLKAEWDKDLKPAFDSLWSSLKVLWNNSKPILDFIGGAFVAIGKVVMSVISGVIGPIIGVFVDTIKNIVNAVRGLIMMISGFFETVKGVFEICFNFFKTIFGLLKGVFTGDFSTFKSGLDGIGNGFKDMLSGLGTFFAGVWHLVTSLFGEVIDILKGAWKTVEGIVLGIVNGIIHWFEWLYDEMVGHSIIPDLVNDVIKWFESLPDKLIKLVEDLGKKIADAFTKLWGDVKDIASKAWDALKNDTGIGSFVDGIKGTFSGLLKDIGDIWDGIKKVFATPINWVIGIWNNDIVGKIPGLGKIDTIPGYASGGQPDGSPGYINGTGGSREDKHVVAVSNREYIVNAEATSRNRAVLDAMNFGGERAMIPGFALGGTPAHAEAEVPGFSLGGIAFRHMRKAAGLRFADGGPTDGSTNPAIARALQWAAGYQGRPYNDQGWLDCSGLASGIYDSLLGRTPKREFTTTSDFTALGFVPGRGGIMEIGVTPLPGNYGHMATTLAGHKLESGGVHDDIRVDGPAIGADDSQFADHYYLPGKFFNPAYSGAGADGDKGSGGGFFGMIGSAFQAVVGGVRSGISDLFQKLTDPVLNAIPDPMIGGQKGFLGSFPKQIATKSRDDIANLIRGHESTNTIGGAIPTGDRLAVIDAALALTHTPPPGTKEQWEAGMNTLVERESGWNTGAINDWDDNAKAGNPSKGLAQTTGTTFAAFAAPGHKNIFEGVDNLAASINYIKSKYGGIDRVQQANANMPPKGYATGTTNAKKGWSLVGELGPELVLFGGGETVIPNNMLGNLNDNWNAAAQANGLGLKAQSAGQAFAKANFDQFVGDLGGSTSGDGLAEAAFTQVPSYLLALDAYQKSGKLGKPAYQTLSNQIQGDLSKPPANPLDPGQEGVTTQPGTPPPGAQPQQQQNQNPLSQFLHPTEVHFHVSDVDEAMGKWKQTQREATLGFDPLGIFGS
Other Proteins in cluster: phalp2_33438
Total (incl. this protein): 6 Avg length: 1868,8 Avg pI: 5,64

Protein ID Length (AA) pI
76KKK 2106 5,77378
19x1p 1815 5,72286
3gn6D 1830 6,38293
4LB46 2518 5,62680
A0AA47KXN5 1497 5,11229
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_40557
4MMrj
1 34,3% 1782 0.000E+00
2 phalp2_22507
1fUaK
1 27,4% 1768 2.363E-178
3 phalp2_36889
7ukoa
30 22,6% 1607 1.222E-78
4 phalp2_33467
7m6oK
34 22,7% 1422 1.943E-69
5 phalp2_6553
11DGv
4 22,5% 1797 2.945E-65
6 phalp2_21847
4rAYx
3 22,0% 1403 1.818E-55
7 phalp2_18324
5inLv
5 20,8% 1509 3.601E-51
8 phalp2_18800
1dwRh
1 22,3% 1502 7.118E-47
9 phalp2_9474
HzvZ
51 21,4% 1474 2.883E-41
10 phalp2_9295
7jp7L
18 20,7% 1380 1.189E-30

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhodococcus phage REQ1
[NCBI]
1109712 No lineage information
Host Rhodococcus equi
[NCBI]
43767 Actinobacteria > Actinobacteria > Corynebacteriales > Nocardiaceae > Rhodococcus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
JN116825 [NCBI]
CDS location
range 38584 -> 42927
strand +
CDS
ATGGCAACGATCACCTCGCTCAACTTCGCGATCAGGAGTAGTTGGGACGGGACTGCGCTGAACAAGGCGCGTGATGACCTAATGGCTCTGCAGGCTCAGATGCGGACGATCTCGGGTGCCTCGCTGCACTTCACGGTCGACGCCGACACCGAAGAGGCGAAAGCCAAGATCGCGGAGCTGCGCGCACAGGCGCGGGACATCCAGATGGGCATCGACCTGAACGACGGCCAACTCGCTACGGCGGAGGCCAAACTCGACATCCTGGCTCGCGATCGAGAAGTCGACGTGAGGGTGAACGACCATGACATCGCGAACAAGATGGACCGGATCTCGGACTCCTTCGAGAACGCGTCGGTCAGCGCGGTGGACTTCGGCAACTCGTCGAACCGTGCGGGCTTCATGGCCAAGGTCGGCATCGCGGCGTTCGCCGCCGGTCTCGTCGTTCTGCCCGGACTGATCAACCTCGTCGGTGCATCGCTGACCGGCGCGATGGGTGGCGGCATCATCGCTGTCGGCGCGCTGGCGCTCAAGGAGAACGAGCAGATGAAGGCGGGCTGGGAGGCCCTGTGGACCGAGATGAAGTCCACGGCCCAGTCTGCCGCCCAGCCGATGCTGCAGCCGTTCCTGGACACCATGTCGATGGCCAAGACGCGGATCGCGGAACTCAAGCCGGAGCTGACAGACCTCTTCGCGAGTGCAGCCCCGGCGATCCAGCCGCTCACGTCCGGCCTGATGGATCTCGTCACGAACGCGATGCCCGGCCTCACGCAGGCGCTCGAGCAGTCCGGCCCGATGGTCCAAGGGTTCGCCGATGGCCTCGGTTCGCTCGGCACGCACGTCTCGAACATGTTCGCCGGGATGGCAGACGGCGCAGCGGGATTCGGTCGACTGTGGAAGATCACGTTCGATCAGGTCGGCCTGATGATGGAGCAGTTCGGCGTCGCTGCCGGGAAGATGTCCGAGACGGGCACCGACCTCTGGGATCGACTGCTGACCGGCTTCAACCAGTTCATCGGCGGCTTCCTCGACGGCCTCGTCGACTTCACGGCCGCCGTCGACAACGGGGTCGGTGACGCGCTCTCGGTCTTCGGTCTGCTCGGCGACGTCATGCGCGGCACGCTCGGACCGCTCGGCGAGCTGGTGGCCGCGATCTCGAACGCCCTCATGCCCGTCCTCGACGGCACCGTGCAGGCGCTCACCCCCGTCATCGGCGCACTGATGTCCGGCCTCGCTCCGGCGATCAACTCGCTCGTCCCGCTCAGCAATGCGCTGCAGCCGTTGCTCGTCATGGTCGGCCAGTCGCTGACCCAGGCGATCAACCAGCTCGCACCGCTCCTGCCCCCGATCGTGGACGCGCTCGTCTCCGCGCTCATCCCCGCGTTCCAGGCGCTCATTCCCGCACTGGTTCCGGTGATCGAGCAGATCGCAGTAGGCCTCAAGCCGCTGCTGGAGATGCTGCCCGGCGTGATCAACACGCTCAGCCCGATCGTCGTCGGCCTGGCCATCGCGTTCGGCCAGATCACGCAGTGGCTGTCCCCGCTGATCCCGCACCTCATGCAGATGTACGTCGCGTGGAAGCTGATCTCGACGGCCATGGCTATCGGGCGCGGCATCATGATCGCGTACACGGCAGTCCGCACGGCCCTCACGGTGGCGACCATCGCTCACACGGCGGCCGTCTGGGCGAACAACATCGCGATGAGCGCCAACCCGATCGGCCTGATCATCATCGCCATCGGCGCTCTGGTGGCAGCGATCGTATGGCTCGCGACCAAGACTCAGTTCTTCCAGACCGTCTGGGAGGCACTGAAGGTCGCGATGGCGGCTGTGTGGAACTGGATGAAGGATGCGTGGGATGCCACGGTCGGATTCCTGGCCATGCGGTGGGAGCAGTTCTCCGGCACGATCGTCAACGCGTGGAACGCCACGTGGAACGTGATCAAGGTAGCGGCACAGGCTGTCTGGGGATTCCTCACCAACGCCTGGAACCAGTTCCTCACGGTCCTCAAGGTCACCTGGGAGGTCGTCTCCGGCGTCATCAAGGCGGCCTGGGACGTCTGGTGGTTCGCCATCCGCGCGAGTGTCGCGATCATCTGGGCGTGGCTCGAAGTCGCCTGGGGTGCACTGTGGGGCACTGTCCGGCGAGTCTGGGATAGCTTCGTAGCGATCTTCCAGCCGCTGTGGGATGGACTCTGGAACGGCGTCAAGGTCTTCGTCGAGGGCGTCTGGGGTGGAATCCAGATCCTCTGGGACGGACTGTGGAAGGCCGTGACGGCAGGCTGGAACGCCTTCATGGGCTGGCTGCGTCCGATCTGGGAGTCCTTCTGGAACACCGTGAAGCAGGTCGGTGAATCCATCTGGAACGCCGTCAAGCTCGCCTGGGAGTTCCTGTGGGATCAGGTCAAGCAGAAGTGGGACTGGTGGGTCAACCTGTTCAACTCTGTCTGGCAGCCCTTCTGGAACGGGATCAAGGCTGCAGCCGAGACCGTCTGGAACGGAATCACTTCCGCGTGGAACGCCTTCTGGAACGGCATCAAGGCGATGTGGGAGAACTTCTCCAACGCCATCCGTGCCGGGTGGGAAGCCTTCTGGAACTGGGTGCGTGACTTCGCATCTGGCGTCTGGGATGCGATCACCAGCAAGTGGGGCGAGTTCAAGCAGAAGTTCGAAGAAGTCTTCAGCACGATGGTCGACAAGGCCCGCGAGATCTGGGACAAGATCCGAGAAGTCTTCGCGAAGCCGATCAACTTCGTCATCCGCATCTGGAACGACCACGTCGCCGGTAAGTTCGGCCTCCCGGCACTCGACGAGATCGGCGGGTTCGCTACCGGTGGTCGGGTCGACGGCAAGGGCACGGGAACGTCCGACGACATCCCCGCCCGGCTCTCGCGCGGTGAGCACGTCTGGACCGCAAAGGAAGTCGATGCGGCTGGCGGACACGGCAACGTCGAGGCCATGCGCCGCAACACGCTCAACGGTGTCGCGCACTACGCCTCCGGCGGTCCGGTCGAGTGGATGATCGGCCAGCAGCAGAAGTTCGCGCCTGCCCTGCAGGTCACTTCGGCTCAGCGGGACTCGAACGACTACCACGGCCAGGGCAAGGCGGTGGACTTCTCGAACGGCGGCGACGCCGGTACACCGGAAATGATGGCCTTCGCGAACTGGATCGCGGACACCTGGGGTGCTAACACGCTCGAGCTGATCCACAGCCCGTTCGGACGGAACATCAAGGACGGCAACAGCGTCGGCGACGGCATGGGGTTCTACGGGGCAGGCACGATGGCCCAGCACAGGAACCATGTCCACTGGGCAGTCGACCGCCCCCTCAACGAGGACGAGGGCGGCCAGAGCCTGCTGGGCAAGATCTGGGGCGGCGTCCGCAAGGGCATCGCCTGGACCTTCGAGAAGCTGACCAACCCGATCCTCGGAGCCATCCCGGATCCGTTCCTTCCCGGTGTCGGTTCGCCGTTCGCCGGATTCCCGAAGGCAGCGGCAACGAAGGTCCGAGACGCGTTCCTCGACAAGGTGCGCGGCGCTGAGGGTGCATCCGGCGGCACTGCCAGTGGAGACATCGGCGGCGTCATCCCCGACGGGGACCGGCTCGGCATCATCAACGAGGCCCTGCGGATCACGAACACCCCGCCTCCCTCCTCGATCGAAGCATGGCAGCGCGGCATGAACACGCTGATCACGCGAGAGTCGGGCTGGAACGCGGGCGCGATCAACAACTGGGACTCGAACGCAGCGGCGGGCAACGCTTCTCGCGGTCTGGCCCAGGTGATCCCGACGACGTTCGAGGCTCACAAGGCACCGGGCTACAACAACATCGACGCGCCGGTCGACAACGTCGCGGCCTCGATCAACTACATCAAGTCGCGCTACGGCACCATCGAGAACGTCCAGCAGGCGAACGCCAACATGGCCCCTCAGGGCTACCGCACGGGTACCAACTCGGCTGCGGCAGGGTGGCACCTCGTCGGCGAAGACGGACCCGAGATGGTGAACTTCCGTGGCGGCGAGCAGGTCAAGACGTTCGACGACATCATCCGCGCCCTGAAGGACTCCACGTCCGGACAGAGCAAGGAGCTGGAGTCGAAGCTGACGGCCGAGATTCGCACGCTGGTGGAGAAGATCGGGACCGAGGTCAACACCTCCGGAGCGCGGTTCGCGCAGTCTGTCGAGCAGGCTATCGAGCGAGTGCTGGCCACGGCCGGTATGCAGCTCAACCTCTCGATGCCGGTACCGCAGAATGCAGCCGACGCAGCAGCGTACGCGCAGGAAGTCGCGAATCAGCTTCTCCCGCAACTGGAGATGATGATCCGGCAGCGTATCGGCACACGGTAA

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00023eec70_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (76KKK) rather than this protein.
PDB ID
76KKK
Method AlphaFoldv2
Resolution 48.55
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50