Protein
- Protein accession
- G9FHI5 [UniProt]
- Representative
- 76KKK
- Source
- UniProt (cluster: phalp2_33438)
- Protein name
- Tape measure protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MATITSLNFAIRSSWDGTALNKARDDLMALQAQMRTISGASLHFTVDADTEEAKAKIAELRAQARDIQMGIDLNDGQLATAEAKLDILARDREVDVRVNDHDIANKMDRISDSFENASVSAVDFGNSSNRAGFMAKVGIAAFAAGLVVLPGLINLVGASLTGAMGGGIIAVGALALKENEQMKAGWEALWTEMKSTAQSAAQPMLQPFLDTMSMAKTRIAELKPELTDLFASAAPAIQPLTSGLMDLVTNAMPGLTQALEQSGPMVQGFADGLGSLGTHVSNMFAGMADGAAGFGRLWKITFDQVGLMMEQFGVAAGKMSETGTDLWDRLLTGFNQFIGGFLDGLVDFTAAVDNGVGDALSVFGLLGDVMRGTLGPLGELVAAISNALMPVLDGTVQALTPVIGALMSGLAPAINSLVPLSNALQPLLVMVGQSLTQAINQLAPLLPPIVDALVSALIPAFQALIPALVPVIEQIAVGLKPLLEMLPGVINTLSPIVVGLAIAFGQITQWLSPLIPHLMQMYVAWKLISTAMAIGRGIMIAYTAVRTALTVATIAHTAAVWANNIAMSANPIGLIIIAIGALVAAIVWLATKTQFFQTVWEALKVAMAAVWNWMKDAWDATVGFLAMRWEQFSGTIVNAWNATWNVIKVAAQAVWGFLTNAWNQFLTVLKVTWEVVSGVIKAAWDVWWFAIRASVAIIWAWLEVAWGALWGTVRRVWDSFVAIFQPLWDGLWNGVKVFVEGVWGGIQILWDGLWKAVTAGWNAFMGWLRPIWESFWNTVKQVGESIWNAVKLAWEFLWDQVKQKWDWWVNLFNSVWQPFWNGIKAAAETVWNGITSAWNAFWNGIKAMWENFSNAIRAGWEAFWNWVRDFASGVWDAITSKWGEFKQKFEEVFSTMVDKAREIWDKIREVFAKPINFVIRIWNDHVAGKFGLPALDEIGGFATGGRVDGKGTGTSDDIPARLSRGEHVWTAKEVDAAGGHGNVEAMRRNTLNGVAHYASGGPVEWMIGQQQKFAPALQVTSAQRDSNDYHGQGKAVDFSNGGDAGTPEMMAFANWIADTWGANTLELIHSPFGRNIKDGNSVGDGMGFYGAGTMAQHRNHVHWAVDRPLNEDEGGQSLLGKIWGGVRKGIAWTFEKLTNPILGAIPDPFLPGVGSPFAGFPKAAATKVRDAFLDKVRGAEGASGGTASGDIGGVIPDGDRLGIINEALRITNTPPPSSIEAWQRGMNTLITRESGWNAGAINNWDSNAAAGNASRGLAQVIPTTFEAHKAPGYNNIDAPVDNVAASINYIKSRYGTIENVQQANANMAPQGYRTGTNSAAAGWHLVGEDGPEMVNFRGGEQVKTFDDIIRALKDSTSGQSKELESKLTAEIRTLVEKIGTEVNTSGARFAQSVEQAIERVLATAGMQLNLSMPVPQNAADAAAYAQEVANQLLPQLEMMIRQRIGTR
- Physico‐chemical
properties -
protein length: 1447 AA molecular weight: 156081,7 Da isoelectric point: 5,23 hydropathy: 0,12
Representative Protein Details
- Accession
- 76KKK
- Protein name
- 76KKK
- Sequence length
- 2106 AA
- Molecular weight
- 224023,18950 Da
- Isoelectric point
- 5,77378
- Sequence
-
MATVRSSLEFSITAAYLGKPAMEAARKDFLETRTELQKLADEAINIRVKLSGLDDDIAEIKALTSTPQDVHLHGVADTKTAEGQLNETARFRPVTMKAVAETADAVRDLDQAAHDRTVIIRYRPENAREAATAADDAMRDRTNVIGTEVQGVSEAAAAFDGLNQKRAAEFVATPENVSEANAELNELAKPRVSPIVAKIEADAASQKRMSDLIAQKNRMNYAIEEVRRKRYVEIDEGLGQAKRDVNERYPDRRRNPQSARAAFNEKSGLESEAKNAKNSARMDELRQKNVVQQWADQQGIDERATTKKTKAGIVQDARDQSAAEKAHARKQSQAQKAMESAATAKAKIDKTAEDRTATIRVQAQAGDAKAAIDDVAKRRKLQIDLDLKNAKDELDKAFPGKRPMEIQTMITSAKDNLDFAARNKKLDIDQRSAGAKRDIDRAVATRQVAVYRGENRDQVDADASRTISTIKNTAATGISNAQFEHSAQQREAVITAKADTAEARAQLDEVAKARTARVDTDSSGLDGIKSAADKATKSVKDLASAMALVSLGSAALGAVMTTGLAAGVIAAGGAAIAVNKRLAASHKEAMDAAQLQAETAKRDLAQAYRDVADTAIESSQRIAAAQHEVQMADRNEEDARRSLTEAYRDARQELEDLQLQLEQAPINQRSADIGVARAYQNLTQLGKRQDVTPLDYEDAFNQIDEAKAHQDEVRVKNQQLQDNAAVAAKKGVEGNDKVIKGKEALADADYQQKVSQQDLVNTQRETAEAQLKAAESVKVALQEQARANSELAKATKDANSAMSQMSAMFADLAAPLQGPLHNAMQSLKQQFMDMKPVIQQGFQSAADYVQPFTDALTGLMLGPVPGVVTALQNSQPAVTGFRDGMRSLGTDIGTMFSQMSAGSAGFGDLWRSLGDQVGKFLIQIGTFIGQYAGPASQALSTLLQGVNDLAAGFLNGLGPGMKDLPVIASAIASVMKDVGQIIGILVQSSFPAIIAIAPIIKDLATGFAAVMNWLQPMLPILAPLAAGWFLLDAALDLNPFVLIAGAIAGVAAGIGYLATQTKVFQDIWNKMPKGVQEFGAAVVNDTKKAGGAVASFATKKDSNGQTGLQKTGSAIMHAGSSVGGAVVSAGKSLGKTFAPELQDMKATFGGIWSSLKAEWDKDLKPAFDSLWSSLKVLWNNSKPILDFIGGAFVAIGKVVMSVISGVIGPIIGVFVDTIKNIVNAVRGLIMMISGFFETVKGVFEICFNFFKTIFGLLKGVFTGDFSTFKSGLDGIGNGFKDMLSGLGTFFAGVWHLVTSLFGEVIDILKGAWKTVEGIVLGIVNGIIHWFEWLYDEMVGHSIIPDLVNDVIKWFESLPDKLIKLVEDLGKKIADAFTKLWGDVKDIASKAWDALKNDTGIGSFVDGIKGTFSGLLKDIGDIWDGIKKVFATPINWVIGIWNNDIVGKIPGLGKIDTIPGYASGGQPDGSPGYINGTGGSREDKHVVAVSNREYIVNAEATSRNRAVLDAMNFGGERAMIPGFALGGTPAHAEAEVPGFSLGGIAFRHMRKAAGLRFADGGPTDGSTNPAIARALQWAAGYQGRPYNDQGWLDCSGLASGIYDSLLGRTPKREFTTTSDFTALGFVPGRGGIMEIGVTPLPGNYGHMATTLAGHKLESGGVHDDIRVDGPAIGADDSQFADHYYLPGKFFNPAYSGAGADGDKGSGGGFFGMIGSAFQAVVGGVRSGISDLFQKLTDPVLNAIPDPMIGGQKGFLGSFPKQIATKSRDDIANLIRGHESTNTIGGAIPTGDRLAVIDAALALTHTPPPGTKEQWEAGMNTLVERESGWNTGAINDWDDNAKAGNPSKGLAQTTGTTFAAFAAPGHKNIFEGVDNLAASINYIKSKYGGIDRVQQANANMPPKGYATGTTNAKKGWSLVGELGPELVLFGGGETVIPNNMLGNLNDNWNAAAQANGLGLKAQSAGQAFAKANFDQFVGDLGGSTSGDGLAEAAFTQVPSYLLALDAYQKSGKLGKPAYQTLSNQIQGDLSKPPANPLDPGQEGVTTQPGTPPPGAQPQQQQNQNPLSQFLHPTEVHFHVSDVDEAMGKWKQTQREATLGFDPLGIFGS
Other Proteins in cluster: phalp2_33438
| Total (incl. this protein): 6 | Avg length: 1868,8 | Avg pI: 5,64 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 76KKK | 2106 | 5,77378 |
| 19x1p | 1815 | 5,72286 |
| 3gn6D | 1830 | 6,38293 |
| 4LB46 | 2518 | 5,62680 |
| A0AA47KXN5 | 1497 | 5,11229 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_40557
4MMrj
|
1 | 34,3% | 1782 | 0.000E+00 |
| 2 |
phalp2_22507
1fUaK
|
1 | 27,4% | 1768 | 2.363E-178 |
| 3 |
phalp2_36889
7ukoa
|
30 | 22,6% | 1607 | 1.222E-78 |
| 4 |
phalp2_33467
7m6oK
|
34 | 22,7% | 1422 | 1.943E-69 |
| 5 |
phalp2_6553
11DGv
|
4 | 22,5% | 1797 | 2.945E-65 |
| 6 |
phalp2_21847
4rAYx
|
3 | 22,0% | 1403 | 1.818E-55 |
| 7 |
phalp2_18324
5inLv
|
5 | 20,8% | 1509 | 3.601E-51 |
| 8 |
phalp2_18800
1dwRh
|
1 | 22,3% | 1502 | 7.118E-47 |
| 9 |
phalp2_9474
HzvZ
|
51 | 21,4% | 1474 | 2.883E-41 |
| 10 |
phalp2_9295
7jp7L
|
18 | 20,7% | 1380 | 1.189E-30 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Rhodococcus phage REQ1 [NCBI] |
1109712 | No lineage information |
| Host |
Rhodococcus equi [NCBI] |
43767 | Actinobacteria > Actinobacteria > Corynebacteriales > Nocardiaceae > Rhodococcus > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
JN116825
[NCBI]
CDS location
range 38584 -> 42927
strand +
strand +
CDS
ATGGCAACGATCACCTCGCTCAACTTCGCGATCAGGAGTAGTTGGGACGGGACTGCGCTGAACAAGGCGCGTGATGACCTAATGGCTCTGCAGGCTCAGATGCGGACGATCTCGGGTGCCTCGCTGCACTTCACGGTCGACGCCGACACCGAAGAGGCGAAAGCCAAGATCGCGGAGCTGCGCGCACAGGCGCGGGACATCCAGATGGGCATCGACCTGAACGACGGCCAACTCGCTACGGCGGAGGCCAAACTCGACATCCTGGCTCGCGATCGAGAAGTCGACGTGAGGGTGAACGACCATGACATCGCGAACAAGATGGACCGGATCTCGGACTCCTTCGAGAACGCGTCGGTCAGCGCGGTGGACTTCGGCAACTCGTCGAACCGTGCGGGCTTCATGGCCAAGGTCGGCATCGCGGCGTTCGCCGCCGGTCTCGTCGTTCTGCCCGGACTGATCAACCTCGTCGGTGCATCGCTGACCGGCGCGATGGGTGGCGGCATCATCGCTGTCGGCGCGCTGGCGCTCAAGGAGAACGAGCAGATGAAGGCGGGCTGGGAGGCCCTGTGGACCGAGATGAAGTCCACGGCCCAGTCTGCCGCCCAGCCGATGCTGCAGCCGTTCCTGGACACCATGTCGATGGCCAAGACGCGGATCGCGGAACTCAAGCCGGAGCTGACAGACCTCTTCGCGAGTGCAGCCCCGGCGATCCAGCCGCTCACGTCCGGCCTGATGGATCTCGTCACGAACGCGATGCCCGGCCTCACGCAGGCGCTCGAGCAGTCCGGCCCGATGGTCCAAGGGTTCGCCGATGGCCTCGGTTCGCTCGGCACGCACGTCTCGAACATGTTCGCCGGGATGGCAGACGGCGCAGCGGGATTCGGTCGACTGTGGAAGATCACGTTCGATCAGGTCGGCCTGATGATGGAGCAGTTCGGCGTCGCTGCCGGGAAGATGTCCGAGACGGGCACCGACCTCTGGGATCGACTGCTGACCGGCTTCAACCAGTTCATCGGCGGCTTCCTCGACGGCCTCGTCGACTTCACGGCCGCCGTCGACAACGGGGTCGGTGACGCGCTCTCGGTCTTCGGTCTGCTCGGCGACGTCATGCGCGGCACGCTCGGACCGCTCGGCGAGCTGGTGGCCGCGATCTCGAACGCCCTCATGCCCGTCCTCGACGGCACCGTGCAGGCGCTCACCCCCGTCATCGGCGCACTGATGTCCGGCCTCGCTCCGGCGATCAACTCGCTCGTCCCGCTCAGCAATGCGCTGCAGCCGTTGCTCGTCATGGTCGGCCAGTCGCTGACCCAGGCGATCAACCAGCTCGCACCGCTCCTGCCCCCGATCGTGGACGCGCTCGTCTCCGCGCTCATCCCCGCGTTCCAGGCGCTCATTCCCGCACTGGTTCCGGTGATCGAGCAGATCGCAGTAGGCCTCAAGCCGCTGCTGGAGATGCTGCCCGGCGTGATCAACACGCTCAGCCCGATCGTCGTCGGCCTGGCCATCGCGTTCGGCCAGATCACGCAGTGGCTGTCCCCGCTGATCCCGCACCTCATGCAGATGTACGTCGCGTGGAAGCTGATCTCGACGGCCATGGCTATCGGGCGCGGCATCATGATCGCGTACACGGCAGTCCGCACGGCCCTCACGGTGGCGACCATCGCTCACACGGCGGCCGTCTGGGCGAACAACATCGCGATGAGCGCCAACCCGATCGGCCTGATCATCATCGCCATCGGCGCTCTGGTGGCAGCGATCGTATGGCTCGCGACCAAGACTCAGTTCTTCCAGACCGTCTGGGAGGCACTGAAGGTCGCGATGGCGGCTGTGTGGAACTGGATGAAGGATGCGTGGGATGCCACGGTCGGATTCCTGGCCATGCGGTGGGAGCAGTTCTCCGGCACGATCGTCAACGCGTGGAACGCCACGTGGAACGTGATCAAGGTAGCGGCACAGGCTGTCTGGGGATTCCTCACCAACGCCTGGAACCAGTTCCTCACGGTCCTCAAGGTCACCTGGGAGGTCGTCTCCGGCGTCATCAAGGCGGCCTGGGACGTCTGGTGGTTCGCCATCCGCGCGAGTGTCGCGATCATCTGGGCGTGGCTCGAAGTCGCCTGGGGTGCACTGTGGGGCACTGTCCGGCGAGTCTGGGATAGCTTCGTAGCGATCTTCCAGCCGCTGTGGGATGGACTCTGGAACGGCGTCAAGGTCTTCGTCGAGGGCGTCTGGGGTGGAATCCAGATCCTCTGGGACGGACTGTGGAAGGCCGTGACGGCAGGCTGGAACGCCTTCATGGGCTGGCTGCGTCCGATCTGGGAGTCCTTCTGGAACACCGTGAAGCAGGTCGGTGAATCCATCTGGAACGCCGTCAAGCTCGCCTGGGAGTTCCTGTGGGATCAGGTCAAGCAGAAGTGGGACTGGTGGGTCAACCTGTTCAACTCTGTCTGGCAGCCCTTCTGGAACGGGATCAAGGCTGCAGCCGAGACCGTCTGGAACGGAATCACTTCCGCGTGGAACGCCTTCTGGAACGGCATCAAGGCGATGTGGGAGAACTTCTCCAACGCCATCCGTGCCGGGTGGGAAGCCTTCTGGAACTGGGTGCGTGACTTCGCATCTGGCGTCTGGGATGCGATCACCAGCAAGTGGGGCGAGTTCAAGCAGAAGTTCGAAGAAGTCTTCAGCACGATGGTCGACAAGGCCCGCGAGATCTGGGACAAGATCCGAGAAGTCTTCGCGAAGCCGATCAACTTCGTCATCCGCATCTGGAACGACCACGTCGCCGGTAAGTTCGGCCTCCCGGCACTCGACGAGATCGGCGGGTTCGCTACCGGTGGTCGGGTCGACGGCAAGGGCACGGGAACGTCCGACGACATCCCCGCCCGGCTCTCGCGCGGTGAGCACGTCTGGACCGCAAAGGAAGTCGATGCGGCTGGCGGACACGGCAACGTCGAGGCCATGCGCCGCAACACGCTCAACGGTGTCGCGCACTACGCCTCCGGCGGTCCGGTCGAGTGGATGATCGGCCAGCAGCAGAAGTTCGCGCCTGCCCTGCAGGTCACTTCGGCTCAGCGGGACTCGAACGACTACCACGGCCAGGGCAAGGCGGTGGACTTCTCGAACGGCGGCGACGCCGGTACACCGGAAATGATGGCCTTCGCGAACTGGATCGCGGACACCTGGGGTGCTAACACGCTCGAGCTGATCCACAGCCCGTTCGGACGGAACATCAAGGACGGCAACAGCGTCGGCGACGGCATGGGGTTCTACGGGGCAGGCACGATGGCCCAGCACAGGAACCATGTCCACTGGGCAGTCGACCGCCCCCTCAACGAGGACGAGGGCGGCCAGAGCCTGCTGGGCAAGATCTGGGGCGGCGTCCGCAAGGGCATCGCCTGGACCTTCGAGAAGCTGACCAACCCGATCCTCGGAGCCATCCCGGATCCGTTCCTTCCCGGTGTCGGTTCGCCGTTCGCCGGATTCCCGAAGGCAGCGGCAACGAAGGTCCGAGACGCGTTCCTCGACAAGGTGCGCGGCGCTGAGGGTGCATCCGGCGGCACTGCCAGTGGAGACATCGGCGGCGTCATCCCCGACGGGGACCGGCTCGGCATCATCAACGAGGCCCTGCGGATCACGAACACCCCGCCTCCCTCCTCGATCGAAGCATGGCAGCGCGGCATGAACACGCTGATCACGCGAGAGTCGGGCTGGAACGCGGGCGCGATCAACAACTGGGACTCGAACGCAGCGGCGGGCAACGCTTCTCGCGGTCTGGCCCAGGTGATCCCGACGACGTTCGAGGCTCACAAGGCACCGGGCTACAACAACATCGACGCGCCGGTCGACAACGTCGCGGCCTCGATCAACTACATCAAGTCGCGCTACGGCACCATCGAGAACGTCCAGCAGGCGAACGCCAACATGGCCCCTCAGGGCTACCGCACGGGTACCAACTCGGCTGCGGCAGGGTGGCACCTCGTCGGCGAAGACGGACCCGAGATGGTGAACTTCCGTGGCGGCGAGCAGGTCAAGACGTTCGACGACATCATCCGCGCCCTGAAGGACTCCACGTCCGGACAGAGCAAGGAGCTGGAGTCGAAGCTGACGGCCGAGATTCGCACGCTGGTGGAGAAGATCGGGACCGAGGTCAACACCTCCGGAGCGCGGTTCGCGCAGTCTGTCGAGCAGGCTATCGAGCGAGTGCTGGCCACGGCCGGTATGCAGCTCAACCTCTCGATGCCGGTACCGCAGAATGCAGCCGACGCAGCAGCGTACGCGCAGGAAGTCGCGAATCAGCTTCTCCCGCAACTGGAGATGATGATCCGGCAGCGTATCGGCACACGGTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi00023eec70_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(76KKK)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50