Protein
- Protein accession
- A0A2H4PA19 [UniProt]
- Representative
- 75tux
- Source
- UniProt (cluster: phalp2_2692)
- Protein name
- Tape measure protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MADSANVGYASLQVIPTVAGISGNLTRQLGLPFQRAGQQAGQNAGRAIADGVEAAAARVRAASDKVAKARETEIAASAKVGIAEAKLQELRDRGITGGSRWLRAEEDLRRARAASSTATDRARRAAEDLTQAQQDAANATDDVTDSAEGGGSALGGLGDKVKDLVGNFKVAALGAAGLGAAITAGIASNMDTEVINDKLAASLGAGPAEAAHYGKVTADIYRANFGDSMEQVSDSVGIVASSFKNLAESGKGSLTGVTETALNFSKTFGTEVEESVQTVSTLMSTGLVKDANQGFDLMTTAFQKVPAAMRGELPEILHEYGTNFQALGFSGEQAFSMLVTAAGKGKFALDKTGDALKEFTIRGSDMSKASTDAYKAIGLDAEEMSRKVASGGPAAQEALQATAKGLLGIQDPATRANTAIALFGTPLEDLSVDQIPAFLEGLTGGQNNMAGFAGSAERMGNTLNDNTTAKVEAFKRSLQGGLNDAIGQGVTGLMNFTGWVKDNGVAVGITAGILGVVLAPALITAAVGWTQTGIAATRSAAAQLAASYRTVGGWIAMSASAVANAATTSGAWIASGARAAGAWIALKAQAVGSFIATAAGATANAAVTTGAWVASGARSAGAWAAMKLAAVGSFIATGASAVVQAAITAGAWVASQGRMALSIGIATAAFIAQKVATVAGTVATGALTVAQWALNAAMSANPIGLVIAIVVALVAAIVLAYRNSETFRNIVQAAWQGIQVAASFAWNSILKPALDGFLAALGWVGDKAMWLWQNVMIPAWDAIKNAISAVWNFIRPILDNIGKGIHAVGEIAAKVGDAMRNAFNGVVDVLKTPIHAVGKLLASIPDKVLGISIPGASTIKSWGETLQSLRVGGVVNNGMAGRTRNGVFWGPGSGTDDAILGVDAYGMPTALVSNKEGVVTADAMGHGGAGVVAALNSGWRPNAAQLQALGLPAFADGGVVGEPYGLPTGSNISYGAPGFPDWVTKLGDQHKVKPSTYAGHQESDRNEAGYAPNPAHQNRGIDWSGPVEAMDAFARYLLGIAPDTPALEQIIWQNPGTGEKVGWHGRTPDAGFSYFAADYGGHTDHVHTRQSAAFGGAAKPKIPDTNTQVPGYVPPTGLNPDGTTPDINSLGTNNTAPTTPSTTPQAETKRLKTFKELGSDLGGILFGGVEEFFGDSVPAWVWDANKLTEGADTGDNVRTSDNKGTTGNTTTPSTTTPTPGTTGDQTAPSTTGGFAPDTPETLKAAEDSTKNKTPVGVAGAPVIKYNPGAGAEQWTPLAEWAINYVNKSLKGPAQTAAMIGQTSDESGGNPRAVNNSDINAQNGTPSGGLLQVIEPTFQANRDPRLPNDKFDPAANYVAALRYYVPKYGQDLTTRWGRGKGGYKSGGWTGNLDPAHIAGVVHGREFVVKSPFAQKNRAILEAINAGESVGDLMSGRVVAHKAPALVDAGVGSRRDGGGSKLADVVNIQGYTAEEISSEWNKYQWARTAGYGTSRNR
- Physico‐chemical
properties -
protein length: 1495 AA molecular weight: 153331,9 Da isoelectric point: 5,97 hydropathy: -0,08
Representative Protein Details
- Accession
- 75tux
- Protein name
- 75tux
- Sequence length
- 1334 AA
- Molecular weight
- 139055,97940 Da
- Isoelectric point
- 5,50317
- Sequence
-
VTSIGYATLQIIPSLDGVSGAVQKQLGFLPQMGKTAGKALGDGLASGVDDAAKRVEQATAKIEAANKKVEDSAGKVRVAEAQLQALRDKGITDVGRLAAAEEKLAAAQRNAATATKAQETASTGLERAQEKLAAARTSAANGAKQEAESTSRFGTAIGQMGEKAGGAIGNLKNLAVAAAGIGSAMEIATSAMDFESATAKMNATLGATGGLAEDYGKSAATLYGKGFGDSMGDVTKAVEAVATTMPVIGFEGEVSLDKAAERAMNLAKVFDIDVAEAVQSSEQLITNGLAKDSTQAMDLLTTAMQRVPAAMRGELPEILGEYGKHFQTFGLSGQAAMGLIVDMAPQGKIALDKTGDAIKELSIRATDGSKLTTDAFQAIKVDGDKMAKAIASGGPGAQAAMQDIAKGLLTIQDPAQRAQQAIALFGTPLEDLGVDKIPDFLTALSGAGGSMAGFEGATDELGRTLNDTAQSKLTAFGRGIQTGIVEGLGSAIGFIQENKQLLTDLGIAAGITGGALLVMAGPAVLSSIKTMITSTRLWAVAQGALNLVMSLNPLGAVVLGLTALVAGIVVAYRNSETFRNVVAGAWNWIKDAAAAVVSWFTDTAWPFLQSVWDGIGAGINGLVSVAGTAWDMFTAPARAMAEWFTGTLLPWIDRTWENFKTGLRVIVTVAQEVWDGIKEKFSGIADFVGSLPSKISEKARGMWDGIKEAFKNAINWIIRAWNGIEFKIPGFKVGPVGYDGFTLGVPDIPEFFKGGHTGPGAKYDVAGIVHADEFVLSKRARATLEGTKPGGLDFMNKTGQWPGYAEGGKVGYGLPVGTSISYGQSDKFPEWVRALEQRFGVKASTYAGHQEKDGHNKGIDWSGPVDAMQKFAEFAANAGLEQVIWQNPTTGQKIGVADGKPVGPGTDQPGYYRDDFAGHTDHVHTRQSWSWGEPPAAPAPAPQQPAADAAAQAAADDTKKPADDASKTDSKADDDKKTTPTPAAAASSSSSSSGGSYPTSISGWAGFIGEHFVGGQIKSLLSVFGIPDSPGWMKGATQLLGGIKVSDKDGKSVFDGSNPLGGLNAAIDGKPATPAKDADDKKTPAKTDDKATVMPGGALPIPGQQPAAPAAEPAKPAAVAQALVAPAADYNGGTPEIHNAVYKAFKDAGYADGQWGDMVSLINKESSWNPEARNPSSDAYGLGQFLTQGNIDKYLGGKNRDVPVDVQSKAIMQYVKDRYGDPAGALAFHQKNNWYAEGGRVKPFLYDAGGLLPQGLNLVENRSGGPEPVLTQDYWRTAKTAIDVVSSTVKGQAGGQTKQIPPVVYNIQARDTEDAFIRSQRQERERAAAKLSRF
Other Proteins in cluster: phalp2_2692
| Total (incl. this protein): 49 | Avg length: 1270,7 | Avg pI: 5,04 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 75tux | 1334 | 5,50317 |
| 1Yf2z | 1574 | 4,93888 |
| 1b1YJ | 1454 | 4,67816 |
| 1ca5J | 1173 | 4,59494 |
| 1caa9 | 1457 | 4,63649 |
| 1mUJg | 1222 | 4,48559 |
| 1mpSy | 1334 | 4,52873 |
| 3W23 | 1385 | 7,35186 |
| 4eZGb | 1109 | 4,97520 |
| 4f7ld | 1101 | 8,56993 |
| 7243v | 1293 | 4,85475 |
| 72Lzd | 1143 | 4,61825 |
| 73tdZ | 1303 | 4,84197 |
| 76rtu | 1238 | 4,57312 |
| 79J05 | 1103 | 4,87635 |
| 7AVsM | 1186 | 4,74165 |
| 7BN25 | 1609 | 5,13673 |
| 7BN3B | 1174 | 4,67151 |
| 7ehGt | 1444 | 5,64175 |
| 7lGNY | 1316 | 4,64013 |
| 7m5Aq | 1261 | 4,66747 |
| 7mBfA | 1315 | 4,43852 |
| 7o2PK | 1245 | 4,64479 |
| 7o3dM | 1245 | 4,62751 |
| 7o3gn | 1250 | 4,61762 |
| 7ocRp | 1097 | 4,38686 |
| 7p41F | 1291 | 4,63411 |
| 7pOo7 | 1384 | 7,71916 |
| 7pReH | 1385 | 7,71529 |
| 7qHAw | 1105 | 5,49084 |
| 7qJuh | 1095 | 4,66025 |
| 7qzge | 916 | 4,79678 |
| 7rr0R | 1301 | 4,75028 |
| 7ujwL | 1075 | 4,64303 |
| 7vn5D | 1188 | 4,56294 |
| 7vn8r | 1188 | 4,55760 |
| 7vngH | 1345 | 4,69310 |
| 7wgD9 | 1447 | 5,76645 |
| 7x2af | 1198 | 4,63013 |
| 7x5gh | 1432 | 4,57863 |
| 7xjon | 1188 | 4,61450 |
| 7xjrm | 1292 | 4,73613 |
| 7yPeX | 1186 | 4,55698 |
| 7yPhQ | 1188 | 4,60194 |
| 7ykjS | 1183 | 4,42289 |
| 7yyyY | 1457 | 4,59841 |
| 7zGDc | 1145 | 4,64326 |
| 8Lmei | 1416 | 5,41445 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_32224
7cxS3
|
54 | 36,1% | 915 | 3.516E-218 |
| 2 |
phalp2_38308
j3Tw
|
11 | 26,6% | 1345 | 1.163E-151 |
| 3 |
phalp2_23844
1pqSu
|
4 | 27,0% | 1517 | 4.775E-139 |
| 4 |
phalp2_12304
7qhxR
|
109 | 28,1% | 1106 | 2.255E-111 |
| 5 |
phalp2_4828
5kiSn
|
3 | 24,9% | 1216 | 1.901E-95 |
| 6 |
phalp2_7805
7vHjO
|
1 | 24,1% | 915 | 3.715E-72 |
| 7 |
phalp2_27600
5Jhdd
|
38 | 24,0% | 1014 | 4.107E-56 |
| 8 |
phalp2_23552
7r4BM
|
40 | 21,1% | 1482 | 4.560E-51 |
| 9 |
phalp2_32490
1ep70
|
5 | 20,6% | 1143 | 1.506E-45 |
| 10 |
phalp2_381
7hkjr
|
14 | 21,9% | 1477 | 1.041E-44 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Gordonia phage Gustav [NCBI] |
2047872 | Gustavvirus > Gustavvirus gustav |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MG198784
[NCBI]
CDS location
range 9814 -> 14301
strand +
strand +
CDS
GTGGCGGATAGCGCGAACGTGGGATATGCCTCCCTCCAGGTGATCCCCACGGTGGCCGGAATCTCCGGCAACCTCACCCGCCAGTTGGGCCTCCCCTTCCAGCGCGCCGGACAACAGGCCGGACAGAACGCTGGCCGCGCCATTGCGGACGGTGTGGAGGCCGCGGCGGCGCGAGTTCGCGCGGCGTCGGACAAGGTCGCCAAGGCCCGCGAGACCGAGATTGCGGCCTCGGCCAAGGTCGGGATTGCCGAGGCCAAGCTCCAGGAACTCCGAGACCGCGGGATCACCGGCGGCTCCCGCTGGCTCCGCGCCGAGGAGGATCTCCGGCGCGCTCGGGCCGCGTCGTCCACCGCGACGGACCGCGCTCGGCGCGCGGCCGAGGATCTCACCCAGGCCCAGCAGGACGCGGCCAACGCCACCGACGACGTAACCGACTCCGCCGAGGGTGGAGGCTCGGCGCTCGGCGGCCTCGGTGACAAGGTCAAGGACCTGGTGGGTAACTTCAAGGTGGCCGCGCTCGGCGCGGCTGGCCTTGGCGCGGCAATCACCGCGGGCATTGCCAGCAACATGGACACCGAGGTCATCAACGACAAGCTGGCCGCGTCCCTCGGGGCCGGACCGGCGGAGGCCGCTCACTACGGCAAGGTGACCGCGGACATCTACCGGGCCAACTTCGGTGACTCCATGGAGCAGGTCTCCGACTCGGTCGGGATCGTTGCCTCCAGTTTCAAGAACCTGGCCGAGAGCGGCAAGGGGAGCCTCACCGGCGTCACCGAGACCGCGCTCAATTTCTCCAAGACCTTCGGCACCGAGGTGGAGGAGTCGGTCCAGACCGTCTCCACCCTCATGTCCACCGGCCTGGTCAAGGACGCCAACCAAGGTTTCGACCTCATGACCACGGCGTTCCAGAAGGTCCCCGCGGCCATGCGGGGGGAACTCCCCGAGATCCTCCACGAATACGGAACGAACTTCCAGGCCCTCGGGTTCTCCGGTGAGCAAGCGTTCTCCATGCTGGTGACCGCGGCCGGTAAGGGCAAGTTCGCGCTGGACAAGACGGGTGACGCTCTCAAGGAGTTCACCATCCGCGGCTCCGACATGTCCAAGGCCTCCACCGACGCGTACAAGGCTATCGGGCTGGACGCCGAGGAGATGTCCCGCAAGGTGGCCAGCGGCGGCCCCGCGGCCCAGGAGGCCCTCCAGGCCACCGCCAAGGGCCTCCTGGGTATCCAGGACCCGGCCACGCGCGCCAACACCGCCATTGCCCTGTTCGGTACTCCGCTGGAGGACCTCTCGGTTGACCAGATCCCGGCGTTCCTGGAGGGCCTCACCGGCGGCCAGAACAACATGGCCGGGTTCGCGGGCTCGGCCGAGCGCATGGGTAACACCCTCAACGACAACACCACCGCCAAGGTGGAGGCCTTCAAGCGGTCCCTCCAGGGTGGGCTCAACGACGCCATAGGCCAGGGTGTCACCGGCCTCATGAACTTCACCGGCTGGGTCAAGGACAACGGCGTGGCCGTGGGGATCACCGCCGGAATCCTCGGCGTCGTGCTGGCCCCAGCCCTCATCACGGCCGCGGTCGGCTGGACCCAGACCGGGATCGCGGCCACCCGCTCGGCGGCGGCCCAGTTGGCGGCCTCCTACCGCACGGTGGGCGGCTGGATCGCCATGTCCGCGTCCGCGGTGGCCAACGCGGCCACCACGTCCGGCGCGTGGATCGCCTCCGGCGCGCGGGCCGCTGGCGCGTGGATCGCGCTCAAGGCCCAGGCGGTCGGTTCGTTCATCGCCACGGCGGCCGGAGCGACGGCCAACGCCGCGGTCACCACCGGGGCCTGGGTGGCCAGCGGTGCTCGGTCGGCCGGAGCATGGGCCGCCATGAAGCTGGCCGCGGTCGGTTCGTTCATCGCCACCGGCGCGAGCGCGGTGGTCCAGGCCGCAATCACCGCCGGGGCCTGGGTGGCCAGCCAGGGACGTATGGCCCTGTCCATCGGTATCGCCACCGCGGCGTTCATCGCCCAGAAGGTGGCCACCGTCGCGGGCACCGTCGCCACCGGCGCGCTCACCGTGGCCCAGTGGGCACTCAACGCGGCCATGTCGGCCAACCCCATCGGCCTGGTGATCGCCATTGTGGTGGCCCTGGTGGCCGCCATTGTGCTGGCCTACCGCAACTCCGAGACGTTCCGCAACATCGTGCAAGCGGCCTGGCAGGGTATCCAGGTGGCGGCGTCGTTCGCCTGGAACTCCATCCTCAAGCCCGCGCTGGACGGGTTCCTGGCGGCCCTCGGCTGGGTCGGTGACAAGGCCATGTGGCTCTGGCAAAACGTCATGATCCCGGCCTGGGACGCAATCAAGAACGCTATCTCCGCGGTGTGGAACTTCATCCGTCCGATTCTCGACAACATCGGCAAGGGTATCCACGCGGTGGGCGAGATCGCGGCCAAGGTCGGGGACGCCATGCGTAACGCCTTCAACGGCGTGGTGGACGTGCTCAAGACCCCGATCCACGCGGTGGGTAAGCTCCTGGCCTCCATCCCGGACAAGGTCCTCGGGATCTCCATTCCTGGGGCCTCCACCATCAAGTCCTGGGGTGAGACCCTCCAGTCCCTCCGGGTCGGCGGTGTCGTCAACAACGGCATGGCCGGACGTACCCGTAACGGCGTGTTCTGGGGACCCGGCTCGGGCACCGATGACGCAATCCTCGGCGTGGACGCCTACGGTATGCCCACCGCGCTGGTGTCTAACAAGGAGGGAGTGGTCACCGCCGACGCCATGGGCCACGGTGGCGCTGGCGTCGTCGCCGCGCTCAATAGCGGCTGGCGGCCCAACGCGGCCCAGCTCCAGGCCCTTGGCCTCCCCGCGTTCGCGGACGGCGGCGTGGTCGGTGAGCCCTACGGCCTCCCGACCGGGTCGAACATCTCCTACGGCGCGCCGGGGTTCCCGGACTGGGTCACCAAGCTGGGAGACCAGCACAAGGTCAAGCCCTCCACGTACGCCGGACACCAGGAGTCGGACCGCAATGAGGCCGGTTACGCTCCCAACCCGGCCCACCAGAACCGCGGCATTGACTGGAGCGGCCCCGTGGAGGCCATGGACGCGTTCGCGCGCTACCTCCTCGGGATCGCGCCGGACACTCCGGCCCTGGAACAGATCATCTGGCAGAATCCGGGCACCGGCGAAAAGGTCGGTTGGCACGGCCGTACGCCGGACGCCGGGTTCTCCTACTTCGCGGCCGACTACGGCGGCCACACCGACCACGTCCACACCCGCCAGTCGGCGGCGTTCGGTGGCGCGGCCAAGCCCAAGATCCCGGACACCAACACCCAGGTCCCCGGATACGTCCCGCCGACCGGGCTCAACCCGGACGGGACCACGCCGGACATCAACTCCCTCGGCACGAACAACACCGCGCCGACGACGCCGAGCACCACTCCCCAGGCCGAGACCAAGCGACTCAAGACCTTCAAGGAGCTGGGCTCGGATCTCGGTGGAATCCTGTTCGGCGGCGTAGAGGAGTTCTTTGGCGACTCGGTCCCGGCGTGGGTCTGGGACGCCAACAAGCTCACCGAGGGAGCCGACACCGGCGACAACGTCCGCACGTCGGACAACAAGGGCACCACCGGCAACACCACGACGCCGAGCACCACGACACCGACACCGGGCACCACCGGCGACCAGACCGCGCCGAGCACCACGGGCGGGTTCGCGCCGGACACGCCGGAGACGCTCAAGGCCGCCGAGGACTCCACCAAGAACAAGACCCCGGTGGGTGTCGCCGGTGCTCCGGTCATCAAGTACAACCCCGGAGCGGGCGCGGAACAGTGGACCCCGTTGGCGGAGTGGGCAATCAACTACGTCAACAAGTCGCTCAAGGGACCGGCCCAGACGGCCGCCATGATCGGCCAGACCAGCGACGAGTCCGGCGGTAACCCGCGCGCGGTGAACAACTCGGACATCAACGCACAGAACGGAACACCGTCCGGCGGGTTGCTCCAGGTGATTGAGCCCACGTTCCAGGCCAACCGAGATCCTCGGTTGCCCAACGACAAGTTCGACCCGGCCGCCAACTATGTGGCCGCGTTGCGGTACTACGTGCCCAAGTACGGCCAGGACCTCACCACGCGGTGGGGCCGCGGCAAGGGTGGGTACAAGTCCGGCGGGTGGACCGGGAACCTGGACCCGGCGCACATCGCGGGAGTTGTCCACGGCCGGGAGTTCGTGGTCAAGTCCCCGTTCGCCCAGAAGAACCGCGCGATACTGGAGGCAATCAACGCCGGTGAGTCGGTCGGTGACCTCATGAGCGGCCGGGTGGTGGCTCACAAGGCCCCGGCCTTGGTGGACGCCGGTGTCGGATCTCGGCGCGACGGCGGCGGGTCCAAGCTGGCCGACGTGGTCAACATCCAGGGGTACACCGCCGAGGAGATCTCCTCGGAGTGGAACAAGTACCAGTGGGCCCGGACCGCTGGATACGGCACGAGTAGGAACAGGTGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
| GO:0098003 | viral tail assembly | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000ca33976_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(75tux)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50