Protein
- Protein accession
- A0A515MKC0 [UniProt]
- Representative
- 3bZcI
- Source
- UniProt (cluster: phalp2_39181)
- Protein name
- Tape measure protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MPAMQGAETYINVLPSMNGYFKRVNAAVKRNKVTQHVDVELDQRQLKKAQADLDKASKSAADARRRESAATRESVAAERELQALRNKGVTDASRLAAAEDRVAKAKANSRAAANALNAAESKRSTAGARVTRIEAKLDSRRADADSETFLQRITSKFERGGQAAGAKFAGGVSAAMNSHSRGNDEGRSFAAGFVSAIGSGMRMAAVGFTVVNDAARSVIRNVGMVATGVGLAARALKVFSAGLLVSSSLLRVMTGAGIARLAGVLRLAAAAAGILARDIARVTSALLLMAAAARLVGILTRVGRALGMVTVGSAVALGAMSALGSVVSSFATGPLVAGLTAVAGAMGTVAAAAAGILGPAIGVAKMAFAGLSAGAKAWDKSQTAVGASATKGASALKAIESAKKSQVRTAEQGARQIVSAEKQVVKAQKDVKEAQDDVNKARQEAKRDAAGYARTLAGLALDEEAAELALSEAKKTLRETKVDPDADADDMWRANLGVREAKQALEEIKASNEQQRGEIADAQAKGIEGSDRVVEAKQREVDAQEQLQEAQADLAQTQKDVAQANIDAAEAVADAYESMAESQQSAASGDDPFAAMIGQRLAPLLQALKNLREEITDRFSGAMSGAFVKLGGLLDHLTPSLGGLATTLGNLGTQIVSSISSPAALAGWDRMIDGSNRFFQSLSQGENGIGSVFSGLIQVLGTAAQTFADSGAGINAWLLDLGEKLRNISADDLRGTFDSVRQIFQNISVIAGPLFDLFRGFGAEAASGLAPGFSAMGQAIQDSIPGLMDMARELMPALGQALVNLAPVLPGLVDAFSPWADILAILAPHLATIIEKLVPLAPILLGVVTAVKLISVAVTVWNGIMFAASVAQGVFAAATGRSVATLGANTIALVAHRVALIAGAVASNIAAIGMRAFGVALRFATGPIGLIIMAVAALVAGLVWFFTKTELGKKIWDKVWNGIKDAMAKAWEFLKGVFTKIGEIFSTVFNAIADIFKWVWESIIQPIFTGLKTAIAVVIVVFLLWWEGVKLYIELIGNIISWLWTSVAQPIWELMKAGLRMLGDFFAWVWNTLIKPAWDGLAAGISWAWENIIRPAWDALKVALGAVGDFFGWVWNTIIKPAWDGLSAGIAWAWENVIRPAWDALKAALGVVGDFFTSVWETVIRPVWDAFGAGIKAVWENVIRPAWDALKEGLGKVRDFFSEVVGGIGKKWDELKGLLAKPINFMIDTVWNGGILEAWNKAAGLLGLGKAEKLATIPEHATGGQIKGPGTGTSDDVLMWGSNSEHMMTAKEVERAGGHNAVYAIRDMIMRGIPFTWDGGNLIREMGRDNLNAYGAQVAQKGLGNVDPQGMFDWLLPKYKDGGEIRAGAPWEKALENGHRAAKMRNGNPYTWGFEDCSGYMSMIADAILNGGDGVRRWATGSFPGGQPFVPGLGKGFSVGVHDNPGGPGGGHTAGTLTGVGPYATVNVESGGSHGNVAYGGPAVGADSPQFAGKSPGVFHLAIGSDGSFESAGPGGGGGPTPQQKEQFLQKKIAEIFDKVLNPIKDVMGSVIGTPPPESLGIPPGFLDKGRDMTSGFLADKVLGLGEMLSTAWDKAKDIGSVLTFGLLRDQGGFIPNGLSIVRNETGKPEAVLNWDQLQLVRDILGRIGLGNTDKVAEPGQDPGPVDWGGVGAQIGTSLLAEWGNDFLGMVGIGKQFEGMKLVDEYGRRSDQAGERNEASFSDSPSTAESASTSPTYGDPNAQLTQQSVELSPMPNLDGPSSSGASGTVVDKVKAAFKPYGWDTGEQWAAADWIIGKESSWNPLARNPSSGAFSLFQFLGSTKDQYLPDENPDPYIQGQAGAKYIKDRYGDPMAAKSFWEKNNWYDQGGLAFGKGFMLKNVIHPERVLSPRQTEAFEELAPMLNRLQLATTAPGDVMPESARSALELAPMRGGPTYNITGRVDRETMSEVGIHERQSSRTYGSRTR
- Physico‐chemical
properties -
protein length: 1966 AA molecular weight: 207292,7 Da isoelectric point: 6,27 hydropathy: -0,01
Representative Protein Details
- Accession
- 3bZcI
- Protein name
- 3bZcI
- Sequence length
- 1520 AA
- Molecular weight
- 154769,37050 Da
- Isoelectric point
- 8,75160
- Sequence
-
MATAYATLQVIPTVRGITGRIERQIAAPLTAAGQKAGRDTGKAITSGVGSANYESTGRSAGAKISRGLTTAGHKAGRDTGKAITSGVGAANYESAGRSAGARVARGLADSGQRAGRDTGREITSGVGAANYEAAGRSAGVRVARGVTGHGRAAGREVGREINAGVREEDYESTGRSIASRIVSGLTTGLRGVQTSARVINSGFEAATRGLSLFVVNASTIATGFGIAARMVKSFSAATFVSALALQQVASVGLTKLAGALRLIAAIASRVAREVGQVTAAFLVLQGVVRLAGAMNSFASGLAKITVGASIAIGVVSGLGVAFASLAATIGAAAGAAAGAAAGILGPAVAALKVGMSGLSEAKKAFETPPSGDGGASQAKAVASAAKSLADAEKGVVRAKEDALEAEEDLTQAREDAQQQIEDVNRALRDNRLNEREAAREVRKAREALAETLRDPKASADDREEAADRVEAAELRLIETQERSREEEQKAAKANRAGIEGAEQVVAAKERVADANEAIVEAQERATEAAQALVEAQNQSASGGSTVDPFYAMIGERMAPMLTAFDNLKRTVTDDLSSALIPAFANIGTLADTVSPKISALAGVFGRIGTEVSKSLAGPTGVAAFDQMAAASNTFFTALSAGENGLGGFTLGLSQFAATAATTVSGSGGGLNSLLLSLGDKLRNISAEQITAAFDRMQQTFSNIGAVVGPLFNLFSTLGGISAKALGPGFSAVGAAITEATPGLARMAEILMPALSQVMERLAPLIPSLVEAFTPWASTLAAIAPPLATIVSHMAPLAPYLLMAATAFKVAGAVMLLWNAGAFAGAVAQGVFAAATGRSAMTLTGNTIALAAHRIALIAGAVAARAFGIAMAFATGPIGLIVMAVAAVGVALWAFFTKTEVGKRWWEAIWGAIKTAVSATWEFLKVAWDWILVAIQWIGDKAMWLWNSAIKPVFSAIGSLIAKWWTGIVQPAFEGVKTAFGIVGDVISWWWNSIVSPAFSAVGAIFSWWWNDLVSPIFNNVMTIFGKVGDVISWWWNNIVGPAFDAAKTAVGVLGDAFTWWWNSAVTPAFDGVKSVVGKWWDYAKGVFDLIKGGIGKVGEAFEAAGRVMSSIWDKVLDAMRPALHAIGRALSAVPTSIGPIDIPGAAVAVALGDKLQGFRRGGLASGAGTGTSDSIVARLSNREFVVNALATGKTLPLLEAINSGWTPPPWLLDLMVNGLPKFATGGLVDTQNWLRGEAGKPYQWGGTGNPSWDCSGIAGGAWAKATGKSPRNRYFTTGSDFAGLGWQPGPGGANDLTIGTNGLGGSSGHMSGRIGDLKFESSGTDGVEVGADAQDPSNFAKQWHWPLGGNPLNSGDLGSGGAAGGTGTGGLGRSGAGAGVGTTGGASSAGSTSRPSGTAVPVWLDNWHEMPGTAAAPSASAAAASTTTSSDSAGVSTTADNTDGAFDQSAAIAAAFEKFGGAMAGAGGEFLKGQKSAIPGIGGYVEGIEKTVSNVSIVVADVYEAMGAMTREQKRQTVGK
Other Proteins in cluster: phalp2_39181
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_3994
7AFQR
|
49 | 35,4% | 1206 | 5.980E-259 |
| 2 |
phalp2_20938
7fSA6
|
1 | 30,5% | 1462 | 6.592E-199 |
| 3 |
phalp2_28099
7ucAa
|
137 | 25,6% | 1114 | 9.152E-92 |
| 4 |
phalp2_24330
3Pu9x
|
29 | 27,2% | 1049 | 8.731E-87 |
| 5 |
phalp2_36889
7ukoa
|
30 | 26,9% | 1069 | 4.031E-84 |
| 6 |
phalp2_9474
HzvZ
|
51 | 22,8% | 1062 | 2.611E-48 |
| 7 |
phalp2_13411
4v9Zc
|
2 | 21,8% | 1212 | 5.413E-47 |
| 8 |
phalp2_22507
1fUaK
|
1 | 22,8% | 1137 | 4.908E-46 |
| 9 |
phalp2_6456
hsjD
|
15 | 21,3% | 1206 | 8.346E-43 |
| 10 |
phalp2_34982
4HM46
|
2 | 22,2% | 1191 | 7.328E-28 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Rhodococcus phage Whack [NCBI] |
2591132 | Whackvirus > Whackvirus whack |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MK967393
[NCBI]
CDS location
range 10349 -> 16249
strand +
strand +
CDS
ATGCCCGCTATGCAGGGTGCCGAGACGTATATCAACGTCCTGCCGTCGATGAACGGGTACTTCAAGAGGGTCAACGCAGCAGTCAAGCGCAACAAGGTGACTCAACACGTCGACGTCGAGCTCGACCAGCGCCAGCTCAAGAAGGCGCAGGCCGATCTCGACAAGGCGTCCAAAAGTGCCGCTGATGCGAGACGTCGCGAGTCTGCGGCCACCCGCGAGAGTGTCGCCGCCGAGCGTGAGCTGCAAGCCCTGCGAAACAAGGGTGTCACCGACGCGAGCCGACTTGCTGCCGCCGAGGACAGGGTCGCGAAGGCAAAGGCGAACAGCCGAGCTGCCGCGAATGCTCTGAACGCGGCCGAGTCCAAGCGCTCGACAGCCGGTGCTCGCGTCACTCGTATCGAGGCCAAGCTCGACAGCAGGCGAGCCGACGCCGACAGTGAAACATTCCTGCAGCGGATCACCTCGAAGTTCGAACGCGGCGGTCAGGCAGCGGGTGCGAAGTTCGCCGGTGGTGTCAGCGCCGCTATGAACTCGCACTCGCGCGGCAACGACGAGGGTCGCAGTTTCGCGGCCGGGTTCGTCTCCGCGATCGGCAGCGGAATGCGGATGGCTGCTGTCGGTTTCACGGTCGTCAACGATGCGGCCCGGTCCGTCATCCGCAATGTCGGAATGGTGGCCACCGGTGTCGGGCTGGCAGCCCGAGCGTTGAAGGTCTTCAGTGCAGGGCTGCTCGTCAGCTCCTCCCTGCTGCGGGTGATGACCGGCGCGGGTATCGCACGCCTGGCAGGTGTACTGCGTCTCGCCGCAGCTGCCGCCGGCATCCTCGCCCGCGACATCGCGCGCGTGACCTCCGCGCTTCTGCTCATGGCAGCAGCGGCCCGGTTGGTCGGCATCCTGACCCGCGTCGGCCGCGCCCTCGGCATGGTCACTGTCGGGTCTGCGGTCGCCCTCGGCGCCATGTCTGCGCTCGGCTCGGTCGTCTCCAGCTTCGCCACCGGCCCCCTGGTGGCAGGTCTGACGGCAGTCGCCGGGGCAATGGGCACTGTCGCCGCAGCAGCGGCAGGCATCCTCGGCCCGGCTATCGGCGTCGCGAAGATGGCATTCGCGGGTCTGTCCGCCGGGGCGAAGGCTTGGGACAAGTCTCAGACCGCGGTCGGTGCTTCGGCGACCAAGGGCGCATCAGCGCTCAAGGCGATCGAGAGTGCGAAGAAGTCGCAGGTGCGTACCGCCGAGCAGGGTGCGCGTCAGATCGTCTCCGCCGAAAAGCAGGTCGTCAAAGCGCAGAAGGACGTCAAAGAAGCCCAGGACGACGTCAACAAGGCACGGCAGGAAGCCAAGCGAGATGCCGCCGGTTATGCCCGCACGCTCGCGGGGCTTGCACTCGACGAAGAAGCCGCCGAACTCGCGCTGTCCGAAGCCAAGAAGACGTTGCGTGAGACCAAGGTCGATCCCGACGCAGATGCAGACGACATGTGGCGCGCGAACCTCGGTGTCCGCGAGGCCAAGCAGGCGCTCGAGGAGATCAAGGCGTCGAACGAACAGCAGCGCGGGGAGATCGCCGATGCTCAAGCGAAGGGCATCGAAGGATCCGACCGTGTCGTCGAAGCCAAGCAGCGCGAAGTCGATGCGCAGGAGCAACTGCAGGAGGCTCAGGCTGATCTCGCGCAGACGCAGAAGGATGTGGCCCAGGCCAACATCGATGCCGCTGAAGCGGTTGCGGACGCCTACGAGAGCATGGCCGAATCGCAGCAATCGGCCGCATCCGGGGACGATCCCTTCGCCGCGATGATCGGGCAACGCCTCGCACCACTGCTCCAGGCACTGAAGAACCTGAGAGAGGAGATCACCGACCGATTCAGTGGGGCGATGTCGGGTGCGTTCGTCAAACTCGGCGGACTGCTCGATCATCTGACGCCGAGCCTCGGCGGGCTTGCCACCACACTCGGCAACCTCGGGACGCAGATCGTCTCGTCGATTTCGAGTCCGGCAGCGTTGGCGGGCTGGGATCGGATGATCGACGGATCGAACCGGTTCTTCCAGAGCCTCTCGCAGGGCGAAAACGGTATCGGTTCTGTCTTTTCCGGACTGATCCAGGTCCTGGGTACGGCAGCGCAGACCTTCGCGGACAGCGGTGCCGGCATCAACGCGTGGCTACTCGATCTCGGTGAGAAGCTGCGCAATATCAGCGCTGATGATCTTCGTGGCACGTTCGACAGTGTCCGGCAGATCTTCCAGAACATCAGCGTGATCGCAGGACCACTGTTCGATCTCTTCCGAGGATTCGGCGCGGAAGCGGCATCCGGTCTCGCACCAGGGTTCTCGGCGATGGGTCAAGCGATTCAGGACTCGATTCCCGGCCTGATGGATATGGCACGCGAGCTGATGCCCGCACTCGGTCAGGCGCTCGTCAACCTCGCCCCGGTTCTTCCGGGTTTGGTGGATGCGTTCTCACCGTGGGCAGACATCCTCGCGATACTCGCGCCCCACTTGGCAACGATCATCGAGAAGCTGGTGCCCTTGGCGCCGATACTTCTCGGCGTCGTCACCGCTGTGAAACTCATCAGCGTCGCGGTCACAGTGTGGAACGGCATCATGTTCGCCGCATCCGTCGCACAGGGCGTCTTTGCCGCAGCAACCGGACGATCCGTTGCGACATTGGGCGCCAACACGATCGCGTTGGTAGCGCACCGAGTAGCACTGATCGCTGGAGCTGTCGCATCGAACATCGCTGCGATCGGCATGCGAGCTTTCGGTGTAGCGCTTCGCTTCGCCACAGGGCCAATCGGCCTGATCATCATGGCAGTCGCCGCTCTGGTCGCTGGGCTCGTATGGTTCTTCACGAAAACTGAACTCGGGAAGAAGATCTGGGACAAGGTCTGGAACGGCATCAAGGACGCCATGGCCAAGGCCTGGGAGTTCCTCAAGGGAGTTTTCACCAAGATCGGTGAGATCTTCTCCACCGTCTTCAACGCCATCGCCGACATCTTCAAATGGGTGTGGGAGTCGATCATCCAACCGATCTTCACCGGACTGAAAACCGCTATCGCCGTAGTGATCGTGGTGTTCCTGCTCTGGTGGGAGGGCGTCAAGCTCTACATCGAACTGATCGGCAACATCATCTCGTGGCTGTGGACTTCGGTCGCGCAACCGATCTGGGAACTGATGAAGGCCGGTCTGCGGATGCTCGGCGACTTCTTCGCCTGGGTGTGGAACACGCTGATCAAACCGGCGTGGGACGGACTCGCGGCCGGGATCTCGTGGGCGTGGGAGAACATCATTCGCCCCGCATGGGACGCCCTCAAGGTCGCTCTCGGTGCTGTTGGCGACTTCTTCGGATGGGTCTGGAACACGATCATCAAGCCCGCCTGGGATGGACTGTCCGCTGGCATCGCATGGGCCTGGGAGAACGTAATCCGGCCTGCCTGGGATGCGCTCAAGGCCGCTCTCGGCGTAGTGGGTGACTTCTTCACCTCGGTGTGGGAGACCGTAATCCGCCCGGTCTGGGACGCATTCGGCGCTGGCATCAAGGCCGTGTGGGAGAACGTCATTCGACCGGCCTGGGATGCACTCAAAGAGGGCCTCGGCAAGGTCCGAGATTTCTTCTCCGAGGTAGTCGGCGGGATCGGCAAGAAGTGGGACGAACTGAAGGGACTGCTCGCCAAGCCGATCAACTTCATGATCGACACGGTTTGGAATGGCGGCATCCTCGAGGCCTGGAACAAGGCAGCTGGATTGCTCGGACTCGGGAAGGCCGAGAAGCTGGCCACGATTCCCGAGCATGCAACCGGCGGTCAAATCAAGGGGCCGGGCACGGGAACCTCCGACGACGTCCTGATGTGGGGCTCGAACAGCGAGCACATGATGACCGCGAAGGAGGTCGAGCGCGCAGGCGGACACAATGCCGTCTACGCAATCCGCGACATGATCATGCGCGGCATTCCGTTCACGTGGGACGGCGGCAACCTCATCCGTGAGATGGGCCGAGACAACCTCAACGCCTACGGCGCTCAGGTTGCCCAGAAGGGCCTCGGCAACGTAGACCCGCAGGGAATGTTCGACTGGCTCCTCCCGAAGTACAAGGACGGCGGCGAGATCCGCGCCGGCGCCCCGTGGGAGAAGGCACTCGAGAACGGTCATCGCGCGGCGAAGATGCGCAACGGCAATCCGTACACATGGGGCTTCGAGGACTGCTCGGGCTACATGTCGATGATCGCCGACGCCATCCTCAACGGCGGTGACGGTGTACGCCGCTGGGCAACAGGATCCTTCCCCGGCGGCCAGCCGTTCGTGCCCGGCTTGGGTAAGGGCTTCTCGGTCGGCGTTCACGACAACCCGGGCGGCCCCGGTGGCGGACACACCGCCGGCACGCTCACCGGTGTCGGCCCGTACGCGACGGTCAATGTCGAGTCGGGTGGATCTCACGGCAATGTCGCGTACGGCGGCCCTGCCGTCGGGGCGGACTCGCCGCAGTTCGCCGGAAAGTCCCCCGGCGTCTTCCACCTGGCTATCGGATCGGACGGCTCGTTCGAGTCCGCGGGGCCTGGTGGTGGGGGTGGTCCGACTCCGCAACAGAAGGAGCAGTTCCTACAGAAGAAGATCGCCGAGATCTTCGACAAGGTGCTCAACCCGATCAAGGACGTCATGGGTTCCGTCATCGGCACCCCGCCGCCCGAATCGTTGGGTATCCCGCCCGGATTCCTGGACAAGGGCCGCGACATGACGTCCGGATTCCTGGCCGACAAGGTTCTCGGGCTCGGCGAGATGCTCTCGACTGCATGGGACAAGGCCAAGGACATCGGCAGCGTCCTCACCTTCGGACTGTTGCGCGATCAGGGTGGCTTCATCCCGAACGGACTGTCGATCGTGCGTAACGAGACGGGCAAGCCAGAGGCGGTGTTGAACTGGGATCAGCTACAGCTGGTCCGCGACATCCTCGGCCGCATCGGTCTGGGCAACACCGACAAGGTTGCCGAGCCTGGTCAAGATCCTGGTCCGGTCGATTGGGGCGGCGTCGGTGCCCAGATCGGGACTTCACTTCTCGCCGAGTGGGGCAACGACTTCCTTGGCATGGTGGGGATCGGCAAGCAGTTCGAGGGGATGAAACTCGTCGACGAGTACGGGCGCCGGTCCGACCAGGCCGGGGAACGTAACGAGGCCAGTTTCTCTGATTCGCCCTCGACAGCCGAGAGTGCATCGACGTCGCCGACCTACGGCGATCCGAATGCGCAATTGACACAGCAGTCGGTGGAGTTGTCGCCGATGCCGAACCTCGATGGCCCGTCCAGCAGTGGGGCTTCCGGAACGGTCGTGGACAAGGTCAAGGCTGCGTTCAAGCCGTACGGCTGGGACACCGGTGAGCAGTGGGCCGCTGCTGACTGGATCATCGGCAAGGAATCGAGCTGGAACCCGCTCGCACGCAACCCATCCTCGGGTGCGTTCAGCCTGTTCCAGTTCCTCGGGTCCACCAAGGATCAGTACCTTCCGGATGAGAATCCGGATCCGTACATCCAGGGACAGGCCGGTGCGAAGTACATCAAGGACCGCTACGGCGATCCGATGGCGGCGAAGTCGTTCTGGGAGAAGAACAACTGGTACGACCAGGGCGGACTCGCATTCGGTAAGGGCTTCATGCTCAAGAACGTCATCCATCCCGAGCGAGTCCTCTCGCCTCGGCAGACGGAGGCATTCGAAGAACTGGCCCCGATGCTCAACCGACTGCAACTCGCAACCACCGCACCGGGTGACGTCATGCCCGAATCGGCGCGCAGTGCACTCGAATTGGCACCGATGCGCGGCGGTCCTACCTACAACATCACGGGAAGGGTGGATCGAGAAACCATGAGCGAAGTAGGTATTCACGAGCGCCAGTCCTCGCGTACGTACGGATCGAGGACCCGATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(3bZcI)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50