Protein

Protein accession
A0A515MKC0 [UniProt]
Representative
3bZcI
Source
UniProt (cluster: phalp2_39181)
Protein name
Tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MPAMQGAETYINVLPSMNGYFKRVNAAVKRNKVTQHVDVELDQRQLKKAQADLDKASKSAADARRRESAATRESVAAERELQALRNKGVTDASRLAAAEDRVAKAKANSRAAANALNAAESKRSTAGARVTRIEAKLDSRRADADSETFLQRITSKFERGGQAAGAKFAGGVSAAMNSHSRGNDEGRSFAAGFVSAIGSGMRMAAVGFTVVNDAARSVIRNVGMVATGVGLAARALKVFSAGLLVSSSLLRVMTGAGIARLAGVLRLAAAAAGILARDIARVTSALLLMAAAARLVGILTRVGRALGMVTVGSAVALGAMSALGSVVSSFATGPLVAGLTAVAGAMGTVAAAAAGILGPAIGVAKMAFAGLSAGAKAWDKSQTAVGASATKGASALKAIESAKKSQVRTAEQGARQIVSAEKQVVKAQKDVKEAQDDVNKARQEAKRDAAGYARTLAGLALDEEAAELALSEAKKTLRETKVDPDADADDMWRANLGVREAKQALEEIKASNEQQRGEIADAQAKGIEGSDRVVEAKQREVDAQEQLQEAQADLAQTQKDVAQANIDAAEAVADAYESMAESQQSAASGDDPFAAMIGQRLAPLLQALKNLREEITDRFSGAMSGAFVKLGGLLDHLTPSLGGLATTLGNLGTQIVSSISSPAALAGWDRMIDGSNRFFQSLSQGENGIGSVFSGLIQVLGTAAQTFADSGAGINAWLLDLGEKLRNISADDLRGTFDSVRQIFQNISVIAGPLFDLFRGFGAEAASGLAPGFSAMGQAIQDSIPGLMDMARELMPALGQALVNLAPVLPGLVDAFSPWADILAILAPHLATIIEKLVPLAPILLGVVTAVKLISVAVTVWNGIMFAASVAQGVFAAATGRSVATLGANTIALVAHRVALIAGAVASNIAAIGMRAFGVALRFATGPIGLIIMAVAALVAGLVWFFTKTELGKKIWDKVWNGIKDAMAKAWEFLKGVFTKIGEIFSTVFNAIADIFKWVWESIIQPIFTGLKTAIAVVIVVFLLWWEGVKLYIELIGNIISWLWTSVAQPIWELMKAGLRMLGDFFAWVWNTLIKPAWDGLAAGISWAWENIIRPAWDALKVALGAVGDFFGWVWNTIIKPAWDGLSAGIAWAWENVIRPAWDALKAALGVVGDFFTSVWETVIRPVWDAFGAGIKAVWENVIRPAWDALKEGLGKVRDFFSEVVGGIGKKWDELKGLLAKPINFMIDTVWNGGILEAWNKAAGLLGLGKAEKLATIPEHATGGQIKGPGTGTSDDVLMWGSNSEHMMTAKEVERAGGHNAVYAIRDMIMRGIPFTWDGGNLIREMGRDNLNAYGAQVAQKGLGNVDPQGMFDWLLPKYKDGGEIRAGAPWEKALENGHRAAKMRNGNPYTWGFEDCSGYMSMIADAILNGGDGVRRWATGSFPGGQPFVPGLGKGFSVGVHDNPGGPGGGHTAGTLTGVGPYATVNVESGGSHGNVAYGGPAVGADSPQFAGKSPGVFHLAIGSDGSFESAGPGGGGGPTPQQKEQFLQKKIAEIFDKVLNPIKDVMGSVIGTPPPESLGIPPGFLDKGRDMTSGFLADKVLGLGEMLSTAWDKAKDIGSVLTFGLLRDQGGFIPNGLSIVRNETGKPEAVLNWDQLQLVRDILGRIGLGNTDKVAEPGQDPGPVDWGGVGAQIGTSLLAEWGNDFLGMVGIGKQFEGMKLVDEYGRRSDQAGERNEASFSDSPSTAESASTSPTYGDPNAQLTQQSVELSPMPNLDGPSSSGASGTVVDKVKAAFKPYGWDTGEQWAAADWIIGKESSWNPLARNPSSGAFSLFQFLGSTKDQYLPDENPDPYIQGQAGAKYIKDRYGDPMAAKSFWEKNNWYDQGGLAFGKGFMLKNVIHPERVLSPRQTEAFEELAPMLNRLQLATTAPGDVMPESARSALELAPMRGGPTYNITGRVDRETMSEVGIHERQSSRTYGSRTR
Physico-chemical properties
protein length: 1966 AA
molecular weight: 207292,7 Da
isoelectric point: 6,27
hydropathy: -0,01
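The length and hydropathy fields can in principle be recomputed from the sequence. The sketch below assumes that "hydropathy" is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues); the page does not name the scale, so treat that as an assumption. The helper names `clean` and `gravy` are illustrative only, and the example peptide is just the first ten residues of the sequence above.

```python
# Kyte-Doolittle hydropathy index per residue (standard published scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def clean(seq: str) -> str:
    """Strip whitespace and upper-case a pasted sequence."""
    return "".join(seq.split()).upper()


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY).

    Assumption: this is the definition behind the 'hydropathy' field.
    """
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in s) / len(s)


# First ten residues of the record sequence, used only to show the call;
# paste the full sequence in its place to compare with the reported values.
peptide = "MPAMQGAETY"
print(len(clean(peptide)), round(gravy(peptide), 2))
```

Molecular weight and isoelectric point depend on residue mass and pKa tables that differ slightly between tools, so small deviations from the values listed here would not be surprising.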
Representative Protein Details
Accession
3bZcI
Protein name
3bZcI
Sequence length
1520 AA
Molecular weight
154769,37050 Da
Isoelectric point
8,75160
Sequence
MATAYATLQVIPTVRGITGRIERQIAAPLTAAGQKAGRDTGKAITSGVGSANYESTGRSAGAKISRGLTTAGHKAGRDTGKAITSGVGAANYESAGRSAGARVARGLADSGQRAGRDTGREITSGVGAANYEAAGRSAGVRVARGVTGHGRAAGREVGREINAGVREEDYESTGRSIASRIVSGLTTGLRGVQTSARVINSGFEAATRGLSLFVVNASTIATGFGIAARMVKSFSAATFVSALALQQVASVGLTKLAGALRLIAAIASRVAREVGQVTAAFLVLQGVVRLAGAMNSFASGLAKITVGASIAIGVVSGLGVAFASLAATIGAAAGAAAGAAAGILGPAVAALKVGMSGLSEAKKAFETPPSGDGGASQAKAVASAAKSLADAEKGVVRAKEDALEAEEDLTQAREDAQQQIEDVNRALRDNRLNEREAAREVRKAREALAETLRDPKASADDREEAADRVEAAELRLIETQERSREEEQKAAKANRAGIEGAEQVVAAKERVADANEAIVEAQERATEAAQALVEAQNQSASGGSTVDPFYAMIGERMAPMLTAFDNLKRTVTDDLSSALIPAFANIGTLADTVSPKISALAGVFGRIGTEVSKSLAGPTGVAAFDQMAAASNTFFTALSAGENGLGGFTLGLSQFAATAATTVSGSGGGLNSLLLSLGDKLRNISAEQITAAFDRMQQTFSNIGAVVGPLFNLFSTLGGISAKALGPGFSAVGAAITEATPGLARMAEILMPALSQVMERLAPLIPSLVEAFTPWASTLAAIAPPLATIVSHMAPLAPYLLMAATAFKVAGAVMLLWNAGAFAGAVAQGVFAAATGRSAMTLTGNTIALAAHRIALIAGAVAARAFGIAMAFATGPIGLIVMAVAAVGVALWAFFTKTEVGKRWWEAIWGAIKTAVSATWEFLKVAWDWILVAIQWIGDKAMWLWNSAIKPVFSAIGSLIAKWWTGIVQPAFEGVKTAFGIVGDVISWWWNSIVSPAFSAVGAIFSWWWNDLVSPIFNNVMTIFGKVGDVISWWWNNIVGPAFDAAKTAVGVLGDAFTWWWNSAVTPAFDGVKSVVGKWWDYAKGVFDLIKGGIGKVGEAFEAAGRVMSSIWDKVLDAMRPALHAIGRALSAVPTSIGPIDIPGAAVAVALGDKLQGFRRGGLASGAGTGTSDSIVARLSNREFVVNALATGKTLPLLEAINSGWTPPPWLLDLMVNGLPKFATGGLVDTQNWLRGEAGKPYQWGGTGNPSWDCSGIAGGAWAKATGKSPRNRYFTTGSDFAGLGWQPGPGGANDLTIGTNGLGGSSGHMSGRIGDLKFESSGTDGVEVGADAQDPSNFAKQWHWPLGGNPLNSGDLGSGGAAGGTGTGGLGRSGAGAGVGTTGGASSAGSTSRPSGTAVPVWLDNWHEMPGTAAAPSASAAAASTTTSSDSAGVSTTADNTDGAFDQSAAIAAAFEKFGGAMAGAGGEFLKGQKSAIPGIGGYVEGIEKTVSNVSIVVADVYEAMGAMTREQKRQTVGK
Other Proteins in cluster: phalp2_39181
Total (incl. this protein): 8; Avg length: 1450,4 AA; Avg pI: 8,76

Protein ID Length (AA) pI
3bZcI 1520 8,75160
7AJ8O 1336 9,42910
7AJib 1338 9,49892
7PXlA 1340 9,22551
7dPNN 1301 9,52464
7uhQp 1473 8,42481
8KNwr 1329 8,93894
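The cluster summary can be cross-checked against the member table. The sketch below assumes the averages include this protein (1966 AA, pI 6,27), which is counted in the total of 8 but not listed in the table; `to_float` is an illustrative helper for the decimal commas used on this page.

```python
# Member table as listed above (decimal commas kept as in the source).
members = """\
3bZcI 1520 8,75160
7AJ8O 1336 9,42910
7AJib 1338 9,49892
7PXlA 1340 9,22551
7dPNN 1301 9,52464
7uhQp 1473 8,42481
8KNwr 1329 8,93894
"""

# Assumption: this protein (A0A515MKC0) is part of the averages even though
# it is not a row in the table.
this_protein = (1966, "6,27")


def to_float(text: str) -> float:
    """Parse a number written with a decimal comma (or a decimal point)."""
    return float(text.replace(",", "."))


rows = [line.split() for line in members.splitlines()]
lengths = [int(length) for _, length, _ in rows] + [this_protein[0]]
pis = [to_float(pi) for _, _, pi in rows] + [to_float(this_protein[1])]

avg_length = sum(lengths) / len(lengths)
avg_pi = sum(pis) / len(pis)
print(len(lengths), round(avg_length, 1), round(avg_pi, 2))
```

Under that assumption the computed values agree with the reported total of 8, average length of 1450,4 and average pI of 8,76.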
Similar Clusters (pHMM search)
#   Cluster (protein ID)     # Members  Identity (%)  Alignment Length  E-value
1   phalp2_3994 (7AFQR)      49         35,4%         1206              5.980E-259
2   phalp2_20938 (7fSA6)     1          30,5%         1462              6.592E-199
3   phalp2_28099 (7ucAa)     137        25,6%         1114              9.152E-92
4   phalp2_24330 (3Pu9x)     29         27,2%         1049              8.731E-87
5   phalp2_36889 (7ukoa)     30         26,9%         1069              4.031E-84
6   phalp2_9474 (HzvZ)       51         22,8%         1062              2.611E-48
7   phalp2_13411 (4v9Zc)     2          21,8%         1212              5.413E-47
8   phalp2_22507 (1fUaK)     1          22,8%         1137              4.908E-46
9   phalp2_6456 (hsjD)       15         21,3%         1206              8.346E-43
10  phalp2_34982 (4HM46)     2          22,2%         1191              7.328E-28

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

       Name                            Taxonomy ID  Lineage
Phage  Rhodococcus phage Whack [NCBI]  2591132      Whackvirus > Whackvirus whack
Host   No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MK967393 [NCBI]
CDS location
range 10349 -> 16249
strand +
CDS
ATGCCCGCTATGCAGGGTGCCGAGACGTATATCAACGTCCTGCCGTCGATGAACGGGTACTTCAAGAGGGTCAACGCAGCAGTCAAGCGCAACAAGGTGACTCAACACGTCGACGTCGAGCTCGACCAGCGCCAGCTCAAGAAGGCGCAGGCCGATCTCGACAAGGCGTCCAAAAGTGCCGCTGATGCGAGACGTCGCGAGTCTGCGGCCACCCGCGAGAGTGTCGCCGCCGAGCGTGAGCTGCAAGCCCTGCGAAACAAGGGTGTCACCGACGCGAGCCGACTTGCTGCCGCCGAGGACAGGGTCGCGAAGGCAAAGGCGAACAGCCGAGCTGCCGCGAATGCTCTGAACGCGGCCGAGTCCAAGCGCTCGACAGCCGGTGCTCGCGTCACTCGTATCGAGGCCAAGCTCGACAGCAGGCGAGCCGACGCCGACAGTGAAACATTCCTGCAGCGGATCACCTCGAAGTTCGAACGCGGCGGTCAGGCAGCGGGTGCGAAGTTCGCCGGTGGTGTCAGCGCCGCTATGAACTCGCACTCGCGCGGCAACGACGAGGGTCGCAGTTTCGCGGCCGGGTTCGTCTCCGCGATCGGCAGCGGAATGCGGATGGCTGCTGTCGGTTTCACGGTCGTCAACGATGCGGCCCGGTCCGTCATCCGCAATGTCGGAATGGTGGCCACCGGTGTCGGGCTGGCAGCCCGAGCGTTGAAGGTCTTCAGTGCAGGGCTGCTCGTCAGCTCCTCCCTGCTGCGGGTGATGACCGGCGCGGGTATCGCACGCCTGGCAGGTGTACTGCGTCTCGCCGCAGCTGCCGCCGGCATCCTCGCCCGCGACATCGCGCGCGTGACCTCCGCGCTTCTGCTCATGGCAGCAGCGGCCCGGTTGGTCGGCATCCTGACCCGCGTCGGCCGCGCCCTCGGCATGGTCACTGTCGGGTCTGCGGTCGCCCTCGGCGCCATGTCTGCGCTCGGCTCGGTCGTCTCCAGCTTCGCCACCGGCCCCCTGGTGGCAGGTCTGACGGCAGTCGCCGGGGCAATGGGCACTGTCGCCGCAGCAGCGGCAGGCATCCTCGGCCCGGCTATCGGCGTCGCGAAGATGGCATTCGCGGGTCTGTCCGCCGGGGCGAAGGCTTGGGACAAGTCTCAGACCGCGGTCGGTGCTTCGGCGACCAAGGGCGCATCAGCGCTCAAGGCGATCGAGAGTGCGAAGAAGTCGCAGGTGCGTACCGCCGAGCAGGGTGCGCGTCAGATCGTCTCCGCCGAAAAGCAGGTCGTCAAAGCGCAGAAGGACGTCAAAGAAGCCCAGGACGACGTCAACAAGGCACGGCAGGAAGCCAAGCGAGATGCCGCCGGTTATGCCCGCACGCTCGCGGGGCTTGCACTCGACGAAGAAGCCGCCGAACTCGCGCTGTCCGAAGCCAAGAAGACGTTGCGTGAGACCAAGGTCGATCCCGACGCAGATGCAGACGACATGTGGCGCGCGAACCTCGGTGTCCGCGAGGCCAAGCAGGCGCTCGAGGAGATCAAGGCGTCGAACGAACAGCAGCGCGGGGAGATCGCCGATGCTCAAGCGAAGGGCATCGAAGGATCCGACCGTGTCGTCGAAGCCAAGCAGCGCGAAGTCGATGCGCAGGAGCAACTGCAGGAGGCTCAGGCTGATCTCGCGCAGACGCAGAAGGATGTGGCCCAGGCCAACATCGATGCCGCTGAAGCGGTTGCGGACGCCTACGAGAGCATGGCCGAATCGCAGCAATCGGCCGCATCCGGGGACGATCCCTTCGCCGCGATGATCGGGCAACGCCTCGCACCACTGCTCCAGGCACTGAAGAACCTGAGAGAGGAGATCACCGACCGATTCAGTGGGGCGATGTCGGGTGCGTTCGTCAAACTCGGCGGACTGCTCGATCATCTGACGCCGAGCCTCGGCGGGCTTGCCACCACACTCGGCAACCTCGGGACGCAGATCGTCTCGTCGATTTCGAGTCCGGCAGCGTTGGCGGG
CTGGGATCGGATGATCGACGGATCGAACCGGTTCTTCCAGAGCCTCTCGCAGGGCGAAAACGGTATCGGTTCTGTCTTTTCCGGACTGATCCAGGTCCTGGGTACGGCAGCGCAGACCTTCGCGGACAGCGGTGCCGGCATCAACGCGTGGCTACTCGATCTCGGTGAGAAGCTGCGCAATATCAGCGCTGATGATCTTCGTGGCACGTTCGACAGTGTCCGGCAGATCTTCCAGAACATCAGCGTGATCGCAGGACCACTGTTCGATCTCTTCCGAGGATTCGGCGCGGAAGCGGCATCCGGTCTCGCACCAGGGTTCTCGGCGATGGGTCAAGCGATTCAGGACTCGATTCCCGGCCTGATGGATATGGCACGCGAGCTGATGCCCGCACTCGGTCAGGCGCTCGTCAACCTCGCCCCGGTTCTTCCGGGTTTGGTGGATGCGTTCTCACCGTGGGCAGACATCCTCGCGATACTCGCGCCCCACTTGGCAACGATCATCGAGAAGCTGGTGCCCTTGGCGCCGATACTTCTCGGCGTCGTCACCGCTGTGAAACTCATCAGCGTCGCGGTCACAGTGTGGAACGGCATCATGTTCGCCGCATCCGTCGCACAGGGCGTCTTTGCCGCAGCAACCGGACGATCCGTTGCGACATTGGGCGCCAACACGATCGCGTTGGTAGCGCACCGAGTAGCACTGATCGCTGGAGCTGTCGCATCGAACATCGCTGCGATCGGCATGCGAGCTTTCGGTGTAGCGCTTCGCTTCGCCACAGGGCCAATCGGCCTGATCATCATGGCAGTCGCCGCTCTGGTCGCTGGGCTCGTATGGTTCTTCACGAAAACTGAACTCGGGAAGAAGATCTGGGACAAGGTCTGGAACGGCATCAAGGACGCCATGGCCAAGGCCTGGGAGTTCCTCAAGGGAGTTTTCACCAAGATCGGTGAGATCTTCTCCACCGTCTTCAACGCCATCGCCGACATCTTCAAATGGGTGTGGGAGTCGATCATCCAACCGATCTTCACCGGACTGAAAACCGCTATCGCCGTAGTGATCGTGGTGTTCCTGCTCTGGTGGGAGGGCGTCAAGCTCTACATCGAACTGATCGGCAACATCATCTCGTGGCTGTGGACTTCGGTCGCGCAACCGATCTGGGAACTGATGAAGGCCGGTCTGCGGATGCTCGGCGACTTCTTCGCCTGGGTGTGGAACACGCTGATCAAACCGGCGTGGGACGGACTCGCGGCCGGGATCTCGTGGGCGTGGGAGAACATCATTCGCCCCGCATGGGACGCCCTCAAGGTCGCTCTCGGTGCTGTTGGCGACTTCTTCGGATGGGTCTGGAACACGATCATCAAGCCCGCCTGGGATGGACTGTCCGCTGGCATCGCATGGGCCTGGGAGAACGTAATCCGGCCTGCCTGGGATGCGCTCAAGGCCGCTCTCGGCGTAGTGGGTGACTTCTTCACCTCGGTGTGGGAGACCGTAATCCGCCCGGTCTGGGACGCATTCGGCGCTGGCATCAAGGCCGTGTGGGAGAACGTCATTCGACCGGCCTGGGATGCACTCAAAGAGGGCCTCGGCAAGGTCCGAGATTTCTTCTCCGAGGTAGTCGGCGGGATCGGCAAGAAGTGGGACGAACTGAAGGGACTGCTCGCCAAGCCGATCAACTTCATGATCGACACGGTTTGGAATGGCGGCATCCTCGAGGCCTGGAACAAGGCAGCTGGATTGCTCGGACTCGGGAAGGCCGAGAAGCTGGCCACGATTCCCGAGCATGCAACCGGCGGTCAAATCAAGGGGCCGGGCACGGGAACCTCCGACGACGTCCTGATGTGGGGCTCGAACAGCGAGCACATGATGACCGCGAAGGAGGTCGAGCGCGCAGGCGGACACAATGCCGTCTACGCAATCCGCGACATGATCATGCGCGGCATTCCGTTCACGTGGGACGGCGGCAACCTCATCCGTGAGATGGGCCGAGACAACCTCAACGCCT
ACGGCGCTCAGGTTGCCCAGAAGGGCCTCGGCAACGTAGACCCGCAGGGAATGTTCGACTGGCTCCTCCCGAAGTACAAGGACGGCGGCGAGATCCGCGCCGGCGCCCCGTGGGAGAAGGCACTCGAGAACGGTCATCGCGCGGCGAAGATGCGCAACGGCAATCCGTACACATGGGGCTTCGAGGACTGCTCGGGCTACATGTCGATGATCGCCGACGCCATCCTCAACGGCGGTGACGGTGTACGCCGCTGGGCAACAGGATCCTTCCCCGGCGGCCAGCCGTTCGTGCCCGGCTTGGGTAAGGGCTTCTCGGTCGGCGTTCACGACAACCCGGGCGGCCCCGGTGGCGGACACACCGCCGGCACGCTCACCGGTGTCGGCCCGTACGCGACGGTCAATGTCGAGTCGGGTGGATCTCACGGCAATGTCGCGTACGGCGGCCCTGCCGTCGGGGCGGACTCGCCGCAGTTCGCCGGAAAGTCCCCCGGCGTCTTCCACCTGGCTATCGGATCGGACGGCTCGTTCGAGTCCGCGGGGCCTGGTGGTGGGGGTGGTCCGACTCCGCAACAGAAGGAGCAGTTCCTACAGAAGAAGATCGCCGAGATCTTCGACAAGGTGCTCAACCCGATCAAGGACGTCATGGGTTCCGTCATCGGCACCCCGCCGCCCGAATCGTTGGGTATCCCGCCCGGATTCCTGGACAAGGGCCGCGACATGACGTCCGGATTCCTGGCCGACAAGGTTCTCGGGCTCGGCGAGATGCTCTCGACTGCATGGGACAAGGCCAAGGACATCGGCAGCGTCCTCACCTTCGGACTGTTGCGCGATCAGGGTGGCTTCATCCCGAACGGACTGTCGATCGTGCGTAACGAGACGGGCAAGCCAGAGGCGGTGTTGAACTGGGATCAGCTACAGCTGGTCCGCGACATCCTCGGCCGCATCGGTCTGGGCAACACCGACAAGGTTGCCGAGCCTGGTCAAGATCCTGGTCCGGTCGATTGGGGCGGCGTCGGTGCCCAGATCGGGACTTCACTTCTCGCCGAGTGGGGCAACGACTTCCTTGGCATGGTGGGGATCGGCAAGCAGTTCGAGGGGATGAAACTCGTCGACGAGTACGGGCGCCGGTCCGACCAGGCCGGGGAACGTAACGAGGCCAGTTTCTCTGATTCGCCCTCGACAGCCGAGAGTGCATCGACGTCGCCGACCTACGGCGATCCGAATGCGCAATTGACACAGCAGTCGGTGGAGTTGTCGCCGATGCCGAACCTCGATGGCCCGTCCAGCAGTGGGGCTTCCGGAACGGTCGTGGACAAGGTCAAGGCTGCGTTCAAGCCGTACGGCTGGGACACCGGTGAGCAGTGGGCCGCTGCTGACTGGATCATCGGCAAGGAATCGAGCTGGAACCCGCTCGCACGCAACCCATCCTCGGGTGCGTTCAGCCTGTTCCAGTTCCTCGGGTCCACCAAGGATCAGTACCTTCCGGATGAGAATCCGGATCCGTACATCCAGGGACAGGCCGGTGCGAAGTACATCAAGGACCGCTACGGCGATCCGATGGCGGCGAAGTCGTTCTGGGAGAAGAACAACTGGTACGACCAGGGCGGACTCGCATTCGGTAAGGGCTTCATGCTCAAGAACGTCATCCATCCCGAGCGAGTCCTCTCGCCTCGGCAGACGGAGGCATTCGAAGAACTGGCCCCGATGCTCAACCGACTGCAACTCGCAACCACCGCACCGGGTGACGTCATGCCCGAATCGGCGCGCAGTGCACTCGAATTGGCACCGATGCGCGGCGGTCCTACCTACAACATCACGGGAAGGGTGGATCGAGAAACCATGAGCGAAGTAGGTATTCACGAGCGCCAGTCCTCGCGTACGTACGGATCGAGGACCCGATGA
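
The CDS coordinates can be checked against the reported protein length. The sketch below assumes 1-based inclusive coordinates (the usual GenBank convention) and a terminal stop codon included in the range; `cds_protein_length` is an illustrative helper, not part of any database API.

```python
def cds_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Residue count implied by a CDS with 1-based inclusive coordinates.

    Assumption: the range covers whole codons and, if has_stop is True,
    ends with a stop codon that is not translated.
    """
    n_nt = end - start + 1
    if n_nt % 3:
        raise ValueError(f"CDS length {n_nt} is not a multiple of 3")
    codons = n_nt // 3
    return codons - 1 if has_stop else codons


# Range taken from the CDS location above.
print(cds_protein_length(10349, 16249))
```

For the range 10349 -> 16249 this gives 5901 nt, or 1967 codons, which matches the 1966 AA protein length plus one stop codon.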

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3bZcI) rather than this protein.
PDB ID 3bZcI
Method AlphaFold v2
Resolution 50.30
Chain position -
Model Confidence (pLDDT)
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
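
The confidence bands can be applied to per-residue pLDDT values with a small lookup. This is a minimal sketch: the legend uses strict inequalities, so the handling of scores exactly at 90, 70 and 50 (assigned to the lower band here) is an assumption, and `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band,
    since the legend does not say which side they belong to.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Hypothetical per-residue scores, used only to show the call.
for value in (95.2, 81.0, 63.5, 41.7):
    print(value, plddt_band(value))
```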