Protein

Protein accession
A0A9E8M2P5 [UniProt]
Representative
HzvZ
Source
UniProt (cluster: phalp2_9474)
Protein name
Tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAKTFLVGEGAVRLIPNAAGFHVKARKAIKEGGGLNVGVDLRPETKAFRQEARTRLQSVKLTHDVKLRADVTGFSRDAQAKITATRNLSANVSLKATLDKGSLRGAYDAARSLLTSWGPLHVSIKASVDDADLRRALEQMRVRVESARLTANIRSRDRGGIGGGGGGGLGGGGGGGGRPLRTAARTTAVIAAPIVTQAALGGLTALVGAASQAAGALGLIPAAATAAGAGLAAVAIGAVGIGGAFSALSDASEQAGSAVSQSASQQASAQRQIAQADRGLATAHRGVTRALEDLNNERRNAVRRLRDMNDELKMAPINEREGALAIKESYKRLQEAYASGDTLEIEGAQIDVEKSKLQYDQIRKQNSDLAADVAVANKKGVEGDSQVIAAKDGVVDANNALIDAQDALTSAMESAAEATKQMAAGTDKLAQAMAKLSPNAQDFVRKIHALGPAWTETRKFIQDNLFAHLGDSVTTLANVQLPVLRTGLAGIASEINLGVRGALATFSTEMAAADFTTTLENTRQMWAGIGQSFAPFSQAFMNLATVGSTFMPRLGTAVANMANEFKQFTDEARADGSMQEFFENSLTMAKQLGRILANVGAIVGEVFSAGAEVGGGFLNTIETATGELREFLGSAEGQTALTTFFEGVRVAVQTLAPIIQIVASTILTVLGPALTDLVIGLGPGLVAMFEGLSVGLAAIQPVMQVVGQAIGTIGVELGEVFKVIGPVIAETLSALAPAVQPLAQAFGSLITAVAPILPLLAQLVAQLVGALAPVLTTLFDALAPVISQLVEALMPVIPPLAEVLGRLAGVFGEIIAKLLGALGPVLVDLVNTFMDLITQVMPFTNLLLDLVAEVLPAFASILTEILPLLPALVQPLVELAMTVLPHLLPVFQALVPIIGEVMKFIAGIISWAIREVIVPVITWLSTPLENIGRVFGWLWNEAIKPAWDGITNAISWGWDNIISPAFDALQTGISRVGDFFSDVVDGIQVAWGRLQSAASTPINWVINHVINGGIGRAWKAVDNFLGGHLPDWVDVSPIGMAVGGEVPMAKGAERGKDSVRILGMPGEHMWDVEDVNRAGGQQAMYRMRDMVMRGKPFTWTPGGLADATGDGALPRYAKGGELSAGDKLSPLPGEGGLQPIAQLMARIIKGTWPKTVSSIGGYRPPDGYNEHSSGRALDVMVTELGGKTGDEVTDFSMANHPNYPVTHTIWKQQMHYPPDGRTEGMDDRGSPTQNHMDHPHIWYAPNQGPINPNVMPDNIAFGGVTDAGVRKGITAWAEKAFNTALAPVKKLLDTQAFNPPPEIKATPRELYKGVVQPAKEKLLDKVSELTSMEGWKNMLGGAVDKVRKGAGGLIGGIAKLFDTGGVVRPGTTVVQNDTGQDEYLLNPLDTLMLRGLIGALRGIGINPKIEQQAPLTPEGTGPADVNIAGVGGKSTTPGELPTPQQDEIKPPTAEDLDGGLSGTGSGAATIPLKRNPDGTYSSTDPEWDHLIQRESGGIANRQQEVVDVNSGGNEASGLFQIAKGTWASNGGTKYAPTAGEATPEQQAEIAAKIFNDQGGSPWGSGLAGRESDDKLRAGIRPATPAGTKDDPVAVTVDTPSADPSKDWPTTADTAPGDKTGSAYGQNLQGAAIGPNGEYKPDQNVTPGPSGTAAQKPMFVNPFDTFAGKIGQNFAEHTPLGIGGPQVSKLAEKAPAITELANGVAQNIPAYAAALAGNPAMLAEKVATATGAWATKTATDFASYVPENAGGMVESLLSAAAGPLIGTVNTGLSKDDLTSTMEDVQNRQIRRTKTGRRRI
Physico‐chemical
properties
protein length:1798 AA
molecular weight:187701,4 Da
isoelectric point:5,42
hydropathy:-0,02
Representative Protein Details
Accession
HzvZ
Protein name
HzvZ
Sequence length
1414 AA
Molecular weight
149004,82840 Da
Isoelectric point
4,97042
Sequence
RRAPGGAAAAVENRRQLQRQVEAAERGVVQANRRIEDAERRVAEAQKNTRKAQEALNDARKEAVKDLKELKDQLSDAALNEEEATLAVARARQNLIDAQLDKDSTNLDIAEADLAHRKAVKNLDELREKNNQLARDVQSANEAGVEGSKKVQDAKEKVEAASQREADAQRGLLEAHENAIVANERLADALEKVGEAGAGAAGGGVDPFAEAMAKLSPNAREFVLAMQALGDQWTDLRLQVQDNLFEGMGEAVTNLATAQLPILKTGLAEVAAEINTGLRANIQALASESSQAGLGNMLANTAQGFSGLNQAAQPLVQAMVDIGSAGSNYLPQIGQYLGEAGARFGEFLTQATQTGQFDQWVQNGVNALKGIGDTLGDIGGIISGVWGAAAAAGQSSLGPMSQVLSMMNDFINSTAGQGALGSFFSAMTDGLAALMPILSTALQSIGTTIMPAISDFIQQAAPGVQVLVQGLADGLSALAPVMGPIGSLFGAIGQALAPLLPLFGEILAGALLPLANGLTTVIQAMAPVMQILAGALQPIIQQLAPVFQQLVEVIANVLVQVLQQITPYLPQLADAFSSILAAVIPLLPQLIQLVFQVITPFIPILGELMPVLVQLVQAFGSIIQAIMPVIQVLVDLIGVIARVAAEIVAFVARGIAEIATFVATVIAKIAQFVADIIGWFTNLAQRAPEQIRQLGDRVERFFTSMWDTAVKVVTIGVTNVVNKVREIKSLITGVFDGASKWLINAGKVIISGLWDGMKEMWDKTSDWFSDRISSIRSPFSSSRRANGSYSSNAMGSVSYYAAGGEDHSPQIAAAGEWRVWAEPETGGEAYIPLANDYRRGRAVEITAAVANHFGYNLVDAKGKGIKPSSKGSLGPTDVRAFAEGGITIEDLDDFSSDLEGKPYIWGGVHWGDCSGAMSGIARFVAGLDPWGGRFATGNQREALAGLGFLPGLGGAGALSMGWFNGGPYGGHTAGTLPSGTNVEMGGGRGNGQFGGHAAGAADPSFTDHAHVPAEFFAPIKVPKMAGLGNLDFGDLDFGEDDPTYGLDTDDPSAAKLRAFRASKKSNADNYVNPNGTAAAKDGPSTISEMVADVAKTAVAGHTKDILGIFGVPDDIPMVKAYGQWLKARRQVTPRASTSTKKREITSLSQAAADVIDADPGIETVELTGLDLVGGLSPIKEPKADDGDIGHIYVPGGGAEQWRGMAMAAMRRVGFDADNPAQVNAMVAQIQSESGGDPNIAQQIVDVNGTGDAAGVGLLQIIPTTYEANRDPDLPNDRRNPFSNMVAALRYYRGKYGMDLTTMWGHGHGYWAGGLVQGPGGPTDDLIPAMLSNNEFVVREAAARHARPLLEAINSDPQRARQIGQAFSTAVPQSTPAAVGRNVELHYHIETNNVEEGMRRSEMHARQQVMANLGI
Other Proteins in cluster: phalp2_9474
Total (incl. this protein): 51 Avg length: 1729,1 Avg pI: 4,86

Protein ID Length (AA) pI
HzvZ 1414 4,97042
5N99z 1677 5,15492
5XOu2 1843 4,65559
5XdH6 1652 5,00583
61JrH 1652 4,98997
66Huq 1670 5,06631
679HP 1840 4,63297
6YcRy 1686 4,63229
6Yd0Z 1695 5,05426
6YxP5 1824 4,66298
6Z8En 1843 4,64422
6ZCJ9 1567 4,52475
6d29g 1886 4,70390
6dN2X 1892 4,71038
6gw0W 1670 5,03584
6iKhh 1652 4,97298
6k7ap 1649 4,63371
6lNJ2 1843 4,65730
6nKzF 1652 4,97298
6q99S 1844 4,63888
6tKG9 1652 4,96019
6uSwR 1843 4,64815
6vIgA 1795 4,69066
70WTZ 1366 4,84793
717QK 1607 4,54737
71Uub 1587 6,42971
7A0lq 1635 4,45262
7MYqi 1892 4,59602
7NKvo 1843 4,64985
7NXv7 1844 4,63507
7cpLC 1648 4,56391
7x2bb 1691 4,56436
854km 1654 5,00350
8aZfX 1654 5,00350
BDzg 1844 4,63149
BuzP 1843 4,64422
DQP0 1652 4,99032
HMKd 1654 5,02806
MYTW 1654 4,95309
Nf7T 1649 4,64877
Ohve 1844 4,63149
VOQQ 1652 4,99958
oYiV 1844 4,62086
p8Bb 1843 4,64553
wLCi 1843 4,64945
yscU 1654 5,01760
zjNa 1844 4,63149
A0A386KD49 1802 5,50664
G8EJY7 1804 5,53040
A0A2L1IX75 1800 5,46714
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_11312
6YTtq
2 67,1% 878 0.000E+00
2 phalp2_36889
7ukoa
30 32,3% 1417 4.440E-209
3 phalp2_33467
7m6oK
34 25,8% 1368 1.219E-125
4 phalp2_18324
5inLv
5 26,6% 1451 4.407E-122
5 phalp2_36243
7zfxz
7 24,2% 1611 1.771E-110
6 phalp2_12266
72uge
6 23,7% 1564 5.591E-105
7 phalp2_22507
1fUaK
1 21,5% 1339 6.471E-80
8 phalp2_8527
1qRCG
6 22,8% 1226 8.514E-76
9 phalp2_6456
hsjD
15 24,2% 1267 3.593E-72
10 phalp2_20950
7lGFg
5 21,9% 1314 1.851E-64

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Gordonia phage Dalilpop
[NCBI]
2998886 Zierdtviridae > Gruunavirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
OP867020 [NCBI]
CDS location
range 25804 -> 31200
strand +
CDS
GTGGCGAAGACCTTCCTGGTCGGCGAAGGCGCCGTTCGGCTCATCCCCAACGCCGCCGGATTCCACGTCAAGGCCCGGAAGGCGATCAAGGAGGGTGGCGGCCTCAACGTCGGCGTCGACCTCCGGCCGGAGACGAAGGCCTTCCGCCAGGAGGCGCGCACCCGACTCCAGTCGGTCAAGCTCACCCACGACGTGAAGCTCCGCGCCGACGTCACCGGTTTCAGCCGCGACGCGCAGGCCAAGATCACCGCCACCCGCAACCTCTCGGCCAACGTCAGCCTGAAGGCCACCCTCGACAAGGGCTCGCTCCGCGGAGCATACGACGCCGCGAGGTCCCTGCTCACCTCGTGGGGTCCGCTGCACGTCTCGATCAAGGCCAGCGTCGACGACGCCGACCTCCGGCGCGCTCTGGAGCAGATGCGTGTCCGAGTCGAGTCCGCACGACTCACCGCGAACATCCGAAGCCGAGACAGAGGCGGGATCGGCGGTGGTGGGGGCGGGGGCCTCGGTGGCGGTGGTGGTGGAGGGGGCCGACCCCTCCGGACGGCCGCGCGCACCACCGCGGTGATCGCAGCCCCCATCGTCACCCAGGCCGCACTCGGCGGTCTCACCGCGCTCGTCGGCGCCGCATCGCAGGCCGCTGGAGCGCTCGGACTCATCCCCGCCGCGGCGACCGCCGCCGGAGCAGGCCTCGCCGCCGTGGCCATCGGCGCCGTCGGCATCGGGGGTGCGTTCTCCGCGCTCAGCGACGCCTCGGAACAGGCCGGGAGCGCGGTCAGCCAGAGCGCCAGCCAACAGGCCTCGGCGCAGCGCCAGATCGCGCAGGCCGACCGCGGGCTCGCCACTGCGCACCGCGGAGTCACTCGCGCGCTCGAAGACCTGAACAACGAGCGCCGCAACGCAGTTCGTCGCCTCCGCGACATGAACGACGAGCTGAAGATGGCTCCGATCAACGAGCGCGAGGGAGCGCTCGCGATCAAGGAGAGCTACAAGCGGCTCCAGGAGGCCTACGCCTCCGGCGACACCCTCGAAATCGAGGGTGCGCAGATCGACGTCGAGAAGTCGAAGCTCCAGTACGACCAGATCCGGAAGCAGAACTCGGACCTGGCCGCCGACGTCGCGGTGGCGAACAAGAAGGGTGTCGAGGGAGACTCCCAGGTCATCGCGGCCAAGGACGGTGTCGTCGACGCCAACAACGCGCTCATCGACGCTCAGGACGCGCTCACCTCGGCCATGGAGTCGGCGGCCGAGGCCACCAAGCAGATGGCTGCCGGGACCGACAAACTCGCGCAGGCGATGGCGAAGCTCTCCCCCAACGCGCAGGACTTCGTCCGGAAGATTCACGCGCTCGGACCGGCCTGGACCGAGACGCGAAAGTTCATCCAGGACAACCTGTTTGCCCATCTCGGTGATTCGGTCACCACTCTCGCCAATGTCCAGCTCCCGGTGCTCCGGACCGGGCTGGCGGGCATCGCCTCCGAGATCAACCTCGGCGTGCGCGGCGCGCTGGCCACCTTCTCCACCGAGATGGCCGCCGCCGACTTCACCACCACGCTGGAGAACACCCGGCAGATGTGGGCCGGGATCGGACAGAGCTTCGCCCCGTTCTCGCAGGCCTTCATGAACCTCGCGACCGTCGGCTCGACGTTCATGCCGCGCCTCGGGACCGCCGTCGCGAACATGGCGAACGAGTTCAAGCAGTTCACCGACGAGGCTCGCGCCGACGGCTCGATGCAGGAGTTCTTCGAGAACTCGCTGACCATGGCCAAGCAGCTTGGCCGCATCCTCGCCAACGTCGGAGCCATTGTCGGAGAGGTCTTCTCGGCCGGAGCCGAGGTCGGCGGGGGCTTCCTCAACACCATCGAGACCGCGACCGGCGAGCTGCGCGAGTTCCTCGGATCGGCCGAAGGGCAGACCGCGCTCACCACCTTCTTCGAGGGTGTCCGCGTCGCCGTGCAAACGCTCGCCCCCATCATCCAGATCGTCGCGTCGACGATCCTCACCGTGCTCGGCCCTGCCCTGACCGATCTCGTCATCGGTCTCGGCCCGGGCCTGGTCGCCATGTTCGAGGGCCTGTCGGTCGGTCTCGCGGCGATCCAGCCGGTGATGCAGGTCGTCGGCCAGGCCATCGGCACTATCGGCGTCGAGCTGGGTGAGGTCTTCAAGGTCATCGGTCCGGTCATCGCCGAGACCCTGTCGGCACTCGCACCGGCGGTCCAGCCGCTCGCGCAGGCCTTCGGCTCCCTGATCACCGCGGTGGCCCCGATCCTCCCGTTGCTCGCACAGCTCGTCGCGCAGCTCGTCGGCGCGCTGGCGCCTGTCCTGACCACCCTTTTCGACGCACTCGCGCCGGTCATCTCGCAACTCGTCGAGGCCCTGATGCCGGTCATCCCGCCGCTCGCGGAGGTGCTCGGCCGACTCGCCGGGGTGTTCGGCGAGATCATTGCGAAGCTGCTCGGCGCGCTCGGCCCGGTGCTCGTCGACCTCGTGAACACCTTCATGGACCTGATCACCCAGGTCATGCCGTTCACGAACCTGCTCCTCGACCTCGTCGCCGAGGTGCTCCCGGCCTTCGCGTCGATCCTGACCGAGATCCTCCCGTTGCTCCCGGCGCTCGTGCAGCCGCTCGTCGAGCTGGCCATGACCGTGCTGCCGCACCTGCTCCCGGTCTTCCAGGCTCTCGTGCCGATCATCGGCGAGGTCATGAAGTTCATCGCGGGCATCATCTCGTGGGCCATTCGCGAAGTGATCGTCCCCGTAATTACTTGGCTCTCAACGCCTTTGGAGAATATCGGTCGCGTCTTCGGCTGGCTGTGGAACGAAGCGATCAAGCCAGCCTGGGACGGGATCACCAACGCGATCTCGTGGGGCTGGGACAACATCATCTCCCCGGCCTTCGACGCACTCCAGACCGGGATCTCGCGCGTCGGCGACTTCTTCTCCGACGTCGTCGACGGAATCCAAGTTGCCTGGGGCCGACTACAATCGGCGGCCTCGACCCCGATCAACTGGGTGATCAACCACGTGATCAACGGCGGCATCGGCCGCGCGTGGAAGGCGGTCGACAACTTCCTCGGCGGACACCTGCCGGACTGGGTCGACGTCAGCCCGATCGGCATGGCGGTCGGTGGCGAGGTGCCGATGGCCAAGGGAGCCGAGCGGGGAAAGGACTCGGTCCGCATCCTCGGTATGCCCGGCGAGCACATGTGGGACGTCGAAGACGTCAACCGCGCCGGTGGCCAGCAGGCGATGTACCGGATGCGCGACATGGTCATGCGCGGGAAGCCGTTCACCTGGACCCCCGGCGGACTGGCCGACGCGACCGGCGACGGCGCGCTGCCGCGCTACGCCAAGGGTGGCGAGCTGTCGGCCGGTGACAAGCTGTCCCCGCTCCCGGGCGAGGGTGGCCTCCAGCCGATCGCGCAGCTCATGGCGCGCATCATCAAGGGCACCTGGCCGAAGACGGTGTCCTCCATCGGCGGGTACCGGCCGCCGGACGGCTACAACGAGCACTCCTCGGGCCGCGCGCTCGACGTCATGGTCACCGAGCTGGGTGGGAAGACTGGCGACGAGGTCACCGACTTCTCGATGGCGAACCACCCGAACTACCCGGTGACGCACACCATCTGGAAGCAACAGATGCACTACCCGCCGGACGGCCGTACCGAGGGTATGGACGATCGCGGGTCGCCGACGCAGAACCACATGGACCACCCGCACATCTGGTATGCGCCGAACCAGGGGCCGATCAACCCCAACGTCATGCCGGACAACATCGCGTTCGGTGGTGTGACTGACGCCGGTGTGCGCAAGGGGATCACCGCGTGGGCTGAGAAGGCCTTCAACACCGCGCTCGCGCCGGTGAAGAAGCTCCTCGACACGCAGGCGTTCAACCCGCCACCCGAGATCAAGGCGACCCCGCGCGAGCTGTACAAGGGTGTCGTTCAGCCAGCCAAGGAGAAGCTCCTCGACAAGGTGTCGGAGCTGACCTCCATGGAGGGCTGGAAGAACATGCTCGGCGGCGCCGTGGACAAGGTCCGCAAGGGTGCTGGCGGCCTCATCGGCGGGATCGCGAAGCTGTTCGACACCGGTGGCGTCGTCCGGCCGGGCACCACCGTGGTGCAGAACGACACTGGCCAGGACGAATACCTGCTCAACCCGCTCGACACGCTGATGCTGCGCGGGCTCATTGGCGCGCTCCGCGGCATCGGCATCAATCCGAAGATCGAGCAGCAGGCGCCGCTGACCCCCGAGGGCACCGGCCCGGCGGACGTGAACATCGCTGGCGTCGGCGGGAAGTCGACGACCCCGGGCGAGCTGCCGACTCCGCAGCAGGACGAGATCAAGCCGCCGACGGCCGAGGATCTCGATGGCGGTCTCTCCGGCACCGGCTCCGGCGCCGCGACGATCCCGCTGAAGCGCAACCCCGACGGCACCTACTCGTCGACCGACCCCGAGTGGGACCACCTCATCCAGCGCGAGTCCGGCGGTATCGCCAATCGCCAGCAAGAGGTCGTCGACGTCAACTCTGGCGGCAACGAGGCCTCGGGCCTGTTCCAGATCGCCAAGGGCACCTGGGCCTCGAACGGCGGCACGAAGTACGCGCCGACCGCGGGCGAAGCCACCCCAGAGCAGCAGGCCGAGATCGCCGCGAAGATCTTCAACGACCAGGGAGGCTCCCCCTGGGGCTCCGGCCTGGCCGGACGCGAGAGCGACGACAAGCTGCGCGCGGGCATCCGCCCGGCCACCCCGGCGGGCACCAAGGACGACCCGGTCGCGGTCACCGTCGACACTCCGTCGGCCGACCCGAGCAAGGACTGGCCGACCACCGCGGACACCGCGCCTGGCGACAAGACCGGGTCGGCGTATGGCCAGAACCTCCAGGGTGCCGCGATCGGTCCGAACGGCGAGTACAAGCCGGACCAGAACGTCACCCCGGGACCGAGCGGAACCGCGGCGCAGAAGCCGATGTTCGTCAACCCCTTCGACACCTTCGCGGGCAAGATCGGGCAGAACTTTGCCGAGCACACCCCGCTCGGCATCGGCGGTCCGCAGGTCTCCAAGCTCGCCGAGAAGGCTCCGGCGATCACCGAGCTGGCCAACGGCGTGGCGCAGAACATCCCGGCCTACGCCGCGGCGCTGGCGGGCAATCCGGCCATGCTCGCCGAGAAGGTCGCTACCGCCACCGGCGCCTGGGCCACCAAGACCGCGACCGACTTCGCCAGCTACGTCCCCGAGAACGCCGGAGGCATGGTCGAGTCGTTGCTGTCCGCAGCCGCCGGGCCGCTGATTGGTACGGTGAACACCGGCCTGAGCAAGGACGACTTGACGTCAACCATGGAGGACGTCCAGAACCGGCAGATCAGGCGGACCAAGACCGGGCGACGGAGGATCTGA

Gene Ontology

Description Category Evidence (source)
GO:0016787 hydrolase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (HzvZ) rather than this protein.
PDB ID
HzvZ
Method AlphaFoldv2
Resolution 60.08
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50