Protein

Protein accession
A0A9E7S239 [UniProt]
Representative
49Z73
Source
UniProt (cluster: phalp2_17285)
Protein name
Tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MTAPRPGKGSIGVRVRPNANDFIRDLERQLKDKKKTFYVDVHANLKPANKEIQEWTRTTLKSMGAKIPVRADMSPANKDVAAWRTKQRGIKTNIPIGADLTKALSEVLAFRKVVGKPIDIELKVSTASANRAAERFAAALSRKSINVPINLDPNSVRKTANKIDTDLIRRLQNQGGVLIGVDADTSQAEARMNFFEAREEADPIHKQVDLDLKRARLEMLQLKAELARRKLEVQVDVKTSGVQRLQRSFDMIEKRLGNFSFIRSLDVGPFNLGKPTGLMGTLTTLNLLAGALPTATYGVTALSKALVDLGGAAALLPGMLGGFLASLSTFSVGISGVSDAFEKLTDMWTETPAEAASAARRSVQAHNQLRQAVREEAQAQRDVAAARREATNDLRNLNNELRGSVLNEAQAILDLQKARDRLAQGGFENATDMMQAQLDEAKAYQNLIDVRERNTQLQQKANDESAKGVENSDKVQEALERQTRASEQAAMALEAISSTQATSALGKFQQELDQLTPSAREFVLSIAGMRGEFEMLRNMVQETIFQGTGPAFQQMISNLLPIVGPGMQRIAAAMNDNILTVFKELESPTGKSIIERILGGTAEAQKMLSGLINPLLRGFGTLMAAGAEHLPQLVDLFTRLAERFANFVETADRNGNLDNFLDRGIGALEKMAELGINLIQIISSLGDTFDGDLLQSLVDATQKFEDWINSAEGQEKINELIQDARDLWQQWKPILEDLPGIMGRVADAAQIVLKPLLSILDRLTSFMVEHPGLVEGFVAAWLGAKVLVGAASPIVQLAKILTGVVSAVKALPTLLAKIPGSLPGLLMGNSAAGGKFPGIIPALGPAATIGLPAILGNEIAGQVTGQDLPLPTAIGQGFGAPKRIFDDIFGGGSEWTRISRANEAIKDVKFGGQSVPKYEPLGHDASQALIGSLVERGSKAGQWIAEAGVGLEQARRYSWLFSQPNAEEIINDPNFNPPKDYPSFDKGGFTNWGVEQGKQVVLHGKEYVQPHQTVDYYGVEAMEAIHQRRVPKQVLDSFYVGGFNFPLHPQVPGPAPAPAPTPAPAPAPTPAATPAPVQHGTGTGLPSLTTGVGTGPLPGPANPPQVDTTFTPTTPGPVGGQDDQQSINILGFNVPIGGAEQQMPDWSDPGLWPFGIPGIGRPGHTEADAGKWLADWGAKTLLGFGETLLGGVLGFFGAEGLLNNPYMNSIRGAIGYYSSLPGSVSSNKQADGADATNANVASLLDQYYNMPLNPLEGAPQLLALGNAAPNAPGNEGLQVNTARGKQIIQSVFPWATNIGGVREDALKWHPSGLALDVMIPGAGGLNDPTPPEGKALGDQMYAWLQANKEALGIDYIMWQEKGHYNHLHVNFKESGFADAMGLGTGSGQSSGGGSAQSEGLTALSELNAALGLDSATPPQGNGPLNLDSGSPVGRSLPSGYKRRAVRGGPKAVGDHEAEGLLQTPTEKGLLKAAAKQLYMRAGMPPAEWPAFDRLIEKESSWNPTAKNPKSTAYGLGQFLDITDAKYGPRSADPMVQLPRIFQYIRDRYDGSPAKALDFHNKNNWYDRGGWLMPGQSTVDNATGKPELVIPWDDIPSFYQGGMLPRGAVTGGRRPLPRGPEIGRLKPQPSAPVPIGPKAPPPSQLPGNLRPAPGGPADPNTPLLPGQSPAPPDGNYGPAPDTGGASARGGGVAAAQLGTGPGVAAGGSHLHPAAQKGIRSGAAALGSVVSSAISAAAAAGSFGAAGAAGAAGGGMGSLIGGLFTQGGKIVEDVANVGASFLVGNITNGTTPNPYGVTQRATNPTGGTRIVDNSQRYGDVYTQSPREFFRQLDLREAQHSQGSLGGYDRYA
Physico‐chemical
properties
protein length:1849 AA
molecular weight:195723,2 Da
isoelectric point:6,20
hydropathy:-0,27
Representative Protein Details
Accession
49Z73
Protein name
49Z73
Sequence length
2026 AA
Molecular weight
213044,33120 Da
Isoelectric point
5,99699
Sequence
MATDAARSSIGVRVRPNANNFINDLRTDLQGKKYTFYVDIKAQTQGATRDVKRWAATELRDVNAKVYVSANMSRATSDVARWRERQSSIKTQVQVTANMLEAQRDITAWRAIAGRDLEIKVKANVSGRLTEVEKLRKSAEREAKLTVRGDASQVRRDIKQGIDSVDQLSMFTVEVDADTKKAEVSINKFIMQEEQLPLALDLKLDTKEAVVEAKALKQKVEKDNPRAKVLLETAEARRDLAKLRLDAARKKLVVEVDVKTKKWDKFSKKVDQFEKGFGGGSVIRSLDFGPVNLGKPTGLLGTMTTITAFAGLVPGLVTGIAALSDGFVRLAGAAAMLPGALASMGAAFGTFKVGMFGFSNALDAMFNVWTESTDKIERNQRNTIKWTNDLTRALGNEKAAQRAITDARRDATNELRNLNNELRGSVLNEAQAILDLQRARDRMAQGDFENQTEYMQAQLDIARADQNVLDVRERNMQLQQKYAQKQQQGVEGSDQVTQALESQARATEAVALAMQSISMANPMGAQSLFEDAMDRLSPKAQAAVRAIEGLRSGITGFQRDLQDTMFDGVAEQITGTFSNLAPTIMPGMNAVAQGLNQNIMQIFDTLNSPDGESIIERILGGTAEAQQAMTGLIDPLIRGFGTLMAAGAEHMPQLVQLFTTLADRFANFIETADKSGALDKFLDDGITALGNIAELGINVVKIINDFSTAFRSAFGTDLLTKFVEITDRWHEFLSSAEGQQKLNEHIADAKEIWEGWKPILEKLPEIFDKVQNVAMRFLDIFLPALNAIAGVLEKTPGLVEAFAMAWIGGKMLGAAKGILDFGKLIFGVGKAVGQLGVSIGGMLIRMAPYLPLLGKYFNTPGLSQGGGLPTFLPGDKPVGDPKNQKPQSPLGKAGSAAGGILGGPGGLAATTAIVAVPSYIEQNARFASDDAQLAEINKSLPAGVKTLVKDTISGREQMGADEEAALDSLAQQGSPVLKWLMDPKGDDNKVERARRWAYLNRNPAIMNGDPNAPFEPPVEGSYEKGGFTDWPRGVGKNAMLHGQEFVLPAEAVDFYGRDFMESLRQRKLGFAEGGEAKIVDPLSGRTVDPSTTNHGAQPHGLGQQGPGILSAIATGVSGMVSNAANQATNAATGPLPGPAQPGIGTGPIPGPVSVPGLPPVTPGPSPKADPLTMSLGGLEVPLSTSLPTGWPTADGKPPVGLGGGPDGFDIRRFGIGPGPAGSGPADWMKWTTDFIGGTVTNLGSALLGGALDIFGLSGIMSNPYVTSALGLGSHFFGSATGDQEQPSVQAPGADQMNALANEYLQNYAQMPINPAYPGVPSMMTPEMQAMFPGYGVQFPGAAPLPNLNSLLPGGQAGYIPGGTELGAGNIPGGEANLQRATVAGRRILSAAFPFLPVIGGHRESDPYPWHPSGRALDPMIPSALVGTPQGTAIGDAIASYVMQNGAEMGVENVIWNDQSYSLGRDGQWVASPYKGFDNSPSQRHIDHVHIQFKEAGFPDANTKYYAPSSGIIQYAQNEQGFPVASLGEPGTLGSPPNGTTYTPGNGAASGPGRTPKKPVMYPVPAGAPKPGEGGLSGLGPKFNPYGYPVKPGAPKPGEGGLAGLGPAAKSGTASNASSNKSQGGAALSPKEQAKQLYLAAGLPESEWSAFWEIRQSTRAMDQMQKLYGFKRGVSSEAIASTIAHIKKMYQNSPVKALNQFRSQNTFRSGGAVFGQGGPTADLIPAMLSNGEHVLTAQEVEQMGGQESVYRFRDAVQSGQIGGFAWGGAVQALVPRPPPPPKPAPPPPVAQPPAVEPKPAVAQPSPIEQQPPQEPQSDATVPASTPPGPMLSPATAPDPNAVPAPEGTQPTADQLGPQGDETTNKAAEMLQGTGPGAPNLDKLHPALSTGISSGAAALGNIVSTAVSTAATAAAAGGTMGIGAAGGGAAGSLASSLIGGAFQQGGKILEGVANVGANFLVGNLTGGTTSNAYGVRQVANQPSGGTKIYDASTSIGSLHTADLNEYYRMENRRQAQRAQSGLGHWGSR
Other Proteins in cluster: phalp2_17285
Total (incl. this protein): 18 Avg length: 1858,6 Avg pI: 6,32

Protein ID Length (AA) pI
49Z73 2026 5,99699
1p5IR 1909 7,43149
7juLg 1849 6,19837
A0A023ZY61 1849 6,19837
A0A192YAV3 1849 6,26004
A0A2L1J068 1849 6,19837
A0A385UIK3 1840 6,51286
A0A385UIQ0 1835 6,08509
A0A3G8FEY5 1826 6,29955
A0A3S9U911 1849 6,19837
A0A411AYB3 1849 6,19837
A0A482MDW5 1849 6,19837
A0A5P8D520 1840 6,51286
A0A5P8D5T0 1849 6,19837
A0A5P8DDT3 1840 6,51286
A0A5P8DGM6 1849 6,25964
G1BNB6 1849 6,25964
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5195
3NQWs
35 27,1% 1956 2.501E-144
2 phalp2_31639
4Es8W
8 23,9% 1993 3.701E-96
3 phalp2_27883
8IrIz
99 25,3% 1837 7.282E-83
4 phalp2_383
7hFrk
4 25,3% 1588 4.776E-57

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage Huphlepuff
[NCBI]
2950302 Marvinvirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
ON637772 [NCBI]
CDS location
range 20694 -> 26243
strand +
CDS
GTGACGGCACCACGACCGGGTAAGGGCTCCATCGGCGTACGGGTGCGCCCGAACGCCAACGACTTCATCCGCGACCTGGAGCGCCAGCTCAAGGACAAGAAGAAGACGTTCTACGTCGATGTCCACGCCAACCTGAAGCCCGCGAACAAGGAAATTCAGGAGTGGACTCGCACCACCCTGAAGTCGATGGGGGCGAAAATCCCCGTGCGGGCCGATATGTCGCCCGCGAACAAGGACGTCGCCGCGTGGCGCACGAAGCAGCGCGGCATCAAGACGAACATCCCTATCGGTGCCGATCTGACCAAGGCGCTGTCGGAGGTTCTGGCGTTCCGCAAGGTCGTCGGGAAGCCCATCGACATCGAGCTGAAGGTGTCCACGGCGTCGGCGAACCGTGCGGCTGAGCGGTTCGCGGCGGCGCTGTCCCGCAAGTCGATCAACGTCCCGATCAACCTGGACCCGAACTCGGTTCGCAAGACGGCGAACAAGATCGACACCGACCTGATCCGGCGTCTCCAGAATCAGGGCGGCGTGCTGATCGGCGTGGACGCGGACACGTCGCAGGCCGAAGCGCGGATGAACTTCTTCGAGGCCCGCGAGGAAGCCGACCCGATCCACAAGCAGGTTGATCTGGACCTGAAGCGCGCTCGACTGGAGATGTTGCAGCTCAAGGCTGAGCTGGCGCGCCGCAAGCTTGAGGTCCAGGTGGACGTCAAGACGAGCGGCGTGCAGCGGCTCCAGCGGTCGTTCGACATGATTGAGAAGCGCCTGGGCAACTTCTCGTTCATCCGCTCTTTGGACGTCGGCCCGTTCAACCTGGGCAAGCCGACCGGCCTGATGGGCACCCTGACGACGCTGAACCTGCTCGCCGGGGCGCTGCCTACGGCGACGTACGGCGTGACGGCGCTGTCGAAGGCCCTGGTGGACCTGGGCGGCGCGGCGGCGCTCCTCCCTGGCATGTTGGGCGGGTTCCTGGCGAGCCTGTCCACCTTCTCAGTAGGCATTTCCGGTGTATCGGATGCGTTCGAGAAGCTGACCGACATGTGGACCGAGACGCCCGCAGAGGCGGCGTCGGCGGCGCGGCGGTCGGTGCAGGCCCACAACCAGCTGCGCCAGGCGGTGCGCGAGGAGGCACAGGCCCAGCGCGACGTCGCGGCGGCGCGGCGCGAAGCCACCAACGACCTGCGCAACCTGAACAACGAGCTGCGCGGGTCGGTACTGAACGAGGCGCAAGCCATCCTGGACCTCCAGAAGGCCCGCGACCGGCTGGCCCAGGGTGGGTTCGAGAACGCCACCGACATGATGCAGGCCCAACTCGATGAGGCGAAGGCGTACCAGAACCTCATCGACGTGCGGGAACGCAACACCCAGCTTCAGCAGAAGGCCAACGACGAGTCGGCGAAGGGTGTCGAGAACTCCGACAAGGTGCAGGAAGCGCTGGAGCGTCAGACGCGCGCATCCGAGCAGGCCGCGATGGCGCTGGAGGCCATCAGCTCCACCCAGGCCACCAGCGCCCTGGGCAAGTTTCAGCAAGAGCTTGACCAGCTGACCCCGAGTGCCCGCGAGTTCGTGCTGAGCATCGCGGGGATGCGCGGCGAGTTCGAGATGCTGCGCAACATGGTCCAGGAGACCATCTTCCAGGGCACCGGCCCAGCCTTCCAGCAGATGATCAGCAACCTGCTGCCCATTGTGGGGCCGGGTATGCAGCGCATCGCGGCGGCGATGAACGACAACATCCTCACCGTCTTCAAGGAACTGGAGTCGCCGACAGGCAAGAGCATCATCGAGCGCATCCTCGGTGGCACCGCTGAGGCGCAGAAGATGCTCAGCGGCCTGATCAACCCGCTGCTGCGCGGGTTCGGCACGCTGATGGCTGCCGGTGCCGAGCACCTGCCACAGCTGGTGGACCTGTTCACCCGGCTCGCCGAGAGGTTCGCCAACTTCGTGGAGACCGCTGACCGCAACGGCAACCTGGACAACTTCCTGGACCGTGGCATTGGCGCGCTGGAGAAGATGGCTGAGCTGGGCATCAACCTCATTCAGATCATCTCCAGCCTGGGTGACACCTTCGACGGCGACCTGTTGCAGTCGTTGGTGGACGCCACCCAGAAGTTCGAGGATTGGATCAACTCAGCTGAGGGCCAGGAGAAGATCAACGAGCTGATCCAAGACGCCCGGGACCTGTGGCAGCAGTGGAAGCCGATCCTTGAGGACTTGCCGGGGATCATGGGCCGGGTCGCCGACGCGGCGCAGATCGTGCTCAAGCCGCTGCTGTCCATCCTGGACCGGCTCACGTCGTTCATGGTTGAGCACCCCGGCCTGGTGGAGGGCTTCGTCGCGGCGTGGCTGGGCGCGAAGGTGCTGGTCGGCGCGGCGTCGCCGATCGTGCAGCTGGCGAAGATTCTGACCGGCGTGGTGTCGGCGGTGAAGGCGCTTCCGACGCTGCTGGCGAAGATTCCCGGCTCGCTGCCTGGCCTGCTCATGGGCAACAGCGCGGCGGGAGGCAAGTTCCCCGGCATCATCCCGGCGCTGGGGCCTGCGGCCACCATCGGCCTGCCTGCCATCCTGGGCAACGAGATCGCGGGTCAGGTCACCGGCCAAGACCTTCCGTTGCCGACCGCCATCGGCCAGGGGTTCGGCGCTCCGAAGCGCATTTTCGATGACATCTTCGGCGGCGGGTCGGAGTGGACCCGCATCAGCCGCGCGAACGAAGCCATCAAGGATGTGAAGTTCGGTGGCCAGTCGGTGCCGAAGTACGAGCCGCTGGGCCACGACGCCTCGCAGGCGTTGATCGGGTCGCTTGTCGAGCGTGGCTCCAAGGCTGGCCAGTGGATCGCCGAGGCTGGTGTGGGCCTGGAGCAGGCGCGCCGGTACTCGTGGCTGTTCTCCCAGCCGAACGCTGAGGAGATCATCAACGACCCGAACTTCAACCCGCCCAAGGACTACCCGTCGTTCGACAAGGGCGGCTTCACGAACTGGGGTGTCGAGCAGGGCAAGCAGGTCGTTCTGCACGGCAAGGAATACGTCCAGCCGCACCAGACAGTGGACTACTACGGCGTGGAGGCGATGGAGGCCATCCACCAGCGCCGGGTGCCCAAGCAGGTGTTGGACAGCTTCTACGTGGGCGGATTCAACTTCCCGCTGCACCCGCAGGTGCCTGGACCAGCCCCGGCTCCCGCCCCGACTCCGGCCCCAGCGCCCGCCCCGACTCCGGCTGCCACGCCAGCGCCGGTGCAGCACGGCACCGGCACGGGGCTGCCGTCTCTGACGACGGGTGTTGGCACCGGCCCGCTGCCGGGACCGGCCAACCCGCCGCAGGTGGACACCACGTTCACCCCGACCACCCCCGGCCCGGTGGGCGGGCAGGACGATCAGCAGTCCATCAACATTCTCGGCTTCAACGTGCCCATCGGCGGTGCCGAACAGCAAATGCCCGACTGGTCCGATCCTGGCCTGTGGCCGTTCGGCATCCCCGGCATCGGGCGTCCTGGACACACGGAAGCCGATGCCGGGAAGTGGCTGGCCGACTGGGGCGCGAAGACTTTGCTAGGGTTCGGCGAAACGCTCCTCGGAGGCGTGCTGGGGTTCTTCGGTGCCGAGGGCCTGCTGAACAACCCGTACATGAACTCGATTCGCGGCGCTATCGGGTATTACTCGTCCCTGCCGGGGTCGGTGTCCAGCAATAAGCAGGCCGATGGCGCGGACGCCACGAACGCAAACGTGGCGTCTTTGCTGGATCAGTATTACAACATGCCACTGAACCCCCTCGAGGGGGCTCCTCAGCTCCTCGCGCTGGGTAATGCCGCCCCGAACGCCCCCGGCAATGAGGGGTTGCAGGTCAACACTGCTCGCGGCAAGCAAATCATCCAGTCGGTGTTCCCGTGGGCCACCAACATCGGCGGCGTACGCGAAGATGCGCTGAAGTGGCACCCGAGCGGCCTGGCGCTGGATGTGATGATCCCCGGCGCGGGCGGGCTGAACGATCCGACGCCACCGGAGGGCAAGGCCCTCGGCGACCAGATGTACGCCTGGCTTCAGGCGAACAAAGAGGCGCTGGGCATCGACTACATCATGTGGCAGGAGAAAGGCCACTACAACCATCTCCATGTCAACTTCAAGGAAAGCGGCTTCGCCGACGCGATGGGCCTCGGCACGGGCAGCGGCCAGTCGAGCGGCGGTGGCTCAGCCCAGTCGGAGGGCCTGACGGCGCTCAGCGAGCTAAACGCCGCCCTGGGCCTGGACTCGGCCACCCCACCCCAGGGCAACGGGCCGCTGAACCTTGACTCCGGCAGCCCCGTGGGCCGCTCGCTGCCGTCTGGCTACAAGCGGCGCGCGGTGCGCGGCGGGCCGAAGGCCGTCGGAGACCACGAGGCCGAAGGTCTCCTCCAGACCCCGACAGAGAAGGGCCTTCTGAAGGCTGCCGCCAAGCAGCTCTACATGCGGGCGGGTATGCCGCCCGCCGAGTGGCCCGCGTTCGATCGGCTCATTGAGAAAGAGTCGAGCTGGAACCCGACCGCGAAGAACCCGAAGTCCACGGCGTACGGCCTGGGCCAGTTCCTGGACATCACGGACGCGAAGTACGGGCCGCGCAGCGCCGATCCGATGGTGCAGCTGCCGCGCATCTTCCAGTACATCCGCGACCGCTACGACGGTTCGCCAGCAAAGGCTCTGGACTTCCATAACAAGAACAACTGGTACGACCGGGGTGGCTGGCTGATGCCTGGCCAGTCCACGGTGGACAACGCCACGGGCAAGCCTGAGCTGGTGATCCCGTGGGACGACATCCCGTCGTTCTATCAGGGCGGGATGCTGCCGCGCGGCGCGGTCACCGGTGGCCGTCGGCCCTTGCCGCGCGGCCCTGAGATCGGTCGGCTCAAGCCGCAGCCTTCGGCTCCTGTGCCGATTGGGCCGAAGGCCCCTCCGCCGAGCCAGCTTCCAGGGAATCTGCGGCCCGCGCCGGGTGGCCCTGCGGACCCGAACACCCCGCTCCTGCCGGGGCAGTCCCCTGCGCCTCCTGACGGCAACTATGGCCCTGCGCCGGATACTGGCGGCGCGTCGGCGCGCGGCGGTGGCGTCGCGGCGGCGCAGCTGGGCACCGGCCCAGGCGTGGCGGCTGGCGGCAGCCACCTGCACCCTGCGGCGCAGAAGGGCATCCGTTCCGGCGCGGCGGCGCTGGGCAGCGTCGTGTCGTCGGCCATCAGCGCGGCAGCTGCGGCCGGATCGTTCGGCGCGGCGGGCGCGGCGGGCGCGGCGGGTGGCGGCATGGGCTCCCTGATCGGTGGACTCTTCACTCAGGGCGGCAAGATTGTCGAGGACGTAGCAAACGTCGGTGCTAGCTTCTTGGTAGGCAACATCACCAATGGCACGACGCCGAACCCGTACGGCGTCACACAACGGGCTACGAACCCGACAGGAGGGACGCGGATCGTTGACAACTCGCAGCGGTACGGGGACGTCTACACGCAGAGCCCGCGCGAGTTCTTCCGGCAGTTGGACTTGCGCGAGGCTCAGCATTCTCAGGGCTCGCTCGGTGGATACGACAGGTATGCGTAA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (49Z73) rather than this protein.
PDB ID
49Z73
Method AlphaFoldv2
Resolution 45.20
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50