Protein
- Protein accession
- A0A5P8D520 [UniProt]
- Representative
- 49Z73
- Source
- UniProt (cluster: phalp2_17285)
- Protein name
- Tape measure protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MTAPRPGKGSIGVRVRPNANDFIRDLERQLKDKKKTFYVDVHANLKPANKEIQEWTRTTLKSMGAKVPVRADMSPANKDVAAWRTKQRGIKTNIPIGADLTKALSEVLAFRKVVGKPIDIELKVSTASANRAAERFAAALSRKSINVPINLDPNSVRKTANKIDTDLIRRLQNQGGVLIGVDADTSQAEARMNFFEAREEADPIHKQVDLDLKRARLEMLQLKAELARRKLEVQVDVKTSGVQRLQRSFDMIEKRLGNFSFVRSLDVGPFNLGKPTGLMGTLTTLNLLAGALPTATYGVTALSKALVDLGGAAALLPGMLGGFLASLSTFSVGISGVSDAFEKLTDMWTETPAEAASAARRSVQAHNQLRQAVREEAQAQRDVAAARREATNDLRNLNNELRGSVLNEAQAILDLQKARDRLAQGGFENATDMMQAQLDEAKAYQNLIDVRERNTQLQQKANDESAKGVENSDKVQEALERQTRASEQAAMALEAISSTQATSALGKFQQELDQLTPSAREFVLSIAGMRGEFEMLRNMVQETIFQGTGPAFQQMISNLLPIVGPGMQRIAAAMNDNILTVFKELESPTGKSIIERILGGTAEAQKMLSGLINPLLRGFGTLMAAGAEHLPQLVDLFTRLAERFANFVETADRNGNLDKFLDRGIGALEKMAELGINLIQIISSLGDTFDGDLLQSLVDATQKFEDWINSAEGQEKINELIQDARDLWQQWKPILEDLPGIMGRVADAAQIVLKPLLSILDRLTSFMVEHPGLVEGFVAAWLGAKVLVGAASPIVQLAKILTGVVSAVKALPTLLAKIPGSLPGLLMGNSAAGGKFPGIIPALGPAATIGLPTILGNEIAGQVTGQDLPLPTAIGQGFGAPKRIFDDIFGGGSEWTRIGRANEAIKDVKFGGQSVPKYEPLGHDASQALIGSLVERGSKAGQWIAEAGVGLEQARRYSWLFSQPNAEEIINDPNFNPPKDYPSFDKGGFTNWGVEQGKQVVLHGKEYVQPHQTVDYYGVEAMEAIHQRRVPKQVLDSFYVGGFNFPLHPQVPGPAPAPAPTPAPAPTPTPAATPAPVQHGTGTVLPSLTTGVGTGPLPGPANPPQVDTTFTPTTPGPMGGQDEQQSVNILGFNVPIGGAEQQMPDWSDPGLWPFGIPGIGRPGHTEADAGKWLADWGAKTLLGFGETLLGGVLGFFGAEGLLNNPYMNSIRGAIGYYSSLPGSVSSKKQADGADTTNTNVASLLDQYYNMPLNPLTGAPQLLALGNAAPNAPGNEGLQVNTARGKQIIQSVFPWATNIGGVRPDALKWHPSGLALDVMIPGAGGLNDPTPAEGKALGDQMYAWLQANKEALGIDYIMWQEKDHYNHLHVNFKESGFAAPHGSGGGSSGGGSSGGSAQSEGMTALSELNAALGLDSATPSQGNGPLNLDSGNPVGRSLPSGYKRRAVRGGPKAIGDHEAEGLLQTPTEKGLLKAAAKQLYMQAGMPPAEWPAFDRLIEKESSWNPTAKNPKSTAYGLGQFLDSTDAKYGPRSPDPMVQLPRIFQYIRDRYDGSPAKALDFHNKNNWYDRGGWLMPGQSTVDNATGKPELVIPWDDIPSFYQGGMLPRGAVTGGRRPLPRGPEIGRLQPRGPQAQPKAPSGPQAPASSGPIPPSASTPVQLPPVPHGVSGAQPGPAAQQQGGGVQSAQLGTGPGVSAGGSHLHPAAQKGIRSGAAALGSVVSSAISAAAAAGSFGAAGAAGAAGGGMGSLIGGLFTQGGKIVEDVANVGASFLVGNITNGTTPNPYGVTQRATNPTGGTRIVDNSQRYGDVYTQSPREFFRQLDLREAQHSQGSLGGYDRYA
- Physico‐chemical
properties -
protein length: 1840 AA molecular weight: 194949,2 Da isoelectric point: 6,51 hydropathy: -0,29
Representative Protein Details
- Accession
- 49Z73
- Protein name
- 49Z73
- Sequence length
- 2026 AA
- Molecular weight
- 213044,33120 Da
- Isoelectric point
- 5,99699
- Sequence
-
MATDAARSSIGVRVRPNANNFINDLRTDLQGKKYTFYVDIKAQTQGATRDVKRWAATELRDVNAKVYVSANMSRATSDVARWRERQSSIKTQVQVTANMLEAQRDITAWRAIAGRDLEIKVKANVSGRLTEVEKLRKSAEREAKLTVRGDASQVRRDIKQGIDSVDQLSMFTVEVDADTKKAEVSINKFIMQEEQLPLALDLKLDTKEAVVEAKALKQKVEKDNPRAKVLLETAEARRDLAKLRLDAARKKLVVEVDVKTKKWDKFSKKVDQFEKGFGGGSVIRSLDFGPVNLGKPTGLLGTMTTITAFAGLVPGLVTGIAALSDGFVRLAGAAAMLPGALASMGAAFGTFKVGMFGFSNALDAMFNVWTESTDKIERNQRNTIKWTNDLTRALGNEKAAQRAITDARRDATNELRNLNNELRGSVLNEAQAILDLQRARDRMAQGDFENQTEYMQAQLDIARADQNVLDVRERNMQLQQKYAQKQQQGVEGSDQVTQALESQARATEAVALAMQSISMANPMGAQSLFEDAMDRLSPKAQAAVRAIEGLRSGITGFQRDLQDTMFDGVAEQITGTFSNLAPTIMPGMNAVAQGLNQNIMQIFDTLNSPDGESIIERILGGTAEAQQAMTGLIDPLIRGFGTLMAAGAEHMPQLVQLFTTLADRFANFIETADKSGALDKFLDDGITALGNIAELGINVVKIINDFSTAFRSAFGTDLLTKFVEITDRWHEFLSSAEGQQKLNEHIADAKEIWEGWKPILEKLPEIFDKVQNVAMRFLDIFLPALNAIAGVLEKTPGLVEAFAMAWIGGKMLGAAKGILDFGKLIFGVGKAVGQLGVSIGGMLIRMAPYLPLLGKYFNTPGLSQGGGLPTFLPGDKPVGDPKNQKPQSPLGKAGSAAGGILGGPGGLAATTAIVAVPSYIEQNARFASDDAQLAEINKSLPAGVKTLVKDTISGREQMGADEEAALDSLAQQGSPVLKWLMDPKGDDNKVERARRWAYLNRNPAIMNGDPNAPFEPPVEGSYEKGGFTDWPRGVGKNAMLHGQEFVLPAEAVDFYGRDFMESLRQRKLGFAEGGEAKIVDPLSGRTVDPSTTNHGAQPHGLGQQGPGILSAIATGVSGMVSNAANQATNAATGPLPGPAQPGIGTGPIPGPVSVPGLPPVTPGPSPKADPLTMSLGGLEVPLSTSLPTGWPTADGKPPVGLGGGPDGFDIRRFGIGPGPAGSGPADWMKWTTDFIGGTVTNLGSALLGGALDIFGLSGIMSNPYVTSALGLGSHFFGSATGDQEQPSVQAPGADQMNALANEYLQNYAQMPINPAYPGVPSMMTPEMQAMFPGYGVQFPGAAPLPNLNSLLPGGQAGYIPGGTELGAGNIPGGEANLQRATVAGRRILSAAFPFLPVIGGHRESDPYPWHPSGRALDPMIPSALVGTPQGTAIGDAIASYVMQNGAEMGVENVIWNDQSYSLGRDGQWVASPYKGFDNSPSQRHIDHVHIQFKEAGFPDANTKYYAPSSGIIQYAQNEQGFPVASLGEPGTLGSPPNGTTYTPGNGAASGPGRTPKKPVMYPVPAGAPKPGEGGLSGLGPKFNPYGYPVKPGAPKPGEGGLAGLGPAAKSGTASNASSNKSQGGAALSPKEQAKQLYLAAGLPESEWSAFWEIRQSTRAMDQMQKLYGFKRGVSSEAIASTIAHIKKMYQNSPVKALNQFRSQNTFRSGGAVFGQGGPTADLIPAMLSNGEHVLTAQEVEQMGGQESVYRFRDAVQSGQIGGFAWGGAVQALVPRPPPPPKPAPPPPVAQPPAVEPKPAVAQPSPIEQQPPQEPQSDATVPASTPPGPMLSPATAPDPNAVPAPEGTQPTADQLGPQGDETTNKAAEMLQGTGPGAPNLDKLHPALSTGISSGAAALGNIVSTAVSTAATAAAAGGTMGIGAAGGGAAGSLASSLIGGAFQQGGKILEGVANVGANFLVGNLTGGTTSNAYGVRQVANQPSGGTKIYDASTSIGSLHTADLNEYYRMENRRQAQRAQSGLGHWGSR
Other Proteins in cluster: phalp2_17285
| Total (incl. this protein): 18 | Avg length: 1858,6 | Avg pI: 6,32 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 49Z73 | 2026 | 5,99699 |
| 1p5IR | 1909 | 7,43149 |
| 7juLg | 1849 | 6,19837 |
| A0A023ZY61 | 1849 | 6,19837 |
| A0A192YAV3 | 1849 | 6,26004 |
| A0A2L1J068 | 1849 | 6,19837 |
| A0A385UIK3 | 1840 | 6,51286 |
| A0A385UIQ0 | 1835 | 6,08509 |
| A0A3G8FEY5 | 1826 | 6,29955 |
| A0A3S9U911 | 1849 | 6,19837 |
| A0A411AYB3 | 1849 | 6,19837 |
| A0A482MDW5 | 1849 | 6,19837 |
| A0A5P8D5T0 | 1849 | 6,19837 |
| A0A5P8DDT3 | 1840 | 6,51286 |
| A0A5P8DGM6 | 1849 | 6,25964 |
| G1BNB6 | 1849 | 6,25964 |
| A0A9E7S239 | 1849 | 6,19797 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_5195
3NQWs
|
35 | 27,1% | 1956 | 2.501E-144 |
| 2 |
phalp2_31639
4Es8W
|
8 | 23,9% | 1993 | 3.701E-96 |
| 3 |
phalp2_27883
8IrIz
|
99 | 25,3% | 1837 | 7.282E-83 |
| 4 |
phalp2_383
7hFrk
|
4 | 25,3% | 1588 | 4.776E-57 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Mycobacterium phage JoieB [NCBI] |
2653277 | Marvinvirus > Marvinvirus marvin |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MN329674
[NCBI]
CDS location
range 20913 -> 26435
strand +
strand +
CDS
GTGACGGCACCACGACCGGGTAAGGGCTCCATCGGCGTACGGGTGCGCCCGAACGCCAACGACTTCATCCGCGACCTGGAGCGCCAGCTCAAGGACAAGAAGAAGACGTTCTACGTCGATGTCCACGCCAACCTGAAGCCCGCGAACAAGGAAATTCAGGAGTGGACTCGCACCACCCTGAAGTCGATGGGGGCGAAAGTCCCTGTGCGGGCCGATATGTCGCCCGCGAACAAGGACGTCGCCGCGTGGCGCACGAAGCAGCGCGGCATCAAGACGAACATCCCTATCGGTGCCGATCTGACCAAGGCGCTGTCGGAGGTTCTGGCGTTCCGCAAGGTCGTCGGGAAGCCCATCGACATCGAGCTGAAGGTGTCCACGGCGTCGGCGAACCGTGCGGCTGAGCGGTTCGCGGCGGCGCTGTCCCGCAAGTCGATCAACGTCCCGATCAACCTGGACCCGAACTCGGTTCGCAAGACGGCGAACAAGATCGACACCGACCTGATCCGGCGTCTCCAGAATCAGGGCGGCGTGCTGATCGGCGTGGACGCGGACACGTCGCAGGCCGAAGCGCGGATGAACTTCTTCGAGGCCCGCGAGGAAGCCGACCCGATCCACAAGCAGGTTGATCTGGACCTGAAGCGCGCTCGACTGGAGATGTTGCAGCTCAAGGCTGAGCTGGCGCGCCGCAAGCTTGAGGTCCAGGTGGACGTCAAGACGAGCGGCGTGCAGCGGCTCCAGCGGTCGTTCGACATGATCGAGAAGCGCCTGGGCAACTTCTCGTTCGTCCGCTCTTTGGACGTCGGCCCGTTCAACCTGGGCAAGCCGACCGGCCTGATGGGCACCCTGACGACGCTGAACCTGCTCGCCGGGGCGCTGCCTACGGCGACGTACGGCGTGACGGCGCTGTCGAAGGCCCTGGTGGACCTGGGCGGCGCGGCGGCGCTCCTCCCTGGCATGTTGGGCGGGTTCCTGGCGAGCCTGTCCACCTTCTCAGTAGGCATTTCCGGTGTATCGGATGCGTTCGAGAAGCTGACCGACATGTGGACCGAGACGCCCGCAGAGGCGGCGTCGGCGGCGCGGCGGTCGGTGCAGGCCCACAACCAGCTGCGCCAGGCGGTGCGCGAGGAGGCACAGGCCCAGCGCGACGTCGCGGCGGCGCGGCGCGAAGCCACCAACGACCTGCGCAACCTGAACAACGAGCTGCGCGGGTCGGTGCTGAACGAGGCGCAAGCCATCCTGGACCTCCAGAAGGCCCGCGACCGGCTGGCCCAGGGTGGGTTCGAGAACGCCACCGACATGATGCAGGCCCAGCTCGATGAGGCGAAGGCGTACCAGAACCTCATCGACGTGCGGGAACGCAACACCCAGCTTCAGCAGAAGGCCAACGACGAGTCGGCGAAGGGTGTCGAGAACTCCGACAAGGTGCAGGAAGCGCTGGAGCGTCAGACGCGCGCATCCGAGCAGGCCGCGATGGCGCTGGAGGCCATCAGCTCCACCCAGGCCACCAGCGCCCTGGGCAAGTTTCAGCAAGAGCTTGACCAGCTGACCCCGAGTGCCCGCGAGTTCGTGCTGAGCATCGCGGGGATGCGCGGCGAGTTCGAGATGCTGCGCAACATGGTCCAGGAGACCATCTTCCAGGGCACCGGCCCAGCCTTCCAGCAGATGATCAGCAACCTGCTGCCCATTGTGGGGCCGGGTATGCAGCGCATCGCGGCGGCGATGAACGACAACATCCTCACCGTCTTCAAGGAACTGGAGTCGCCGACAGGCAAGAGCATCATCGAGCGCATCCTCGGTGGCACCGCCGAGGCGCAGAAGATGCTCAGCGGCCTGATCAACCCGCTGCTGCGCGGGTTCGGCACACTGATGGCTGCCGGTGCCGAGCACCTGCCACAGCTGGTGGACCTGTTCACCCGGCTCGCCGAGCGGTTCGCCAACTTCGTGGAGACCGCCGACCGTAACGGCAACCTGGACAAGTTCCTGGACCGTGGCATTGGCGCGCTGGAGAAGATGGCTGAGCTGGGCATCAACCTCATTCAGATCATCTCCAGCCTGGGTGACACCTTCGACGGCGACCTGTTGCAGTCGTTGGTGGACGCCACCCAGAAGTTCGAGGATTGGATCAACTCGGCTGAGGGCCAGGAGAAGATCAACGAGCTGATCCAAGACGCCCGGGACCTGTGGCAGCAGTGGAAGCCGATCCTTGAGGACTTGCCGGGGATCATGGGCCGGGTCGCCGACGCGGCGCAGATCGTGCTCAAGCCGCTGCTGTCCATCCTGGACCGGCTCACGTCGTTCATGGTTGAGCACCCCGGCCTGGTGGAGGGCTTCGTCGCGGCGTGGCTGGGCGCGAAGGTGCTGGTCGGCGCGGCGTCGCCGATCGTGCAGCTGGCGAAGATTCTGACCGGCGTGGTGTCGGCGGTGAAGGCGCTTCCGACGCTGCTGGCGAAAATCCCCGGCTCGCTGCCTGGCCTGCTCATGGGCAACAGCGCGGCGGGTGGCAAGTTCCCCGGCATCATCCCGGCGCTGGGGCCTGCGGCCACCATCGGCCTGCCCACCATCCTGGGCAATGAGATCGCGGGTCAGGTCACCGGCCAAGACCTTCCGCTGCCGACCGCCATCGGCCAGGGGTTCGGCGCTCCGAAGCGCATTTTCGATGACATCTTCGGCGGCGGGTCGGAGTGGACTCGCATCGGCCGCGCGAACGAAGCCATCAAGGATGTGAAGTTCGGTGGCCAGTCGGTGCCGAAGTACGAGCCGCTGGGCCACGACGCCTCGCAGGCGTTGATCGGGTCGCTTGTCGAGCGTGGCTCCAAGGCTGGCCAGTGGATCGCCGAGGCTGGCGTGGGCCTGGAGCAGGCGCGCCGGTACTCGTGGCTGTTCTCCCAGCCGAACGCTGAGGAGATCATCAACGACCCGAACTTCAACCCGCCCAAGGACTACCCGTCGTTCGACAAGGGCGGCTTCACGAACTGGGGTGTCGAGCAGGGCAAGCAGGTCGTTCTGCACGGCAAGGAATACGTCCAGCCGCACCAGACAGTGGACTACTACGGCGTGGAGGCGATGGAGGCCATCCACCAGCGCCGGGTGCCCAAGCAGGTGTTGGACAGCTTCTACGTGGGCGGATTCAACTTCCCGCTGCACCCGCAGGTGCCTGGACCAGCCCCGGCTCCCGCTCCGACTCCGGCTCCAGCGCCCACCCCAACTCCGGCTGCCACGCCAGCGCCGGTGCAGCACGGCACCGGCACGGTGCTGCCGTCTCTGACAACGGGTGTCGGCACCGGCCCGCTGCCGGGACCGGCCAACCCGCCGCAGGTGGACACCACGTTCACCCCGACCACCCCCGGCCCGATGGGCGGGCAGGACGAGCAGCAGTCCGTCAACATCCTCGGCTTCAACGTGCCCATCGGCGGTGCCGAACAGCAAATGCCCGACTGGTCCGATCCTGGCCTGTGGCCGTTCGGCATCCCCGGCATCGGGCGTCCCGGACACACGGAAGCCGACGCCGGGAAGTGGCTGGCCGACTGGGGCGCGAAGACTTTGCTAGGGTTCGGCGAAACGCTCCTCGGAGGCGTGCTGGGGTTCTTCGGTGCCGAGGGCCTGCTGAACAACCCGTACATGAACTCGATTCGCGGCGCTATCGGGTATTACTCGTCCCTGCCGGGGTCGGTGTCCAGCAAGAAGCAGGCCGATGGCGCGGACACCACGAACACCAACGTAGCGTCTTTGCTGGATCAGTATTACAACATGCCGCTGAACCCCCTCACGGGGGCTCCTCAGCTCCTCGCGCTGGGTAATGCCGCCCCGAACGCCCCCGGCAATGAGGGGTTGCAGGTCAACACTGCTCGCGGCAAGCAAATCATCCAGTCGGTGTTCCCGTGGGCCACCAACATCGGCGGCGTACGCCCCGATGCGCTGAAGTGGCACCCGAGCGGCCTGGCGCTGGACGTGATGATCCCCGGCGCGGGCGGGCTGAACGACCCGACGCCAGCGGAGGGCAAGGCCCTCGGCGACCAGATGTACGCCTGGCTTCAGGCGAACAAAGAGGCTCTGGGCATCGACTACATCATGTGGCAGGAGAAGGACCACTACAACCATCTCCATGTCAATTTCAAGGAAAGCGGCTTCGCCGCGCCGCACGGTAGCGGCGGCGGCAGCAGCGGCGGCGGCAGCAGCGGCGGGTCGGCCCAGTCGGAGGGCATGACGGCGCTGAGCGAGCTGAACGCCGCCCTGGGCCTGGACTCGGCCACCCCATCCCAGGGCAACGGGCCACTGAACCTTGACTCCGGCAACCCCGTGGGCCGCTCGCTGCCGTCTGGCTACAAGCGGCGCGCGGTGCGCGGCGGGCCGAAGGCCATCGGAGACCACGAGGCCGAAGGGCTCCTCCAGACCCCGACAGAGAAGGGCCTGCTGAAGGCTGCCGCCAAGCAGCTCTACATGCAGGCGGGTATGCCGCCCGCCGAGTGGCCCGCGTTCGATCGGCTCATCGAAAAAGAGTCGAGCTGGAACCCGACCGCGAAGAACCCGAAGTCCACGGCGTACGGCCTGGGCCAGTTCCTGGACAGCACGGATGCGAAGTACGGGCCGCGCAGCCCCGATCCGATGGTGCAGCTGCCGCGCATCTTCCAGTACATCCGCGACCGCTACGACGGTTCGCCAGCAAAGGCGCTGGACTTCCATAACAAGAACAACTGGTACGACCGTGGTGGCTGGCTGATGCCTGGCCAGTCCACGGTGGACAACGCCACCGGCAAGCCTGAGCTAGTCATCCCGTGGGACGACATCCCGTCGTTCTACCAGGGCGGGATGCTGCCGCGCGGCGCGGTCACCGGTGGTCGTCGGCCCCTGCCGCGTGGTCCCGAGATCGGGCGATTGCAGCCCCGGGGTCCGCAGGCCCAGCCGAAGGCCCCATCCGGCCCGCAGGCCCCGGCGAGCAGCGGCCCGATTCCGCCGTCTGCCAGCACTCCCGTGCAGCTGCCTCCGGTTCCGCATGGGGTGTCTGGGGCTCAGCCAGGTCCCGCCGCCCAGCAGCAGGGCGGCGGCGTCCAATCGGCGCAGCTGGGCACCGGCCCCGGCGTGTCGGCAGGTGGCAGCCACCTGCACCCTGCGGCGCAGAAGGGCATCCGTTCCGGCGCGGCGGCGCTGGGCAGCGTCGTGTCGTCGGCCATCAGTGCGGCGGCTGCGGCCGGGTCGTTCGGCGCGGCGGGCGCGGCGGGAGCGGCTGGCGGCGGCATGGGCTCTCTGATCGGCGGACTCTTCACGCAGGGCGGCAAGATTGTCGAGGACGTAGCAAACGTCGGTGCTAGCTTCTTGGTAGGCAACATCACCAACGGCACGACGCCGAACCCGTACGGCGTCACACAGCGGGCTACGAACCCGACAGGAGGGACGCGGATCGTTGACAACTCGCAGCGGTATGGGGACGTCTACACGCAGAGCCCGCGCGAGTTCTTCCGGCAGTTGGACTTGCGCGAGGCTCAGCATTCTCAGGGCTCGCTCGGTGGATACGACAGGTATGCGTAA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(49Z73)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50