Protein
- Protein accession
- A0A481VVG0 [UniProt]
- Representative
- 7bbzK
- Source
- UniProt (cluster: phalp2_12278)
- Protein name
- Tape measure protein
- Lysin probability
- 100%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
MAGGIIEIDVVLNDRDLTRRLDRAAQVARRAAGGITAAFGAAGLVAGLKQVVTVGRDFDLVMNTIRGTSGATAQEMAAISDKARQLGNDIALPATSAVDAATAMLELSKGGLSVKASMDAARGSLALAAAAQIDAGRAAEIQANAINTFRLKATDAEHVADVLANTANKSSAGMTDIADALQQSGSVAAQFNMTLDQTAGALGVIANNGIKGSDAGTLLKSTLLALQDTGKPAAAAMETLGITTYDAQGKFVGLGKILEQVNTAHKTLTEEEFNHASATLFGSDAMRLAGIAAKENADSFRNMTEAVGQAGGASRLAAEMNKGLPGALERVKNAAETTQLAIYELLRGPLTDLANFGADKLTGVAEWIGGQDMDAARFRAIADELRPLGERLREIGVDAADFAKGTTEAGGAVSMLGDGAKLAGEGAIAGLQPILGVVGGVVSAFTALPTPVQTAALAMGALALARGRMNRSLERAGSLEAGQRTRWQNMALAGRQFNQEMRTQAQLAQMQGQSLTRAQRAMAAYNTSTVGSVATMRNFTTQVAAARAGAVAAGTPISRLAASMQVLGQTRTDGIGRIAAAYTNAATAAGSYRTAAGLAAGGATALRVGAGALFNAVGGLSGLLMGGATLALAMYSQRQQKAAQAAAEHKQIVDGLVGSINAQTGALNSQGMAALQNSLAGAKFASSDKQGTNPYDLLARNQNLNIAPEQYLNAASGQQAQIDAVNQALDVQVRKSIDASDAWSTYGNILSRNGVSAEDLTLALRGNQEQIDRVEVALAGNAGIETFRKSLDSAGSDAIELGNQMGKANDALDDGQKLAIQQAEALRKVHPEARTLSDAMQVLSDNTASASDKASALKTALDALSGGAEGVDSARGQGFAEVEKLPGLWQAAAAAAGGFNGIIDQSTGRISINNDSTRELSASIGRVKDSMNNAAAAAYKFAIESGQSQDEARQAAVIASQGIYDGFMKTADGARQAGVDIDGLMRAMNVLPPQKLVEFISLGADKTKQEIYLIKQQLDQTPNAKSITVATLSDEAKKKLDELKVKTDTLPDGRVVVSALTDEARRRIDELLKPEEKIVTVSYRGYDIPSSIVRPDLLPPQPRATGGPVGYAVGGGIGPDGAIRGPGTGTSDSISTFAPAGGHVFTARETDAAGGHEGLRGMLAAMARPGRGRASGGAMQAVKLSNGEHFADPDQVAAVGGHDAIFSLRRALKSVQRFSLGGLVRSEQVGHANDGLPYITGARDCSMWVSWMVQAAKGQPLGRLFTTHTLIGGQTGGLAPGASPGDLLTVGTSSEHMAATVMTANGPVNTESGGNSSPSQVRWGRGAAGAFDPQFTHRFHLPLNLINPVPQKELTPGNPQSPYIPDALANRRAQNEIRNEGDGEQRQYIEPPKPPKLEDVSAEAAKIATRGLLETFGLENSILADPGQSTIGQALQIGANTERYRREQSMSPTENGDGTGVELSDEEKLRIKQDYERERFPRDQKYKEDQAAIRKQYPGDANKGVRDQKLFELKQRRDTEELPYKQQYQSRLKGASTTGGGAAGADPSSTDGGGGGSTVDGTGWGTRVQPQTPTDPYLMVDSAPYNPAFGAAQWGKEIAAALQISGMSQTFKGQMVGQGDIESKGDPRAVGPDSVDGRPEGWMQVKPGTFAGNRDARLPDDPFNVLANAVASLNYVKGRYGDPNGANWPTVAGYKDGGLRPMRGDKASVVPPNTWRVIGDRALNDEFFIPDTDDPQHVALGAEWARRRGMQLVQLHADGGISARARGGQAAMATALAEPRTEEYHTHEGPRNYNYSGPAEDARGFFGAARRHDRIDHQGAGITRVGRKGSNRG
- Physico‐chemical
properties -
protein length: 1833 AA molecular weight: 191281,7 Da isoelectric point: 6,18 hydropathy: -0,33
Representative Protein Details
- Accession
- 7bbzK
- Protein name
- 7bbzK
- Sequence length
- 2375 AA
- Molecular weight
- 249764,99530 Da
- Isoelectric point
- 4,86720
- Sequence
-
MATLDLGKLGFEIGVETSGLDKGLADAKRRVTDFEKQLDSAGKKKLAPVVDATQITKLDTSLGKVSQSASKLSSSRINPTADTAGLDRLQSAVQSAEKSSDELSKKKIRPDVDESGVDQGFKSIVGKAAGAAAAVGAAFSVIDFGRGVLQAGNEFQSQMNTLTAVSGASAQQLALVQAKARELGSATDLTATSASDAAAAMTELAKGGFTVDQAMSAAKGTLQLASAAQLDAAQAATIQSQALQAFGLGAEHAARVSDILAGAANASSAEMTGIAQGLQQAGTVSRQFGLTIEDTATALAMFANAGIQGSDAGTLLKTAMLALTDQGKPAQQAIEQLGLTVYDVNGKFVGMSSLMGQLQVASQNMTEEQYQAATATLFGSDAMRMAGIAAQQGSEGFDKLKEAVTRQGQAAEVAAAQTQGLPGALERWENTIEDLQLGVFDAMQDELVGLANAGVDMVNALQPAIEGVAKAAAGAAGDVLQFVTAVAQAPQSVKDVTVGLGEFAAILAVLNSSPAHTVFEKMSGGAKTAKGALSGVAEAAREASVQYGAQAMVLRDTAREQSVLAKTADTATARSNAFWAAHDARWAAGAATAKAHGAAIGGALQHIGGSAKAGLGSVVAMLGGPWMVGFAAAATAVTAFVSAGKAVDEANQRIEDSSRKAAEAQRELVLAASGTEGILSGDALSAAETMVEGALANITAKGEAANKVFGSVRTAIDEVGVGTFLLQGITSDATAEYQSALDAARGYGDQQKALSDALRDTGLTMDDVNRVVAEGGPEYDRLVESLRGMGPQGEAVADSLGQVRDEIERTAQAGRELPENYVQAARAIDLLADSATSAEDKLAAMDALMVAMGLRPKDAERAMLDFKESLDKTGQEIEMITRSSDFMGQAMFDASGKLQLEGANQATAELQKKIDGLATDLERAALKGADMDDAMANMEPTLQAIQQAWGLTDQQMEGIRDNLREIAGTSTFAVQLQGADEATQELSAILAMLKGAEEDQHEIRMDMPSEHVMAALDEMGVAVNDLGNGKVSIPVTAETREAIDQLQEVAWVADEAASKGVTIDALLNTDQLEGSAAHAQEILDILAIQNPSPEADVIIQKLLNGVDVSHGELAALAAVSSVPTADLEAKLLHAGVDGANRALGTIPKSTNTDLKGNANNVVKEAGRAKDSVNSVPSEKTVTFWARLRGAWDAIRGHFNGGVVGFHSGGLVPGFADGGFIPNIPGISDTERDPILGIDSTGVPVARIEPREYVVNRAATEKNLPLLHDINAGRVSMEDLPGYADGGVVSPAQLLAFAGGKTVNGKTAPRSLEGATYVWGGGLLGNWGDCSGAMSGLAAMAVGMPLQGRKFATGNEGSVLSSMGFSSGLGSGPRFAVGWFNGGPYGGHTAGTIYGADGTRVNVEMGGGRGNGQIGGRAAGADHSSFTNRAYLPLMGPGSGDITDPGSYGGGGGDVVNTSTDSVTMKKAGKTRKVDWGTASQLASDVEARNHKYKQLARYNAGLYDKGGILRDGQIAVNMSGHDEYVIPPALTNAIVTYLPQYAEALPGVTKALNDFTTLGNQVAGAGMNRAALTQVGWNTTLQARADLQNLPADASPFDRWAIYANRTAGEALMGAASMSNSQWIEAGEKLGLDFLGEYAGGIARAQEEIEDSYVAQVDAADALVEAEANLADAQRELNEVMEGAPELSKATARKVEDAERKVEEARKGGDAKKLADAERNLTRVREDAAEELEKNGAKDAQAILDARQAVTAAEADLTTAQGVVQAAAVATGQAQIAMAVEVASVVIKLTKKVAKVVKKVVEAERAARVGAAEAHAQIMANVRDLTEATEQQRRVVGGLMADMVRMKLQVLDSTWKVRQAQNSVWTAHLEGLVGIRKAEAALQAERDKLAGKQHYNFVGLDIEYDRLIGNIQSGLLDIEASEQDMLDMSTAAMVARIRGEQGLSDAYGDAMAQRLRTTKNLTKEEARLVSAVFAGRFAGYNDETASLEQRRAYASAYYSMQKAGLEGLMEQATRTSAELQALEALRSAAQWEREKNVLAMQLDSLEATYSQHEAVKQLARLGRDFNDEVARYNRLQSGAMGMQSDEAIIKAEIARLQAENAQARKDIKDTNFRAFWDFTKNGRILGIQSEASLTNQAAKATLAANDELLAELNNRLEALGKTVEPLSKEDQRIIEMAGMLRAKGENERADQLMRSTSYGKARDVQLLDDIDSRLADIARERKDTYDGLVDAMDERAYQGQRLPLDMQRYYAEGMEASYRSDAEGYREADGATRDAWWNLADWQREYAGAAADMSNRNPDSMMMGLNVQNERGAARNQTLIRMDLTQGALMYSDDVEAAFKRLASEMDGVKIDVRQLQRAGAPTGKTVQLKRAGIR
Other Proteins in cluster: phalp2_12278
| Total (incl. this protein): 13 | Avg length: 1691,0 | Avg pI: 5,40 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 7bbzK | 2375 | 4,86720 |
| 7cxT0 | 2250 | 4,95678 |
| A0A8T8JCT7 | 1359 | 5,09331 |
| A0A1C9EHW6 | 1835 | 5,87968 |
| A0A385D0D3 | 1361 | 5,08768 |
| A0A385UCA8 | 1361 | 5,06210 |
| A0A482MDJ2 | 1360 | 5,09024 |
| A0A4D6T4X7 | 1837 | 5,83648 |
| A0A7L7SQQ3 | 1359 | 5,09331 |
| A0A8F3EAQ5 | 1848 | 6,05224 |
| A0A976YDN7 | 1848 | 5,92532 |
| A0A9E8S2H6 | 1357 | 5,04585 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_18005
3c8yu
|
22 | 23,1% | 2119 | 2.795E-89 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Gordonia phage Dogfish [NCBI] |
2530117 | Nyceiraevirus > |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MK524505
[NCBI]
CDS location
range 11823 -> 17324
strand +
strand +
CDS
ATGGCCGGCGGCATTATCGAGATCGACGTCGTGCTCAACGACCGGGATCTGACCCGTCGTCTCGATCGTGCCGCGCAGGTGGCTCGTCGCGCGGCGGGCGGGATCACCGCCGCGTTCGGCGCGGCTGGTCTGGTCGCCGGTCTCAAGCAGGTCGTCACCGTCGGCCGCGACTTCGACCTCGTGATGAACACCATCCGAGGCACCTCCGGCGCCACCGCGCAGGAGATGGCGGCGATCTCGGACAAGGCACGCCAGCTCGGCAACGACATTGCACTACCCGCGACGTCGGCGGTCGACGCCGCAACCGCGATGCTCGAGCTGTCCAAGGGTGGCCTGTCGGTGAAGGCGTCGATGGACGCCGCCCGCGGTTCACTCGCACTCGCCGCCGCAGCCCAGATCGACGCCGGCAGGGCCGCGGAGATCCAAGCCAACGCCATCAACACCTTCCGGCTGAAGGCGACCGACGCCGAGCATGTCGCCGACGTCCTCGCCAACACCGCGAACAAATCGTCGGCCGGAATGACCGACATCGCCGACGCCCTGCAACAGTCCGGTTCGGTTGCCGCCCAGTTCAACATGACGCTGGATCAGACCGCCGGCGCGCTCGGCGTCATCGCGAACAACGGCATCAAGGGCAGCGACGCCGGCACTCTACTGAAGTCGACGCTGCTCGCGCTCCAGGACACAGGCAAACCCGCCGCCGCGGCGATGGAGACCCTCGGCATCACCACCTACGACGCCCAGGGGAAGTTCGTCGGGCTCGGCAAGATCCTGGAGCAGGTCAACACCGCCCACAAGACGCTCACCGAGGAAGAGTTCAACCACGCGAGCGCGACCCTGTTCGGCTCAGATGCGATGCGTCTCGCGGGCATCGCAGCCAAGGAGAACGCCGACTCATTCCGGAACATGACCGAGGCGGTCGGCCAGGCCGGCGGCGCGTCACGGCTGGCAGCCGAGATGAACAAGGGCCTGCCGGGCGCGCTCGAGCGAGTGAAGAACGCCGCCGAGACCACGCAGCTCGCGATCTACGAACTGCTGCGCGGCCCGCTGACCGATCTCGCGAACTTCGGCGCTGACAAGCTCACTGGCGTCGCCGAGTGGATCGGCGGCCAGGATATGGACGCGGCGCGGTTCCGGGCCATCGCCGACGAGCTCCGTCCGCTGGGGGAACGGTTGCGTGAGATCGGCGTCGACGCTGCCGACTTCGCGAAGGGCACGACCGAGGCCGGGGGTGCGGTCTCGATGCTGGGCGACGGCGCGAAACTCGCGGGCGAGGGTGCGATCGCTGGGCTGCAACCGATCCTCGGTGTCGTCGGTGGCGTGGTGTCAGCGTTCACTGCGCTGCCCACACCTGTCCAGACCGCCGCCTTGGCGATGGGCGCGCTCGCTCTTGCACGAGGGCGGATGAACCGCTCGCTCGAGCGCGCCGGTTCCCTCGAGGCTGGGCAGCGAACTCGGTGGCAGAACATGGCCCTGGCCGGTCGGCAGTTCAACCAGGAGATGCGTACCCAGGCGCAGCTGGCCCAGATGCAAGGCCAGAGCCTCACGCGCGCGCAGCGGGCAATGGCCGCGTACAACACCTCGACTGTCGGCTCGGTCGCAACGATGCGCAACTTCACCACCCAGGTTGCCGCCGCCCGGGCTGGCGCGGTGGCTGCTGGCACTCCGATCTCACGGCTCGCCGCCTCGATGCAGGTGCTCGGCCAGACCCGGACCGACGGCATCGGCCGGATCGCCGCGGCATACACGAACGCCGCGACCGCGGCGGGCTCGTACCGCACCGCGGCTGGCCTCGCCGCCGGCGGTGCGACCGCGCTCCGAGTGGGCGCCGGCGCCTTGTTCAACGCCGTCGGCGGACTGTCCGGTCTGCTGATGGGTGGCGCGACCCTGGCCCTGGCAATGTACTCGCAGCGCCAGCAGAAGGCGGCGCAGGCCGCGGCTGAGCACAAGCAGATCGTCGACGGCCTGGTCGGATCGATCAACGCGCAGACCGGCGCTCTGAACTCGCAGGGCATGGCGGCGCTACAGAACTCGCTGGCGGGCGCCAAGTTCGCCTCGTCGGACAAGCAGGGCACCAACCCGTACGATCTGCTAGCTCGCAATCAGAATCTGAACATCGCTCCGGAGCAATACCTGAATGCCGCCTCTGGCCAGCAGGCTCAGATCGACGCCGTGAACCAGGCCCTCGACGTCCAGGTCCGCAAGTCGATCGACGCGTCCGACGCGTGGAGCACGTACGGGAACATCCTGAGCCGCAACGGCGTATCCGCCGAGGACCTCACCCTCGCGCTCCGCGGAAACCAGGAGCAGATCGACCGTGTGGAGGTCGCGCTCGCGGGCAATGCCGGCATCGAGACGTTCCGTAAGTCACTCGACAGTGCCGGCAGCGATGCGATCGAGCTGGGCAACCAGATGGGCAAAGCGAACGACGCCCTCGACGACGGGCAGAAGCTGGCGATCCAGCAGGCCGAGGCGTTGCGCAAGGTGCACCCGGAGGCGCGGACGCTGTCTGACGCGATGCAGGTGCTCAGCGACAACACCGCGTCTGCCTCGGACAAGGCGAGTGCACTCAAGACAGCGCTCGACGCACTGTCCGGGGGCGCAGAGGGAGTCGACTCCGCGCGAGGCCAGGGATTCGCCGAGGTCGAGAAGCTGCCCGGCCTGTGGCAGGCGGCCGCCGCCGCGGCAGGCGGGTTCAACGGAATCATCGACCAGAGCACCGGACGAATCAGCATCAACAATGATTCCACCCGTGAGCTGTCGGCATCTATTGGCCGCGTCAAGGATTCGATGAACAACGCAGCTGCGGCGGCGTACAAGTTCGCAATCGAGTCCGGACAGAGCCAAGACGAAGCACGGCAGGCCGCGGTGATCGCATCGCAGGGAATCTACGACGGTTTCATGAAGACGGCCGACGGCGCGCGGCAAGCGGGTGTCGACATCGACGGTCTGATGCGCGCGATGAACGTGCTACCTCCGCAGAAGCTGGTGGAATTCATCTCGCTCGGGGCTGATAAGACCAAGCAAGAGATCTACCTGATTAAACAGCAGCTCGATCAGACGCCGAATGCGAAATCGATTACGGTCGCGACCCTTTCGGACGAGGCTAAGAAGAAGCTCGACGAACTGAAGGTCAAGACCGACACTCTCCCGGACGGCCGGGTGGTGGTGAGCGCCTTGACCGATGAGGCGCGTCGGCGGATCGACGAACTACTCAAGCCTGAAGAGAAGATCGTGACCGTGTCGTATCGCGGCTACGACATTCCCTCGTCGATCGTGCGCCCCGACTTGTTGCCACCGCAACCGCGTGCGACGGGTGGCCCGGTCGGTTACGCCGTGGGTGGCGGCATCGGTCCGGACGGTGCGATCCGCGGGCCGGGCACGGGTACCTCGGACAGCATCTCCACGTTCGCCCCGGCGGGTGGTCATGTGTTCACCGCACGCGAGACCGACGCCGCCGGTGGCCACGAAGGCCTCCGAGGAATGCTTGCCGCGATGGCGCGACCCGGCCGGGGCCGCGCGTCCGGCGGCGCTATGCAGGCGGTGAAGCTCTCCAACGGTGAGCATTTCGCCGATCCGGACCAGGTCGCTGCGGTAGGCGGACACGACGCGATCTTCAGTCTGCGCCGTGCGCTGAAGTCTGTGCAGCGGTTCTCGCTCGGCGGGCTGGTGCGCTCGGAGCAGGTAGGTCATGCCAACGATGGTCTGCCCTACATCACCGGCGCGCGCGATTGTTCGATGTGGGTGTCCTGGATGGTGCAGGCCGCCAAGGGCCAGCCGCTCGGCCGACTGTTCACCACCCACACCCTGATCGGCGGCCAGACCGGCGGGCTCGCACCGGGCGCATCACCGGGTGACCTGTTGACGGTGGGCACTTCGAGCGAGCACATGGCCGCAACGGTGATGACCGCCAACGGTCCGGTGAACACCGAGTCGGGAGGCAACTCGTCGCCGTCGCAGGTCCGGTGGGGCCGTGGCGCTGCGGGCGCATTCGATCCGCAGTTCACTCACCGATTCCATCTGCCGCTCAACCTGATCAACCCCGTCCCGCAGAAAGAGCTGACGCCGGGTAACCCACAGTCGCCCTACATCCCAGACGCGCTCGCCAACCGGCGCGCCCAGAACGAGATCCGCAACGAGGGCGACGGCGAACAGCGTCAGTACATCGAGCCCCCGAAGCCGCCGAAGCTCGAAGACGTCAGCGCCGAGGCGGCGAAGATCGCCACGCGCGGGCTGCTCGAGACGTTCGGGCTCGAGAACTCGATCCTCGCCGACCCCGGCCAGTCGACGATCGGCCAGGCATTGCAGATCGGCGCGAACACCGAACGCTACCGACGCGAACAATCGATGTCCCCGACCGAGAACGGCGACGGCACTGGGGTGGAGCTCTCTGATGAGGAGAAGCTGCGGATCAAGCAGGACTACGAGCGCGAGCGGTTCCCGCGGGACCAGAAGTACAAAGAGGACCAGGCCGCGATCCGCAAGCAGTACCCAGGTGACGCGAACAAGGGTGTGCGGGATCAGAAGCTGTTCGAGCTGAAGCAGCGCCGTGACACCGAGGAGCTGCCGTACAAGCAGCAATACCAGTCGCGGCTGAAGGGGGCTTCGACGACGGGCGGCGGTGCGGCCGGCGCCGATCCGAGCAGCACTGACGGCGGCGGCGGTGGGTCGACGGTCGACGGCACCGGGTGGGGCACGCGTGTGCAGCCGCAGACTCCCACCGACCCGTACCTGATGGTCGACTCGGCGCCCTACAATCCGGCGTTCGGTGCGGCGCAGTGGGGCAAGGAGATCGCCGCCGCGCTCCAGATCTCCGGCATGTCGCAGACGTTCAAGGGTCAGATGGTCGGCCAGGGTGACATCGAGTCGAAGGGTGACCCGCGCGCGGTCGGCCCGGACTCGGTCGACGGTCGGCCCGAGGGCTGGATGCAGGTGAAGCCCGGCACGTTTGCCGGCAACCGTGACGCCCGGCTGCCCGACGATCCGTTCAACGTCCTCGCGAACGCCGTTGCGTCACTGAACTACGTGAAGGGCCGGTACGGAGATCCGAACGGCGCGAACTGGCCGACGGTCGCGGGGTACAAGGACGGCGGGCTGCGGCCGATGCGCGGCGACAAGGCGTCGGTGGTCCCGCCGAACACCTGGCGCGTGATTGGCGACCGGGCGCTCAACGACGAGTTCTTTATCCCGGACACCGACGACCCGCAGCACGTGGCGCTCGGTGCGGAGTGGGCGCGGCGCCGCGGGATGCAGCTGGTGCAGCTCCACGCGGACGGCGGTATCTCGGCCCGGGCGCGCGGCGGTCAGGCGGCGATGGCCACCGCGCTCGCCGAGCCTCGCACCGAGGAGTACCACACGCACGAGGGGCCGCGGAACTACAACTACTCCGGTCCGGCCGAGGACGCGCGCGGGTTCTTCGGCGCCGCCCGCCGGCACGACCGCATCGACCATCAGGGCGCCGGGATTACGCGCGTCGGCCGGAAGGGATCGAACCGCGGATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0098003 | viral tail assembly | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(7bbzK)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50