Protein

Protein accession
Q6VY42 [UniProt]
Representative
6DolG
Source
UniProt (cluster: phalp2_39606)
Protein name
Tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MATAAVEVGVGYVSVVPSARGFAEDLQRQIGRPTVQVGAEIGEQSGQAASRGMLTTLGAGLKTGLAAVAVAGAAVFSAAFVQAIEQDKSNARLAAQLGLDPKQAKRLGAVAGEVYAKGYGESIDQVNDSLRTLAQNGVIAVNAPRKDIASLTKSATNLAEAFGVDVGDAARAAGQLIRTGMVKDAKQAFDLITRGFQSGADKGGDFIDTLNEYSTQFRKAGLDGQAAVGLITQALQAGARDGDIAADAIKEFSIRAVDGSESSAEGFKALGLNAVTMGQQFARGGTAANAVLDLTLDRLRAVKDPVKQSQLAVALFGTQAEDLGAALFAMDPSSAVSALGKVGGAADRMGDALHNTATNDFEVFKRQAMQSIVAVIDREVLPVLAKVGALLLEKVPPALSAVSDAFSAGVNWVREYGAWLLPLGVAVAGLTITLNASAIATGAVTAVFAVYRGVILAAAAVTRGYAIVQGLLNAVMTANPIGLIITGIAALVTLLVVAYQKSDTFRGIVQAAWAGIKAGWDVLWTTTLKPGFDGLMVGLRAVGDAAVWLWQTILSPVFSAIWTAAKVLFAIVVVAVVVPIILAFKVLAAIGAWLWKSALKPAFDGIAAGAIWLWKSALKPAFDAIVLTLKAVGAGATWLWTAVLAPSFRAIGAAGAWMWNSVLKPAFGALMDGMRAVGTALQYVWRTILSPVFTAIGAAGKWLWDNSLKPVFDKIKAGAKLMGAAFGLARDAISKAWDSVVKVSAKPVNFIIRHVYTEGIKAVWDKVAGFVGLGKLPDAPKLLARGGRTSGGIPGQDSIPALLMADEYVIKRSSARSVGFGALEHINRTGELPVQRFADGGIVGWLGDAAKKVGGVVMSGVDFLSDPGRMWETATKSVRDMIAKIGQSGIAKMLAQVPGKMLGGLKDKVLDAAKSLFGGSSGAADIGGSGVQRWSPVVLQALQMVGQSASLLPVVLRRMNQESGGNPAAINNWDINAKNGVPSKGLMQVIDPTFAAYAGALRGRGVWDPLANIYASMRYALSRYGSLASAYNRPGGYANGGRPRPGELAWVGERGPELVRFGGGDTEVFDHERSMQMAAGLVPLRGFAKGTKPRSARVRTDALLPTARIVDVQLPKPSASDLAAFTKSLTGSASAIGTAAAQLTKRLMLAGGAGRTLAAQVSKVSAELQGLATKRDRVSGIIATAREAAAGQRQTAADFLGLSNLSSTGSVEDLIMGMETRQDTLRGFQSTIRSLEKRGLNQDAIRQLVAMGPDSTLAKMITEGSGSDIRRINELTKSGGTLATAFGNSMADAMYDSGKDAGKGFLTGLLSQQRDLQTAMTRLGASLIQNIKVGMGLAKPSPTKAASRNPVLKPTKALTPKPTLLSSAPKLAATAAVQASMPARRAVIPSPAPRPEAGAGGLQAGDRLALRVGDRELNAYVETVVVDTLVPVAHAIAGRK
Physico-chemical properties
protein length: 1440 AA
molecular weight: 149381.7 Da
isoelectric point: 9.89
hydropathy: 0.20
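The hydropathy figure is presumably a GRAVY score, that is, the mean Kyte-Doolittle hydropathy over all residues; the record does not say which scale was used, so this is an assumption. A minimal sketch of that calculation follows, with `gravy` as a hypothetical helper name and a short N-terminal fragment of the sequence above as input. The value for the fragment will not match the full-length figure.

```python
# Kyte-Doolittle hydropathy scale (assumed; the record does not name the scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    # A KeyError here means the sequence contains a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence.upper()) / len(sequence)


# First 22 residues of the protein sequence in this record.
fragment = "MATAAVEVGVGYVSVVPSARGF"
print(round(gravy(fragment), 2))
```

To compare against the reported value, pass the full 1440-residue sequence instead of the fragment.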
Representative Protein Details
Accession
6DolG
Protein name
6DolG
Sequence length
650 AA
Molecular weight
66632.99540 Da
Isoelectric point
9.56351
Sequence
MPVEVGVGYVSVVPETRGFGRLLNQQISGESARVGTSAGEDAGDGFLGGMGGKLKAGIVGVAAGAGGLFAVGFAEAVEQDKATAKLGASLGLTEKETARAGKIAGKVYASGYGESIDQVDESLKSLQRNGVAAISAPRKELVGLSQDALNLADVFDADVADSTKAVGKLLSTGLVKNAKAGFDLLTAGFQSGADQAGDLIDTVNEYSVQWKKAGLSGATAIGLINQGLKNGARDGDLVADSIKEFSIRAVDGSTTTAAGFKMLGLSADDMAGKFAKGGKSANGVLQLTLDRLRGIKDPVKQAQAATSLFGTQAEDLGKALFALDPSKAAAGLGKVGGAAGRMGKQLSNTASHDVEVFKRQALQGLANFASKYALPALRDVGKFLVKYVLPPTRTVGGALVSYLVPAAQAVGTAFEASGKWLQKYGAWLIPVGVAVVGLTTVMAAQAITTGITAGVFSVYRGAILLWSNATRIATGVQAAFNAVASANPVGLIIVGVLALVALLVVAYKKSDTFRSIVQATWSGIKAGWDVLWNGALKPGFGYLMTGLRAIGSAASWLWSSVLSPVFSAIALGAKILFAVVAVAVIGPWVLAFKVLGAVGGWLWTNALQPAFNGIAAGAMWLWNNAIGPAVRGIVTLFSWWWTGAKLYFGL
Other Proteins in cluster: phalp2_39606
Total (incl. this protein): 3; Avg length: 1093.3; Avg pI: 9.85

Protein ID    Length (AA)    pI
6DolG         650            9.56351
A0A0K1Y5T7    1190           10.09293
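The cluster averages can be reproduced from the member table once this protein (Q6VY42), which the table omits, is added back in. A minimal sketch, using the length and pI values given in this record:

```python
# (length in AA, isoelectric point) for each cluster member.
# Q6VY42 is this protein; its values come from the physico-chemical properties.
members = {
    "Q6VY42": (1440, 9.89),
    "6DolG": (650, 9.56351),
    "A0A0K1Y5T7": (1190, 10.09293),
}

avg_length = sum(length for length, _ in members.values()) / len(members)
avg_pi = sum(pi for _, pi in members.values()) / len(members)

print(f"{avg_length:.1f}")  # 1093.3
print(f"{avg_pi:.2f}")      # 9.85
```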
Similar Clusters (pHMM search)
#    Cluster                 # Members    Identity (%)    Alignment Length    E-value
1    phalp2_28298 (11z7g)    2            41.3            460                 2.479E-105
2    phalp2_38730 (11Bbs)    19           21.6            596                 3.425E-10
3    phalp2_40583 (4TpSg)    1            24.0            499                 7.869E-10

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

         Name                              Taxonomy ID    Lineage
Phage    Streptomyces phage VWB [NCBI]     10702          Veewebvirus > Veewebvirus vwb
Host     Streptomyces venezuelae [NCBI]    54571          Actinobacteria > Actinobacteria > Streptomycetales > Streptomycetaceae > Streptomyces >

Coding sequence (CDS)

CDS Source ID
CDS Source
AY320035 [NCBI]
CDS location
range 33423 -> 37745
strand +
CDS
ATGGCTACAGCCGCGGTCGAGGTCGGCGTCGGGTACGTATCGGTCGTCCCGAGCGCCCGGGGATTCGCCGAGGACCTGCAGCGGCAGATCGGCCGGCCGACGGTGCAGGTCGGCGCGGAGATCGGCGAGCAGTCCGGGCAGGCGGCGAGCCGCGGCATGCTGACGACGCTCGGCGCAGGCCTGAAAACCGGACTCGCGGCTGTCGCGGTCGCCGGTGCCGCCGTGTTCTCCGCCGCGTTCGTCCAGGCGATCGAGCAGGACAAGAGCAACGCCCGACTTGCGGCGCAGCTCGGGCTCGACCCGAAGCAGGCGAAGCGGCTCGGGGCCGTCGCGGGCGAGGTGTACGCCAAGGGCTACGGCGAGTCGATCGACCAGGTCAACGACTCCCTGCGGACGCTCGCGCAGAACGGCGTGATCGCGGTCAACGCGCCGCGTAAGGACATCGCGTCACTGACCAAGTCGGCGACGAACCTCGCCGAGGCGTTCGGCGTCGACGTGGGGGACGCCGCTCGCGCGGCCGGGCAGCTGATCCGTACGGGCATGGTCAAGGACGCGAAGCAGGCGTTCGACCTGATCACCCGCGGATTCCAGTCGGGCGCCGACAAGGGCGGCGATTTCATCGACACCCTGAACGAGTACTCGACTCAGTTCCGGAAGGCGGGGCTCGACGGACAGGCGGCGGTAGGCCTGATCACGCAGGCGCTGCAGGCCGGCGCCCGCGACGGCGACATCGCCGCCGACGCGATCAAGGAATTCTCGATCCGGGCAGTCGACGGATCCGAGTCCTCGGCCGAGGGATTCAAGGCGCTCGGGCTGAACGCCGTCACCATGGGGCAGCAGTTCGCCCGGGGCGGCACGGCCGCGAACGCGGTCCTCGACCTGACCCTTGACCGGCTGCGCGCGGTGAAGGACCCCGTAAAGCAGTCGCAGCTCGCCGTCGCCCTGTTCGGCACGCAGGCCGAAGACCTCGGCGCCGCACTGTTCGCGATGGACCCGTCGTCGGCGGTCTCCGCGCTGGGCAAGGTGGGCGGCGCCGCGGACCGGATGGGCGACGCGCTGCACAACACGGCGACGAACGACTTCGAGGTGTTCAAGCGGCAGGCGATGCAGAGCATCGTCGCCGTGATCGACCGCGAGGTGCTCCCGGTCCTCGCGAAGGTCGGGGCGCTGCTGCTGGAAAAGGTGCCGCCCGCACTGTCCGCCGTGTCGGACGCGTTCAGCGCCGGCGTCAACTGGGTGCGCGAGTACGGGGCGTGGCTGCTGCCGCTCGGCGTCGCGGTCGCCGGTCTGACGATCACGCTCAACGCATCGGCGATCGCAACCGGCGCAGTGACCGCCGTGTTCGCCGTATACCGCGGGGTGATCCTCGCGGCGGCGGCGGTGACGCGCGGCTACGCGATCGTGCAGGGGCTGCTCAACGCCGTGATGACGGCGAACCCCATCGGCCTGATCATCACCGGGATCGCGGCGCTCGTGACCCTGCTCGTCGTCGCCTACCAGAAGAGCGACACGTTCCGGGGGATCGTGCAGGCAGCGTGGGCCGGGATCAAGGCCGGTTGGGATGTCCTGTGGACGACGACGCTCAAGCCGGGGTTCGACGGGCTCATGGTCGGTCTGCGGGCGGTCGGCGACGCTGCCGTGTGGCTGTGGCAGACGATCCTGTCGCCGGTGTTCTCGGCGATCTGGACGGCCGCGAAGGTCTTGTTCGCGATCGTCGTGGTCGCGGTCGTCGTCCCGATCATCCTCGCTTTCAAGGTGCTGGCGGCGATCGGGGCATGGCTGTGGAAGTCGGCGCTCAAGCCCGCGTTCGACGGGATCGCAGCGGGCGCTATCTGGCTGTGGAAGTCGGCGCTCAAGCCGGCGTTCGACGCGATCGTGCTCACCCTGAAGGCTGTGGGAGCAGGCGCAACGTGGCTGTGGACGGCCGTCCTCGCGCCGAGCTTCCGGGCGATCGGCGCGGCCGGCGCGTGGATGTGGAACAGCGTCCTCAAGCCCGCGTT
CGGCGCGCTCATGGACGGGATGCGGGCGGTCGGCACCGCCCTGCAGTACGTGTGGCGCACGATCCTGTCGCCCGTGTTCACAGCCATCGGAGCGGCCGGTAAGTGGCTGTGGGACAACTCCCTCAAGCCCGTGTTCGACAAGATCAAGGCCGGCGCGAAGCTCATGGGGGCCGCCTTCGGGCTCGCCCGGGACGCGATCTCGAAGGCGTGGGACAGCGTCGTCAAGGTGTCGGCGAAGCCCGTGAATTTCATCATCCGGCACGTGTACACCGAGGGCATCAAGGCCGTTTGGGACAAGGTCGCCGGGTTCGTCGGTCTGGGCAAGCTGCCGGACGCGCCGAAACTGCTCGCCCGCGGCGGCCGAACGAGCGGCGGCATCCCGGGGCAGGACTCGATTCCCGCGCTGCTCATGGCCGATGAGTACGTCATCAAGCGGTCCTCGGCCCGCTCGGTGGGGTTCGGGGCACTGGAGCACATCAACCGCACCGGCGAGCTGCCGGTGCAGCGGTTCGCCGACGGCGGCATCGTCGGGTGGCTCGGGGACGCGGCGAAGAAGGTCGGCGGCGTCGTCATGAGCGGGGTTGACTTCCTGTCGGACCCCGGCCGCATGTGGGAGACGGCGACCAAGTCGGTCCGGGACATGATCGCCAAGATCGGGCAGTCCGGGATCGCGAAGATGCTCGCGCAGGTCCCGGGGAAGATGCTCGGCGGGCTGAAAGACAAGGTCCTCGACGCGGCCAAGTCGCTCTTCGGCGGGTCCTCAGGGGCCGCCGACATCGGCGGGTCCGGCGTGCAGCGTTGGTCGCCGGTGGTCCTGCAGGCCCTGCAAATGGTGGGCCAGTCCGCGAGCCTGCTGCCGGTTGTTCTGCGCCGGATGAACCAGGAGAGCGGCGGCAACCCGGCCGCGATCAACAACTGGGACATCAACGCGAAGAACGGCGTTCCGTCCAAGGGTCTGATGCAGGTGATCGACCCGACTTTCGCCGCGTACGCGGGCGCGCTCCGCGGCCGCGGCGTGTGGGACCCCCTCGCGAACATCTACGCCTCGATGCGGTACGCGCTGTCCCGCTACGGCTCGCTCGCCTCGGCCTACAACCGACCCGGCGGGTACGCCAACGGCGGCCGACCGCGCCCCGGCGAACTGGCGTGGGTGGGCGAACGCGGCCCCGAGCTCGTGCGCTTCGGGGGCGGCGACACCGAGGTGTTCGACCACGAACGATCAATGCAGATGGCGGCCGGGCTCGTGCCGCTGCGCGGGTTCGCCAAGGGCACCAAACCTCGATCCGCCCGCGTGCGCACGGACGCGCTGCTGCCGACTGCGCGCATCGTGGACGTGCAGCTGCCGAAGCCGTCCGCGAGCGATCTGGCGGCGTTCACGAAGTCGTTGACCGGGTCGGCGTCGGCGATCGGCACAGCCGCAGCGCAGCTGACGAAGCGCCTCATGCTGGCCGGCGGTGCAGGCCGGACCCTGGCGGCACAGGTCAGCAAGGTTTCAGCAGAGCTGCAGGGGCTCGCGACGAAGCGGGACCGGGTGAGCGGGATCATCGCCACGGCTCGGGAGGCGGCGGCCGGACAGCGGCAGACGGCCGCTGATTTCCTCGGCTTGTCGAACCTGTCGAGTACGGGCTCGGTCGAGGATCTGATCATGGGAATGGAGACGCGGCAGGACACCTTGCGAGGGTTCCAGTCCACGATCCGATCCCTCGAAAAGCGCGGGCTGAACCAGGACGCGATCCGGCAGCTCGTCGCCATGGGCCCCGACAGCACCCTTGCGAAGATGATCACGGAAGGGTCGGGGTCGGACATCCGGCGCATCAACGAGTTGACCAAGAGCGGCGGCACGCTTGCGACTGCCTTCGGGAACTCGATGGCTGACGCGATGTACGACAGCGGAAAGGATGCGGGGAAGGGGTTCCTTACCGGGCTGCTCTCACAGCAGCGGGACCTGCAGACGGCGATGACCCGTCTCGGAGCGAGCCTGATTCAGAACATCAAGGTCG
GCATGGGCCTCGCCAAACCGAGCCCGACGAAGGCGGCGAGCCGAAACCCTGTCCTGAAACCCACGAAGGCTCTGACCCCTAAGCCGACCCTGCTGAGCAGCGCCCCGAAGTTGGCCGCTACCGCCGCCGTGCAGGCGTCTATGCCTGCTCGGCGGGCTGTGATCCCCTCGCCGGCACCTCGGCCCGAGGCGGGCGCAGGCGGGCTGCAGGCGGGCGACCGGCTCGCGTTGCGAGTCGGCGACCGAGAGCTGAACGCGTACGTCGAGACCGTCGTCGTCGACACGCTCGTGCCGGTGGCGCACGCGATTGCTGGCAGAAAGTGA
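
The CDS coordinates can be cross-checked against the protein length. A minimal sketch, assuming the range is 1-based and inclusive (the usual GenBank convention; the record does not state it) and that the final codon is a stop codon, as the trailing TGA suggests:

```python
start, end = 33423, 37745  # CDS location from this record
protein_length = 1440      # protein length (AA) from this record

# Inclusive coordinates: both the first and the last base are counted.
cds_length = end - start + 1
codons, remainder = divmod(cds_length, 3)

print(cds_length)                    # 4323
print(remainder)                     # 0
print(codons - 1 == protein_length)  # True (1441 codons minus one stop codon)
```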

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000023037d_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
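
The confidence bands above can be expressed as a small lookup. A minimal sketch, with `plddt_band` as a hypothetical helper name; the legend uses strict inequalities and does not say where exact values of 90, 70, or 50 belong, so assigning them to the lower band is an assumption:

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"    # assumption: exactly 90 falls here
    if plddt > 50:
        return "Low"     # assumption: exactly 70 falls here
    return "Very low"    # assumption: exactly 50 falls here


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```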

The structures below correspond to the cluster representative (6DolG) rather than this protein.
PDB ID
6DolG
Method AlphaFoldv2
Resolution 65.16
Chain position -