Protein

Protein accession
A0A0K1Y5T7 [UniProt]
Representative
6DolG
Source
UniProt (cluster: phalp2_39606)
Protein name
Tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MPVEVGVGYVSIVPEMRGFGRLLDQQLSGQTARAGQAAGRSSGQGFLGGMGGVLKAGVVGLAAGAGALFAAGFSKAVEQDKSNAKLAAQLGLNEQQSARLGKVAGSVYAKGYGESVDQVNDALRALAQNGVAAVNAPKKDLAGLSKAALNLSETFGVDVSDSARAAGQMIRTGMAKDAKGAFDLLTRGYQSGADKAGDLADTVNEYGTQFRKLGLDGSQALGLVSQAIKAGARDSDVAADALKEFSIRAVDGSTTTADGFKMLGLSASDMAAKIGKGGTSASSALDLTLDKLRGIEDPVKRGQAAVALFGTQAEDLGDALFSMDPSKATAALGQVGGAADKMGKTLHNTASQNIEVFKRQALQGLANFADKYALPALSAFGGFLNDYVLPPAKVVGGELVDVLVPAVKATGSAFAGGAQWIKDYGAWLLPVGIAIGGIAVVAGASTIATWGMTAAFTVYRGIILATTAVTRGWAVAQGILNAVMSANPVGLIIVGILALGAALVVAYQKSETFRAIVQGAWQAIQTAATVAWTGFLKPALDGIWAGLQAVGSAASWLWSTVLSPVFGFIGTAAKVLFAVVVVAVVTPIVLAFKALGAVAGWLWNNAIGPAFRGIVALASWWWAGVKVYFGLAKAGVQAVGAVAMWLYRNAIQPAFRGIVSVASWWWAGVKVYFNLIKAGIRAVGSVATWLYRNAVQPAFRGIGAAGSWLWNKALKPVFDAGKRGVSLFGGAFRTARDAISKAWSQVSKIAAKPVNFIIEFVYTKGIKAVWDKVAGFVGLGKLPKAPKLLADGGRTRGGTPGRDSIPALMMADEFVVKRSSARKIGFSALNYMNETGELPVQRFAGGGVVDTLKGWGSSAVDWTVDKAKKVGGVVMDGVDFLSNPGRLWDKATGFIRKKIAAIGQSKWAQVAGKIPLKMLTGLKDKVVNAAKSAFDFGGGGSIGGSGVKRWSSVVLAALKMVGQPASLLPTVLRRMNQESGGNPRAINNWDINARNGVASRGLMQVIPPTFAAYAGRLRGRGIWDPLANIYASMRYAMSRYGSLSRAYNRPGGYASGGRPRPGEVAWVGERGPELLQFGGGSRIFDSRSSLGGMQALTLSTARLADEIGAARVGGIRATLSQLDAAALRSTAAAVAPVAAGQQTAPAGLTEGQQLALVLADGTQLDAYVDTRVDAGLTTARQRSRAGVKGR
Physico‐chemical
properties
protein length:1190 AA
molecular weight:123055,2 Da
isoelectric point:10,09
hydropathy:0,20
Representative Protein Details
Accession
6DolG
Protein name
6DolG
Sequence length
650 AA
Molecular weight
66632,99540 Da
Isoelectric point
9,56351
Sequence
MPVEVGVGYVSVVPETRGFGRLLNQQISGESARVGTSAGEDAGDGFLGGMGGKLKAGIVGVAAGAGGLFAVGFAEAVEQDKATAKLGASLGLTEKETARAGKIAGKVYASGYGESIDQVDESLKSLQRNGVAAISAPRKELVGLSQDALNLADVFDADVADSTKAVGKLLSTGLVKNAKAGFDLLTAGFQSGADQAGDLIDTVNEYSVQWKKAGLSGATAIGLINQGLKNGARDGDLVADSIKEFSIRAVDGSTTTAAGFKMLGLSADDMAGKFAKGGKSANGVLQLTLDRLRGIKDPVKQAQAATSLFGTQAEDLGKALFALDPSKAAAGLGKVGGAAGRMGKQLSNTASHDVEVFKRQALQGLANFASKYALPALRDVGKFLVKYVLPPTRTVGGALVSYLVPAAQAVGTAFEASGKWLQKYGAWLIPVGVAVVGLTTVMAAQAITTGITAGVFSVYRGAILLWSNATRIATGVQAAFNAVASANPVGLIIVGVLALVALLVVAYKKSDTFRSIVQATWSGIKAGWDVLWNGALKPGFGYLMTGLRAIGSAASWLWSSVLSPVFSAIALGAKILFAVVAVAVIGPWVLAFKVLGAVGGWLWTNALQPAFNGIAAGAMWLWNNAIGPAVRGIVTLFSWWWTGAKLYFGL
Other Proteins in cluster: phalp2_39606
Total (incl. this protein): 3 Avg length: 1093,3 Avg pI: 9,85

Protein ID Length (AA) pI
6DolG 650 9,56351
Q6VY42 1440 9,88540
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28298
11z7g
2 41,3% 460 2.479E-105
2 phalp2_38730
11Bbs
19 21,6% 596 3.425E-10
3 phalp2_40583
4TpSg
1 24,0% 499 7.869E-10

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Streptomyces phage SF1
[NCBI]
1690817 Sfunavirus > Sfunavirus SF1
Host Streptomyces flavovirens
[NCBI]
52258 Actinobacteria > Actinobacteria > Streptomycetales > Streptomycetaceae > Streptomyces > Streptomyces griseus group

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KT221033 [NCBI]
CDS location
range 27772 -> 31344
strand +
CDS
GTGCCGGTCGAGGTCGGCGTCGGGTACGTGTCCATCGTTCCCGAGATGCGCGGGTTCGGCCGACTGCTCGATCAGCAGCTCTCAGGCCAGACGGCCCGCGCCGGGCAGGCCGCGGGGCGGTCGTCGGGGCAGGGCTTCCTCGGCGGCATGGGCGGGGTCCTCAAGGCGGGGGTCGTGGGACTCGCTGCGGGCGCCGGGGCGCTGTTCGCCGCCGGGTTCTCGAAGGCCGTCGAACAGGACAAGAGCAACGCCAAGTTGGCGGCGCAGCTCGGTCTCAACGAGCAGCAGTCGGCGCGGCTCGGCAAGGTCGCGGGCTCGGTCTACGCGAAGGGCTACGGCGAGAGCGTCGATCAGGTCAACGACGCGTTGAGGGCGCTCGCGCAGAACGGCGTCGCGGCGGTCAACGCCCCGAAGAAGGATCTCGCCGGGCTGTCGAAGGCCGCGCTCAACCTGTCCGAGACGTTCGGCGTCGACGTCTCCGACTCGGCGCGCGCTGCGGGTCAGATGATCCGTACGGGCATGGCGAAGGACGCGAAGGGCGCGTTCGATCTGCTGACCCGCGGCTACCAGTCGGGCGCCGACAAGGCCGGGGACCTCGCCGACACGGTCAATGAGTACGGGACGCAGTTCCGGAAGCTCGGGCTCGACGGGTCACAGGCCCTCGGCCTCGTCTCGCAGGCGATCAAGGCCGGTGCACGCGACAGCGACGTCGCCGCCGACGCGCTGAAAGAGTTCTCGATCAGGGCGGTCGACGGCAGCACGACGACCGCCGACGGGTTCAAGATGCTCGGGCTGTCGGCGTCCGACATGGCCGCGAAGATCGGCAAGGGCGGCACCTCGGCGTCGTCCGCGCTCGACCTCACCCTCGACAAGCTGCGCGGGATCGAGGACCCCGTGAAGCGCGGGCAGGCCGCGGTCGCACTGTTCGGAACGCAGGCCGAAGACCTCGGCGACGCGTTGTTCTCGATGGACCCGAGCAAGGCGACGGCCGCGCTCGGCCAAGTCGGCGGCGCCGCCGACAAGATGGGCAAGACCCTGCACAACACGGCGTCGCAGAACATCGAGGTGTTCAAGCGGCAGGCGTTGCAGGGGCTCGCCAACTTCGCGGACAAGTACGCGCTACCGGCGCTGTCGGCGTTCGGCGGGTTCCTCAACGACTACGTGCTGCCGCCCGCGAAGGTCGTCGGCGGCGAGCTCGTCGACGTCCTCGTGCCCGCGGTCAAGGCCACCGGTTCGGCGTTCGCCGGCGGGGCCCAGTGGATCAAGGACTACGGGGCGTGGCTGCTGCCGGTCGGCATCGCGATCGGCGGCATCGCCGTCGTGGCCGGTGCGTCCACGATCGCCACATGGGGCATGACCGCCGCGTTCACGGTGTACCGCGGCATCATCCTCGCGACGACCGCGGTCACGCGCGGTTGGGCGGTCGCGCAGGGCATCTTGAACGCGGTCATGTCGGCGAACCCCGTCGGCTTGATCATCGTCGGGATTCTCGCGCTCGGCGCCGCCCTGGTCGTCGCCTACCAGAAGAGCGAGACGTTCCGGGCGATCGTTCAAGGCGCGTGGCAGGCCATCCAAACGGCCGCCACGGTTGCGTGGACCGGCTTCCTTAAGCCCGCGCTCGACGGCATTTGGGCCGGGCTACAGGCCGTCGGCTCGGCCGCCTCGTGGCTCTGGTCGACGGTGCTGTCGCCCGTGTTCGGGTTCATCGGCACGGCCGCGAAGGTGCTCTTCGCGGTGGTCGTCGTCGCCGTCGTGACGCCGATCGTCCTCGCGTTCAAGGCACTCGGCGCGGTCGCGGGGTGGCTGTGGAACAACGCGATCGGCCCAGCGTTCCGCGGGATCGTCGCGCTCGCCTCGTGGTGGTGGGCGGGCGTGAAGGTGTACTTCGGGCTCGCGAAGGCCGGCGTGCAGGCGGTCGGCGCGGTCGCGATGTGGCTGTACCGCAACGCGATCCAACCGGCGTTCAGGGGAATCGTTTCTGTCGCCTCGTGGTGGTGGGCCGGGGTCAAGGTCTATTTCAACTTGATCAAGGCCGGGATTCGCGCGGTCGGCTCGGTCGCAACGTGGCTGTACCGTAACGCCGTTCAACCGGCGTTCCGCGGGATCGGCGCGGCCGGTTCGTGGCTGTGGAACAAGGCGCTCAAGCCGGTGTTCGACGCCGGGAAGCGCGGCGTCTCACTCTTCGGCGGGGCGTTCCGCACCGCGCGAGACGCGATCAGCAAGGCATGGTCGCAGGTCTCGAAGATCGCGGCGAAGCCGGTGAACTTCATCATCGAATTCGTCTACACCAAGGGCATCAAGGCCGTTTGGGACAAGGTCGCCGGGTTCGTCGGCCTCGGCAAGCTGCCGAAGGCGCCGAAGCTGCTCGCCGACGGCGGCCGCACCCGCGGCGGCACCCCGGGCAGGGACTCGATCCCCGCGCTCATGATGGCCGACGAGTTCGTCGTGAAGCGCAGCAGCGCCCGGAAGATCGGGTTCAGTGCGCTCAACTACATGAACGAGACAGGCGAGTTGCCGGTGCAGCGCTTCGCGGGCGGCGGCGTTGTCGACACGCTCAAGGGGTGGGGCTCGTCGGCGGTCGACTGGACCGTCGACAAGGCGAAGAAGGTCGGCGGCGTCGTAATGGACGGCGTCGACTTCCTCTCGAACCCCGGGAGGCTGTGGGACAAGGCGACCGGGTTCATCCGGAAGAAGATCGCGGCGATCGGGCAATCGAAGTGGGCGCAGGTCGCCGGGAAGATCCCGCTCAAGATGCTCACGGGGTTGAAGGACAAGGTCGTCAACGCGGCCAAGTCGGCGTTCGACTTCGGCGGCGGGGGCAGCATCGGCGGCTCGGGCGTCAAGCGCTGGTCGTCCGTCGTGCTCGCGGCGCTCAAGATGGTTGGGCAGCCCGCGAGTCTGCTGCCGACCGTGCTGCGCCGCATGAATCAGGAGAGCGGCGGCAACCCCCGCGCGATCAACAATTGGGACATCAACGCCCGCAACGGGGTCGCGAGTAGGGGCCTCATGCAGGTGATCCCGCCGACCTTCGCCGCGTACGCGGGCAGGCTGCGCGGCCGGGGCATCTGGGACCCGTTGGCCAACATCTACGCGAGCATGCGCTACGCCATGTCGAGGTACGGCTCGCTGTCGCGGGCGTACAACCGGCCGGGCGGCTACGCCTCGGGCGGGCGGCCGCGGCCGGGCGAGGTCGCGTGGGTCGGCGAACGGGGCCCCGAGCTGCTCCAGTTCGGCGGCGGCTCGCGCATCTTCGACAGCCGCTCGTCGCTCGGCGGCATGCAGGCGCTCACCTTGTCAACGGCCCGGCTCGCCGACGAGATCGGCGCCGCCCGGGTCGGCGGTATCCGGGCGACGCTCTCGCAGCTCGACGCCGCGGCGCTGCGCAGCACGGCCGCGGCAGTCGCCCCGGTCGCCGCCGGTCAGCAGACGGCCCCGGCCGGTCTGACCGAGGGGCAGCAGCTCGCGCTCGTACTCGCCGACGGCACACAGCTCGACGCCTACGTCGACACCCGCGTCGACGCGGGTCTGACCACCGCCCGGCAGCGCAGCCGCGCGGGCGTGAAGGGGAGGTAA

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0001b538b3_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6DolG) rather than this protein.
PDB ID
6DolG
Method AlphaFoldv2
Resolution 65.16
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50