Protein

Protein accession
A0A8S5M673 [UniProt]
Representative
5tUfk
Source
UniProt (cluster: phalp2_30529)
Protein name
Tape measure protein N-terminal domain-containing protein
Lysin probability
93%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MSAFRNAITATTNAWKAQEIALKNSGDYTQAAKVKLDGLTQAMELQRAKIEELRTRQSGLDQSNKEQADQWLKLEKQISQANRQLASYESQAERAKNYYAYQSSGLADLQHSYKLTARSNEIFVEQLKAEGKNVKAQRAELDGMKTSLTSLKEQYNAQKQVVAQVAENSGKASNAYKEQKNKLDELRLEITQTDGKIKVLSKDLDKNHPAFLSGVREKLEKSNEAAEKSHSLFSKIFSANVAANLFTSALGKVHESFSLLTESVKEYDDKQQTMVATWDTLTGSAGKGQNMVNIGNKLAAAYGQNIEVVDELNQQFYHVFDNAPRTEKLTKSVLTLGDTLNMTDENVKRLGLNFTHMLSSGRMQLGDFNMITDQLPMYGEKLLEYERKVQNNSKLTMETLRDQMSAGKISAKDAETVMNELGDKYQKASENLMKTGPGMVRAVKTQAPALLEAFYKPIREMKNPLLGQVSKWVMADETKKEFSKLGDTLTSQVTSVTKALSKNSNFDFGKMMNSGLEKLNSLIIKLGQNAVKHKGDLKEMFNTFKSFSGTSFKVFVQTLKDLEPILKIIGGFAAKHPKAFAQTAAALLLINKATTILLPSVKLLQSTFKAVKFPITAIQKTRDGLKELPDKYNRVSDSIDAYGNKLKKVPTKVKTKVSAPVSAAKEKISGYNRKLKQVPKTVKTKAVASTTSAKGKISSFNATVKRVPRKIKVKASAETASAKISIRNLGMTAKAAAVTSKSAFTAIKLSAVTSIKAIGLAARANPLGALITGIELATAAFSFLYQHSKKFRSFCNGVAKNAKKAWSSIKKGAAEGWQDLKKKASDGADNVKRGWDRLSSDTARSAQQMFNKHKSTFQAGYKVIEDRTTTWHDLVSGRWDRLGEDTERTAQDMFKFNRKIFSDMYNKLNDMTGGRLGDMLKIWQDIFGKIQDAVGNAVGSVHRHFVDLVNGVLKPFKTMIDDVKGGINWVLDKVGGSKIGGDFSISMPSYANGTNDTHPGGFAKVNDGLTAHYREMFMTKDGQVGMFPAKRNLILPLPKGTSVLDGERSYQLSRMFGMIPHYADGVGNAFSSLLSKVGDATDDVLGMVDKIMSKPVEFMESVFQKFVHVSTPVKFAAELVKDVPVYIAKQMGNWIKKQFETLANPGGAGVERWRPYIIKAFKTLGVEATATKVSKLLKQIQTESGGNPTVPQKVWDINMANGNPAQGLLQFIPSTFNHWAIPGHKQILNGYDQILAAINALEHGGEGGWGNVGQGHGWANGGLISNHGVYEIAEKNMPEYVIPTDISRRSRAYQLLGEIVTRFRNDDPTLSHNAQYVGGNDRQSDALSHKLDELLSKFDILLRLSGDQVDAIKAQGSLDMQQLYKKEAKDARMRQLGF
Physico‐chemical
properties
protein length:1378 AA
molecular weight:152080,3 Da
isoelectric point:9,63
hydropathy:-0,44
Representative Protein Details
Accession
5tUfk
Protein name
5tUfk
Sequence length
865 AA
Molecular weight
95069,37010 Da
Isoelectric point
9,60561
Sequence
LIKPSKGATEALKSIGLSTKDFTDKNGNMKSMSDIFKELNEHTKNLSKQEKGALFKAIFGATGESAAIILSDSASEMEKLNKQVEKSYKGQGYVQRLANKNMGSVKMETAQLKESGEAASLMIGKALLPALRDASTAMAKAFNSKDGQKGLKVIAKGVGDFAKVVVDLVIALGKHTTTIKVFGATLGTAFAIFKTMKLVNTIKMTVTTFKELTLATKAFKIAMAGGGIALIITGVVIALKELYKHNKKFRNFVNGLAKDAKKFAKDFGKAFKDLGTLIVKRTKETNKEIGNWWKSTKKSFADGWKDLKKKTGDGIDAVKRGWDKLSGETVRSAQQMFNKHKSTFQAGYKVIEDRTTTWHDLVSGRWDRLGEDTERTAQDMFKFNRKIFSDMYNKLNDMTGGRLGDMLKIWQDIFGKIQDAVGNAVGSVHRHFVDLVNGVLKPFKTMIDDVKGGINWILDKVGGSKIGGDFSISMPSYANGTNDTHPGGFAKVNDGLTAHYREMFMTKDGQVGMFPAKRNLILPLPKGTSVLDGERSYQLSRMFGMIPHYADGVGNAFSSLLSKVGDATDDILGMVDKIMSKPVEFMESVFQKFVHVSTPVKFAAELVKDVPVYIAKQMGNWIKKQFETLANPGGAGVERWRPYIIKAFKTLGVEATATKVSKLLKQIQTESGGNPTVPQKVWDINMANGNPAQGLLQFIPSTFNHWAIPGHKQILNGYDQILAAINALEHGGEGGWGNVGQGHGWANGGLISNHGVYEIAEKNMPEYVIPTDISRRSRAYQLLGEIVTRFRNDDPTLGHNLQSVGNSDRQSDALSHKLDELLSKFDILLRLSGDQVDAIKAQGSLDMQQLYKKEAKDARMRQLGF
Other Proteins in cluster: phalp2_30529
Total (incl. this protein): 11 Avg length: 909,2 Avg pI: 9,67

Protein ID Length (AA) pI
5tUfk 865 9,60561
1FVMM 542 8,99684
1jEIz 912 9,87818
1orzq 1107 9,75221
5RxjE 865 9,61889
5tTT9 704 9,96180
60Qs4 699 9,78606
7py3R 998 9,78748
7x94O 965 9,68813
D2KRB7 966 9,65506
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23533
7euEk
5 32,9% 805 7.394E-136
2 phalp2_20979
7wDdb
23 29,8% 697 2.562E-131
3 phalp2_30711
7foWC
101 24,4% 1252 1.597E-99
4 phalp2_13297
3xFth
5 23,9% 936 1.885E-89
5 phalp2_18933
28wdb
2 26,1% 669 1.649E-76
6 phalp2_12326
7wR7Q
26 23,9% 894 3.209E-69
7 phalp2_380
7hjyf
2 26,0% 676 5.764E-69
8 phalp2_427
7zt3x
4 27,3% 585 3.338E-68
9 phalp2_10042
7lCpJ
2 26,2% 723 9.348E-62
10 phalp2_36879
7rXBb
1 25,9% 635 7.039E-55

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Siphoviridae sp. ctDwe1
[NCBI]
2826200 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
BK014827 [NCBI]
CDS location
range 26115 -> 30251
strand +
CDS
ATGTCCGCCTTCCGAAACGCAATTACGGCAACGACCAATGCCTGGAAGGCTCAAGAAATAGCCTTAAAAAACAGCGGCGACTATACGCAGGCGGCTAAGGTAAAGCTTGATGGCCTTACTCAAGCAATGGAACTGCAGCGTGCCAAGATTGAAGAACTGCGCACTCGGCAATCAGGGCTTGATCAATCTAATAAGGAACAAGCTGATCAATGGTTAAAGCTTGAGAAGCAAATTAGTCAAGCTAATCGGCAATTGGCTAGCTATGAATCACAAGCTGAACGAGCGAAAAATTACTATGCTTATCAGTCAAGTGGTTTGGCGGATCTTCAGCATTCTTACAAGCTAACGGCAAGAAGTAATGAGATTTTCGTCGAACAACTGAAAGCAGAAGGGAAAAACGTTAAGGCTCAGCGAGCAGAACTTGATGGAATGAAAACTTCCCTCACTTCGTTAAAAGAGCAATATAATGCTCAAAAGCAAGTGGTGGCTCAAGTTGCTGAAAACAGTGGCAAGGCTTCGAATGCTTATAAAGAGCAGAAGAATAAACTTGATGAACTGCGATTAGAGATTACTCAAACTGATGGAAAGATTAAGGTTTTGAGTAAGGATTTGGATAAGAATCATCCAGCGTTTCTTTCTGGTGTTCGCGAAAAGCTCGAAAAGTCAAACGAAGCTGCGGAAAAAAGCCACTCGCTTTTTTCTAAGATTTTCTCGGCTAATGTTGCTGCTAACCTTTTTACTTCTGCATTAGGCAAGGTTCATGAGTCCTTCTCATTATTAACCGAATCAGTCAAAGAATATGATGATAAGCAGCAAACGATGGTAGCTACTTGGGATACTCTTACTGGCAGTGCTGGTAAAGGGCAGAACATGGTTAACATCGGGAACAAACTTGCGGCAGCTTATGGTCAAAATATTGAGGTCGTTGATGAGTTGAATCAGCAATTCTATCACGTGTTTGATAATGCACCAAGAACCGAAAAGCTTACTAAGTCTGTCTTGACGCTTGGTGATACTTTAAATATGACTGATGAGAATGTTAAACGTCTTGGGCTTAACTTCACTCACATGTTATCGAGTGGTCGTATGCAGCTTGGCGACTTTAATATGATCACTGATCAATTGCCTATGTATGGTGAAAAACTGCTTGAATATGAGCGTAAGGTTCAAAATAACAGCAAGCTGACGATGGAAACTCTTCGCGATCAAATGTCGGCCGGAAAGATTTCAGCCAAGGATGCGGAAACAGTTATGAATGAGCTTGGCGATAAATATCAAAAAGCTTCAGAAAACCTTATGAAGACTGGACCTGGTATGGTTCGAGCGGTTAAGACGCAAGCACCCGCTTTACTAGAAGCATTCTACAAACCAATTCGAGAGATGAAAAATCCATTGCTTGGGCAAGTTTCAAAGTGGGTTATGGCTGATGAAACGAAAAAAGAATTTTCTAAATTAGGTGATACCCTTACTTCTCAAGTAACAAGCGTTACAAAAGCTCTTAGTAAAAACAGTAACTTTGATTTCGGTAAGATGATGAATAGCGGTTTGGAAAAGCTAAACAGTTTGATAATTAAGCTTGGCCAAAACGCTGTTAAACATAAAGGCGATCTGAAAGAGATGTTTAATACTTTTAAATCTTTTTCAGGAACATCTTTTAAGGTTTTCGTTCAAACTTTGAAAGACTTAGAACCAATCTTAAAAATCATTGGTGGTTTTGCAGCTAAGCATCCAAAGGCATTTGCTCAGACTGCAGCAGCCCTTTTGCTTATTAATAAGGCGACGACGATCTTGTTGCCATCGGTTAAGCTTTTGCAATCTACTTTCAAAGCGGTGAAGTTTCCAATTACTGCTATTCAAAAAACTCGAGACGGGCTTAAAGAGCTGCCAGATAAATACAATCGCGTTAGTGATAGCATTGATGCTTATGGGAACAAGCTTAAGAAAGTACCTACCAAAGTTAAGACTAAGGTATCTGCTCCAGTATCAGCTGCTAAAGAAAAGATTAGTGGATACAACAGAAAGCTTAAACAAGTTCCTAAAACTGTCAAAACGAAAGCAGTTGCTTCAACAACTTCTGCTAAAGGTAAGATAAGTTCTTTTAATGCTACTGTTAAAAGGGTTCCTCGAAAGATTAAGGTTAAGGCAAGTGCTGAAACAGCAAGCGCCAAGATCAGCATAAGAAATCTTGGCATGACTGCTAAAGCCGCAGCGGTTACGTCGAAAAGTGCCTTTACAGCTATTAAATTATCAGCTGTGACATCGATTAAAGCGATTGGCTTAGCTGCGCGTGCAAATCCATTAGGTGCATTAATTACAGGAATTGAGTTAGCTACGGCTGCATTCAGTTTCTTATACCAACATAGCAAGAAGTTTCGTAGTTTCTGTAATGGTGTTGCTAAAAACGCTAAAAAAGCATGGTCTTCTATTAAAAAAGGAGCTGCTGAAGGTTGGCAAGACCTTAAGAAGAAGGCTAGTGATGGAGCTGATAATGTAAAACGCGGCTGGGACAGATTGAGCAGCGATACTGCCCGCTCGGCTCAGCAGATGTTTAACAAGCATAAGTCAACATTCCAAGCGGGCTATAAAGTTATCGAGGACCGGACTACTACTTGGCATGATCTTGTTTCGGGGCGTTGGGATCGCTTAGGCGAAGACACCGAACGTACTGCACAGGACATGTTTAAGTTCAACCGCAAGATCTTTTCTGATATGTACAACAAGCTGAATGACATGACGGGCGGTCGTTTAGGCGACATGCTTAAGATTTGGCAGGACATTTTTGGCAAGATCCAAGACGCGGTTGGCAATGCAGTTGGCAGCGTACATCGCCATTTCGTTGATTTGGTAAATGGTGTACTTAAGCCATTTAAGACCATGATCGATGACGTCAAAGGCGGTATCAACTGGGTTCTTGATAAAGTCGGTGGATCTAAGATTGGTGGAGATTTCAGCATTTCGATGCCAAGCTACGCCAATGGTACCAACGACACTCATCCCGGTGGCTTTGCCAAAGTCAATGACGGCTTGACGGCACATTACCGTGAGATGTTTATGACCAAAGATGGCCAAGTTGGGATGTTCCCGGCTAAGCGCAACTTGATTCTGCCGTTGCCAAAAGGTACCAGTGTGCTTGATGGCGAACGGAGCTATCAGCTGTCGCGTATGTTCGGGATGATTCCGCATTATGCAGACGGTGTGGGCAATGCTTTCAGCTCACTGCTCAGCAAAGTCGGGGATGCAACTGATGATGTTTTAGGCATGGTCGACAAAATCATGTCAAAGCCGGTCGAATTTATGGAATCCGTATTCCAAAAATTTGTCCACGTCAGCACACCGGTTAAGTTTGCTGCAGAGTTAGTCAAGGACGTACCGGTTTATATCGCTAAACAGATGGGCAACTGGATCAAGAAACAGTTTGAGACGCTTGCCAACCCAGGCGGTGCCGGGGTCGAACGTTGGCGGCCTTACATCATCAAAGCCTTCAAGACTTTGGGTGTTGAAGCTACTGCTACGAAGGTTTCTAAACTGTTAAAACAGATCCAGACCGAATCTGGCGGTAATCCAACCGTACCGCAAAAGGTCTGGGACATTAACATGGCGAACGGCAATCCCGCGCAAGGGTTGCTGCAGTTCATCCCAAGTACTTTTAACCATTGGGCAATTCCTGGCCACAAACAAATCCTTAATGGATATGACCAGATCTTAGCTGCTATCAACGCTCTGGAACATGGCGGTGAAGGTGGCTGGGGCAATGTCGGTCAAGGCCACGGCTGGGCTAATGGTGGTCTGATCTCTAACCATGGCGTTTATGAGATTGCAGAAAAGAACATGCCAGAATATGTCATCCCGACCGACATCAGCAGACGGTCACGTGCATATCAGCTGCTTGGCGAGATCGTTACCAGATTCCGCAACGATGATCCTACTTTGAGCCATAATGCTCAATATGTCGGCGGAAATGATCGCCAATCAGACGCTCTGAGCCATAAGTTAGACGAATTACTGTCCAAGTTTGATATTTTGTTGCGTCTGAGCGGCGATCAGGTAGACGCGATTAAAGCTCAAGGCAGTCTCGACATGCAACAGTTGTACAAAAAGGAAGCAAAAGACGCGCGGATGCGTCAATTAGGATTCTAG

Gene Ontology

Description Category Evidence (source)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5tUfk) rather than this protein.
PDB ID
5tUfk
Method AlphaFoldv2
Resolution 52.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50