Protein
- Protein accession
- A0A8S5M673 [UniProt]
- Representative
- 5tUfk
- Source
- UniProt (cluster: phalp2_30529)
- Protein name
- Tape measure protein N-terminal domain-containing protein
- Lysin probability
- 93%
- PhaLP type
- VAL (probability: 99%, predicted by ML model)
- Protein sequence
-
MSAFRNAITATTNAWKAQEIALKNSGDYTQAAKVKLDGLTQAMELQRAKIEELRTRQSGLDQSNKEQADQWLKLEKQISQANRQLASYESQAERAKNYYAYQSSGLADLQHSYKLTARSNEIFVEQLKAEGKNVKAQRAELDGMKTSLTSLKEQYNAQKQVVAQVAENSGKASNAYKEQKNKLDELRLEITQTDGKIKVLSKDLDKNHPAFLSGVREKLEKSNEAAEKSHSLFSKIFSANVAANLFTSALGKVHESFSLLTESVKEYDDKQQTMVATWDTLTGSAGKGQNMVNIGNKLAAAYGQNIEVVDELNQQFYHVFDNAPRTEKLTKSVLTLGDTLNMTDENVKRLGLNFTHMLSSGRMQLGDFNMITDQLPMYGEKLLEYERKVQNNSKLTMETLRDQMSAGKISAKDAETVMNELGDKYQKASENLMKTGPGMVRAVKTQAPALLEAFYKPIREMKNPLLGQVSKWVMADETKKEFSKLGDTLTSQVTSVTKALSKNSNFDFGKMMNSGLEKLNSLIIKLGQNAVKHKGDLKEMFNTFKSFSGTSFKVFVQTLKDLEPILKIIGGFAAKHPKAFAQTAAALLLINKATTILLPSVKLLQSTFKAVKFPITAIQKTRDGLKELPDKYNRVSDSIDAYGNKLKKVPTKVKTKVSAPVSAAKEKISGYNRKLKQVPKTVKTKAVASTTSAKGKISSFNATVKRVPRKIKVKASAETASAKISIRNLGMTAKAAAVTSKSAFTAIKLSAVTSIKAIGLAARANPLGALITGIELATAAFSFLYQHSKKFRSFCNGVAKNAKKAWSSIKKGAAEGWQDLKKKASDGADNVKRGWDRLSSDTARSAQQMFNKHKSTFQAGYKVIEDRTTTWHDLVSGRWDRLGEDTERTAQDMFKFNRKIFSDMYNKLNDMTGGRLGDMLKIWQDIFGKIQDAVGNAVGSVHRHFVDLVNGVLKPFKTMIDDVKGGINWVLDKVGGSKIGGDFSISMPSYANGTNDTHPGGFAKVNDGLTAHYREMFMTKDGQVGMFPAKRNLILPLPKGTSVLDGERSYQLSRMFGMIPHYADGVGNAFSSLLSKVGDATDDVLGMVDKIMSKPVEFMESVFQKFVHVSTPVKFAAELVKDVPVYIAKQMGNWIKKQFETLANPGGAGVERWRPYIIKAFKTLGVEATATKVSKLLKQIQTESGGNPTVPQKVWDINMANGNPAQGLLQFIPSTFNHWAIPGHKQILNGYDQILAAINALEHGGEGGWGNVGQGHGWANGGLISNHGVYEIAEKNMPEYVIPTDISRRSRAYQLLGEIVTRFRNDDPTLSHNAQYVGGNDRQSDALSHKLDELLSKFDILLRLSGDQVDAIKAQGSLDMQQLYKKEAKDARMRQLGF
- Physico-chemical properties
- protein length: 1378 AA; molecular weight: 152080.3 Da; isoelectric point: 9.63; hydropathy: -0.44
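The page does not say how the hydropathy value is defined. A common definition is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The sketch below assumes that definition; the function name `gravy` is illustrative and not part of PhaLP.

```python
# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy (GRAVY) of a protein sequence.

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Small input only: the first ten residues of the protein sequence above.
print(round(gravy("MSAFRNAITA"), 3))
```

Passing the full 1378-residue sequence gives a value that can be compared with the hydropathy listed above, provided the page uses the same scale.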
Representative Protein Details
- Accession
- 5tUfk
- Protein name
- 5tUfk
- Sequence length
- 865 AA
- Molecular weight
- 95069.37010 Da
- Isoelectric point
- 9.60561
- Sequence
-
LIKPSKGATEALKSIGLSTKDFTDKNGNMKSMSDIFKELNEHTKNLSKQEKGALFKAIFGATGESAAIILSDSASEMEKLNKQVEKSYKGQGYVQRLANKNMGSVKMETAQLKESGEAASLMIGKALLPALRDASTAMAKAFNSKDGQKGLKVIAKGVGDFAKVVVDLVIALGKHTTTIKVFGATLGTAFAIFKTMKLVNTIKMTVTTFKELTLATKAFKIAMAGGGIALIITGVVIALKELYKHNKKFRNFVNGLAKDAKKFAKDFGKAFKDLGTLIVKRTKETNKEIGNWWKSTKKSFADGWKDLKKKTGDGIDAVKRGWDKLSGETVRSAQQMFNKHKSTFQAGYKVIEDRTTTWHDLVSGRWDRLGEDTERTAQDMFKFNRKIFSDMYNKLNDMTGGRLGDMLKIWQDIFGKIQDAVGNAVGSVHRHFVDLVNGVLKPFKTMIDDVKGGINWILDKVGGSKIGGDFSISMPSYANGTNDTHPGGFAKVNDGLTAHYREMFMTKDGQVGMFPAKRNLILPLPKGTSVLDGERSYQLSRMFGMIPHYADGVGNAFSSLLSKVGDATDDILGMVDKIMSKPVEFMESVFQKFVHVSTPVKFAAELVKDVPVYIAKQMGNWIKKQFETLANPGGAGVERWRPYIIKAFKTLGVEATATKVSKLLKQIQTESGGNPTVPQKVWDINMANGNPAQGLLQFIPSTFNHWAIPGHKQILNGYDQILAAINALEHGGEGGWGNVGQGHGWANGGLISNHGVYEIAEKNMPEYVIPTDISRRSRAYQLLGEIVTRFRNDDPTLGHNLQSVGNSDRQSDALSHKLDELLSKFDILLRLSGDQVDAIKAQGSLDMQQLYKKEAKDARMRQLGF
Other Proteins in cluster: phalp2_30529
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_23533 (7euEk) | 5 | 32.9 | 805 | 7.394E-136 |
| 2 | phalp2_20979 (7wDdb) | 23 | 29.8 | 697 | 2.562E-131 |
| 3 | phalp2_30711 (7foWC) | 101 | 24.4 | 1252 | 1.597E-99 |
| 4 | phalp2_13297 (3xFth) | 5 | 23.9 | 936 | 1.885E-89 |
| 5 | phalp2_18933 (28wdb) | 2 | 26.1 | 669 | 1.649E-76 |
| 6 | phalp2_12326 (7wR7Q) | 26 | 23.9 | 894 | 3.209E-69 |
| 7 | phalp2_380 (7hjyf) | 2 | 26.0 | 676 | 5.764E-69 |
| 8 | phalp2_427 (7zt3x) | 4 | 27.3 | 585 | 3.338E-68 |
| 9 | phalp2_10042 (7lCpJ) | 2 | 26.2 | 723 | 9.348E-62 |
| 10 | phalp2_36879 (7rXBb) | 1 | 25.9 | 635 | 7.039E-55 |
Domains
Domains [InterPro]
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Siphoviridae sp. ctDwe1 [NCBI] | 2826200 | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
CDS Source ID
CDS Source
BK014827 [NCBI]
CDS location
range 26115 -> 30251
strand +
CDS
ATGTCCGCCTTCCGAAACGCAATTACGGCAACGACCAATGCCTGGAAGGCTCAAGAAATAGCCTTAAAAAACAGCGGCGACTATACGCAGGCGGCTAAGGTAAAGCTTGATGGCCTTACTCAAGCAATGGAACTGCAGCGTGCCAAGATTGAAGAACTGCGCACTCGGCAATCAGGGCTTGATCAATCTAATAAGGAACAAGCTGATCAATGGTTAAAGCTTGAGAAGCAAATTAGTCAAGCTAATCGGCAATTGGCTAGCTATGAATCACAAGCTGAACGAGCGAAAAATTACTATGCTTATCAGTCAAGTGGTTTGGCGGATCTTCAGCATTCTTACAAGCTAACGGCAAGAAGTAATGAGATTTTCGTCGAACAACTGAAAGCAGAAGGGAAAAACGTTAAGGCTCAGCGAGCAGAACTTGATGGAATGAAAACTTCCCTCACTTCGTTAAAAGAGCAATATAATGCTCAAAAGCAAGTGGTGGCTCAAGTTGCTGAAAACAGTGGCAAGGCTTCGAATGCTTATAAAGAGCAGAAGAATAAACTTGATGAACTGCGATTAGAGATTACTCAAACTGATGGAAAGATTAAGGTTTTGAGTAAGGATTTGGATAAGAATCATCCAGCGTTTCTTTCTGGTGTTCGCGAAAAGCTCGAAAAGTCAAACGAAGCTGCGGAAAAAAGCCACTCGCTTTTTTCTAAGATTTTCTCGGCTAATGTTGCTGCTAACCTTTTTACTTCTGCATTAGGCAAGGTTCATGAGTCCTTCTCATTATTAACCGAATCAGTCAAAGAATATGATGATAAGCAGCAAACGATGGTAGCTACTTGGGATACTCTTACTGGCAGTGCTGGTAAAGGGCAGAACATGGTTAACATCGGGAACAAACTTGCGGCAGCTTATGGTCAAAATATTGAGGTCGTTGATGAGTTGAATCAGCAATTCTATCACGTGTTTGATAATGCACCAAGAACCGAAAAGCTTACTAAGTCTGTCTTGACGCTTGGTGATACTTTAAATATGACTGATGAGAATGTTAAACGTCTTGGGCTTAACTTCACTCACATGTTATCGAGTGGTCGTATGCAGCTTGGCGACTTTAATATGATCACTGATCAATTGCCTATGTATGGTGAAAAACTGCTTGAATATGAGCGTAAGGTTCAAAATAACAGCAAGCTGACGATGGAAACTCTTCGCGATCAAATGTCGGCCGGAAAGATTTCAGCCAAGGATGCGGAAACAGTTATGAATGAGCTTGGCGATAAATATCAAAAAGCTTCAGAAAACCTTATGAAGACTGGACCTGGTATGGTTCGAGCGGTTAAGACGCAAGCACCCGCTTTACTAGAAGCATTCTACAAACCAATTCGAGAGATGAAAAATCCATTGCTTGGGCAAGTTTCAAAGTGGGTTATGGCTGATGAAACGAAAAAAGAATTTTCTAAATTAGGTGATACCCTTACTTCTCAAGTAACAAGCGTTACAAAAGCTCTTAGTAAAAACAGTAACTTTGATTTCGGTAAGATGATGAATAGCGGTTTGGAAAAGCTAAACAGTTTGATAATTAAGCTTGGCCAAAACGCTGTTAAACATAAAGGCGATCTGAAAGAGATGTTTAATACTTTTAAATCTTTTTCAGGAACATCTTTTAAGGTTTTCGTTCAAACTTTGAAAGACTTAGAACCAATCTTAAAAATCATTGGTGGTTTTGCAGCTAAGCATCCAAAGGCATTTGCTCAGACTGCAGCAGCCCTTTTGCTTATTAATAAGGCGACGACGATCTTGTTGCCATCGGTTAAGCTTTTGCAATCTACTTTCAAAGCGGTGAAGTTTCCAATTACTGCTATTCAAAAAACTCGAGACGGGCTTAAAGAGCTGCCAGATAAATACAATCGCGTTAGTGATAGCATTGATGCTTATGGGAACAAGCTTAAGAAAGTACCTACCAAAGTTAAGACTAAGGTATCTGCTCCAGTATCAGCTGCTAAAGAAAA
GATTAGTGGATACAACAGAAAGCTTAAACAAGTTCCTAAAACTGTCAAAACGAAAGCAGTTGCTTCAACAACTTCTGCTAAAGGTAAGATAAGTTCTTTTAATGCTACTGTTAAAAGGGTTCCTCGAAAGATTAAGGTTAAGGCAAGTGCTGAAACAGCAAGCGCCAAGATCAGCATAAGAAATCTTGGCATGACTGCTAAAGCCGCAGCGGTTACGTCGAAAAGTGCCTTTACAGCTATTAAATTATCAGCTGTGACATCGATTAAAGCGATTGGCTTAGCTGCGCGTGCAAATCCATTAGGTGCATTAATTACAGGAATTGAGTTAGCTACGGCTGCATTCAGTTTCTTATACCAACATAGCAAGAAGTTTCGTAGTTTCTGTAATGGTGTTGCTAAAAACGCTAAAAAAGCATGGTCTTCTATTAAAAAAGGAGCTGCTGAAGGTTGGCAAGACCTTAAGAAGAAGGCTAGTGATGGAGCTGATAATGTAAAACGCGGCTGGGACAGATTGAGCAGCGATACTGCCCGCTCGGCTCAGCAGATGTTTAACAAGCATAAGTCAACATTCCAAGCGGGCTATAAAGTTATCGAGGACCGGACTACTACTTGGCATGATCTTGTTTCGGGGCGTTGGGATCGCTTAGGCGAAGACACCGAACGTACTGCACAGGACATGTTTAAGTTCAACCGCAAGATCTTTTCTGATATGTACAACAAGCTGAATGACATGACGGGCGGTCGTTTAGGCGACATGCTTAAGATTTGGCAGGACATTTTTGGCAAGATCCAAGACGCGGTTGGCAATGCAGTTGGCAGCGTACATCGCCATTTCGTTGATTTGGTAAATGGTGTACTTAAGCCATTTAAGACCATGATCGATGACGTCAAAGGCGGTATCAACTGGGTTCTTGATAAAGTCGGTGGATCTAAGATTGGTGGAGATTTCAGCATTTCGATGCCAAGCTACGCCAATGGTACCAACGACACTCATCCCGGTGGCTTTGCCAAAGTCAATGACGGCTTGACGGCACATTACCGTGAGATGTTTATGACCAAAGATGGCCAAGTTGGGATGTTCCCGGCTAAGCGCAACTTGATTCTGCCGTTGCCAAAAGGTACCAGTGTGCTTGATGGCGAACGGAGCTATCAGCTGTCGCGTATGTTCGGGATGATTCCGCATTATGCAGACGGTGTGGGCAATGCTTTCAGCTCACTGCTCAGCAAAGTCGGGGATGCAACTGATGATGTTTTAGGCATGGTCGACAAAATCATGTCAAAGCCGGTCGAATTTATGGAATCCGTATTCCAAAAATTTGTCCACGTCAGCACACCGGTTAAGTTTGCTGCAGAGTTAGTCAAGGACGTACCGGTTTATATCGCTAAACAGATGGGCAACTGGATCAAGAAACAGTTTGAGACGCTTGCCAACCCAGGCGGTGCCGGGGTCGAACGTTGGCGGCCTTACATCATCAAAGCCTTCAAGACTTTGGGTGTTGAAGCTACTGCTACGAAGGTTTCTAAACTGTTAAAACAGATCCAGACCGAATCTGGCGGTAATCCAACCGTACCGCAAAAGGTCTGGGACATTAACATGGCGAACGGCAATCCCGCGCAAGGGTTGCTGCAGTTCATCCCAAGTACTTTTAACCATTGGGCAATTCCTGGCCACAAACAAATCCTTAATGGATATGACCAGATCTTAGCTGCTATCAACGCTCTGGAACATGGCGGTGAAGGTGGCTGGGGCAATGTCGGTCAAGGCCACGGCTGGGCTAATGGTGGTCTGATCTCTAACCATGGCGTTTATGAGATTGCAGAAAAGAACATGCCAGAATATGTCATCCCGACCGACATCAGCAGACGGTCACGTGCATATCAGCTGCTTGGCGAGATCGTTACCAGATTCCGCAACGATGATCCTACTTTGAGCCATAATGCTCAATATGTCGGCGGAAATGATCGCCAATCAGACGCTCTGAGCCATAAGTTAGACGAAT
TACTGTCCAAGTTTGATATTTTGTTGCGTCTGAGCGGCGATCAGGTAGACGCGATTAAAGCTCAAGGCAGTCTCGACATGCAACAGTTGTACAAAAAGGAAGCAAAAGACGCGCGGATGCGTCAATTAGGATTCTAG
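The CDS coordinates can be cross-checked against the reported protein length with simple arithmetic. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention, which the page does not state) and that the last codon is a stop codon, as the trailing TAG in the sequence above suggests. The function name `cds_protein_length` is hypothetical.

```python
def cds_protein_length(start: int, end: int, has_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS.

    Assumes 1-based, inclusive coordinates on a single uninterrupted range.
    """
    n_nt = end - start + 1  # inclusive range length in nucleotides
    if n_nt <= 0 or n_nt % 3 != 0:
        raise ValueError(f"CDS length {n_nt} is not a positive multiple of 3")
    codons = n_nt // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


# CDS location from this record: range 26115 -> 30251, strand +
print(cds_protein_length(26115, 30251))
```

For this record the range spans 4137 nucleotides, that is 1379 codons, or 1378 residues plus a stop codon. This agrees with the protein length of 1378 AA listed above.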
Gene Ontology
| GO ID | Description | Category | Evidence (source) |
|---|---|---|---|
| GO:0098003 | viral tail assembly | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (5tUfk) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
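The confidence bands above can be applied to per-residue pLDDT scores with a small lookup. This is a minimal sketch, not the page's own code. The legend uses strict inequalities, so the handling of boundary values is an assumption: here exactly 90 and 70 are placed in "High" and exactly 50 in "Low". The name `plddt_band` is illustrative.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score > 90:
        return "Very high"
    if score >= 70:  # assumption: boundary values 90 and 70 count as High
        return "High"
    if score >= 50:  # assumption: boundary value 50 counts as Low
        return "Low"
    return "Very low"


# Hypothetical per-residue scores, for illustration only.
for value in (95.2, 81.0, 60.5, 32.7):
    print(value, plddt_band(value))
```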