Protein

Protein accession
D9ZNE6 [UniProt]
Representative
7pwUg
Source
UniProt (cluster: phalp2_33471)
Protein name
Putative tape measure protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MALSAGTIMATLALSTSPFKASLQSAQNDLKTFANGSESTGKRITALGSAANTVGASLAKHVTLPIVGVGAAAIKMSSDFDAQMSRVQSVAGASGSQIETLRKQAIDLGASTSFSASEAAVGMENLASAGFSVNEITSAMPGMMNLAAASGEDLATSSDIAATTLRGFGLAASEAGHVADVLAQNANATNASVASTGEAMKYVAPIAHTAGMSMEEVTAAIGEMADQGIQGSQAGTTLRSALNSLSNPSKQAAGLMKQIGFSAFDTNGKMLPLNEIIGKLQSSTKGMTQQQKALTLSTIFGSDALSGMQVLIGDGQDKLKGLTKELKNSDGAAKAAAKTNQDNLKGSIEGLKGSLESAALAIGKTLTPAIRSITDHLGNLVKAFNKMSPASQTFIVAVGGVVAAIGPALLIFAKTVKAIQSIHQAFTIVKDVKAVSTAISGIGKAFNVLTLGANPVMIAIYGIAIAALIIYKNWDKLAPYFKKVWAVVTGIFTSAKNTIMGAWSGITGFFSGIWNGIKAGVYIAIAGISTGWTAAVTGIKTAFSAIGNFFAGVWNGIKATSLAIWNAMKVAGLAVWNGLVGGIKSIWNGLVAFFKAFPGVMANIGKSIFNFFKNGAISLLTNAVAGIKALWNGAVNFFRSMPTVFANTGRNIFSFFKNGAISLLTGAVAGIKKMWNGLVNWFKNFPKNFVNIGKDIMQGLWNGLKAIGGKVVAFAKKLAKDLLTGMKRVFGIASPSKQTYAMGGYLMKGLENALLSGAGHIKAVVQKVFGGAINFAHGIVGSAKVGAWLTTALAQSGKPLAWLPALQTLVQKESGGNPLSVNSQAVGGEHATGLMQMLGSTFNSYAASGHGNIMNPIDNIMSALNYIKARYGSPYNIPHLFSGNYVGYATGTDNATPGAHAVGEKGLEVVLGNALKWFRGGETVLSNMQTNNLVTNMTSVLNTLQGLVTGVQAGITGISNTFATANGLIGNNKVELAKKAQDSLNTTGGTPAKVELHLDGKYAFTDRESIDYLSTVISRKITGNVRRRK
Physico‐chemical
properties
protein length:1029 AA
molecular weight:106389,5 Da
isoelectric point:9,89
hydropathy:0,23
Representative Protein Details
Accession
7pwUg
Protein name
7pwUg
Sequence length
1097 AA
Molecular weight
115066,15200 Da
Isoelectric point
7,74877
Sequence
MAIDGGTILATLALTTSPFNASLASAGKSLRTFADSAQTVNTRIGALGNAANAIGSSLSKYVTLPILGIGAAALKMSTQFDAQMSRVQSVAGASASQMKALHDQALELGASTAFSASEAAEGMENLASAGFTVNEITKAMPGMLDLAAASGEDLANSADIAASTLRGFGLAADQAGHVADVLAKNANATNAAVADTGEAMKYVAPVAHSMGLSLEEVTAAIGEMADQGIKGSQAGTTIRSALISLAKPSKQAAELMQSIGFSAFDAQGKMLPLNQIVANLQKSMAGMTQEQKANTLATIFGTEAMSGMQILVNDGSTSLQKLTKELQNSTGAAADAAKINQDNLKGSLDGLSGAFETLGIKIGETATPQLRGLVDWFAKVVTAFGNSSDGTRTFIVETGLAVAAIGPLLLVLSQLILSLQRLNEAYQFLMRFQAMQSIMNGVRGAMAGLLSPISLTIGALALLGIGIYEAATHWQYLRGVMANVDRYIHEDLTKDLKYAYETVVMITKNIVQWLQDNLPGIINTVITVGGQIFQYLSEHAGQFLTVAKQIVFNIANGIITNLPQMIAAASNIINTLMTSFLNSLPTLMLMGQQLLQNILNGINTYLPNILTTGLNIITQLINSIVVNLPYIINIGSQVIITLLTAIIQALPILISGAANIISALIIGIINVLPMLINVGLQLVLALANAILQNLPQIITAGMNLIVALINGILQNLPQIIATIVEVIVLLAQAIIQNLPLILSAGIRIIGAIIIGLGQALPQLLSYGGQAMHQLWNILKNVDWAAVGKAVINGIIAGIKAIAKNLWDTAKWLGEQLLTKLRETFGIGSPSKEMYSIGKYLIQGLANAMTDGKDHIMSVVNKVFGGALSYAKGMFSSAQVGAWLSMALMESGTPLSWLPAMQQLVKKESGGNPGAYNSTSVGGQHATGLMQMLPSTFRAYMASGHGNITNPLDNILSAINYIKARYGSPYNIPHLFGGNYVGYATGIKSAVKGMHAVAEKGLEVIGGKALRYFSGGQTVLNNRDSMNLLTTVANITDALNNAKGLDTITQAQLATPNGTPVSIELNLNGNYAFTDKAAIDELSTQVSRTIGGRMRGRF
Other Proteins in cluster: phalp2_33471
Total (incl. this protein): 5 Avg length: 1042,6 Avg pI: 9,46

Protein ID Length (AA) pI
7pwUg 1097 7,74877
3xOEf 1029 9,89604
7BUDE 1029 9,88715
A0AAF0T9W4 1029 9,86342
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24823
7mvK1
8 24,2% 1390 5.214E-91
2 phalp2_15998
7RoFJ
265 26,2% 1218 1.116E-83
3 phalp2_19743
4SuaV
8 25,9% 953 3.236E-63
4 phalp2_12326
7wR7Q
26 25,3% 1021 3.236E-63
5 phalp2_7791
7r78a
2 26,3% 709 1.056E-58
6 phalp2_13297
3xFth
5 24,1% 833 1.841E-43
7 phalp2_10100
78Jr
6 25,3% 721 1.080E-40
8 phalp2_22260
7m77N
19 22,6% 1169 9.904E-40
9 phalp2_15327
1owvI
3 25,3% 1063 1.002E-36
10 phalp2_7794
7rQ6d
55 22,0% 1125 4.018E-33

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Clostridium phage phiCTP1
[NCBI]
871584 No lineage information
Host Clostridium tyrobutyricum
[NCBI]
1519 Firmicutes > Clostridia > Clostridiales > Clostridiaceae > Clostridium >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
HM159959 [NCBI]
CDS location
range 12537 -> 15626
strand +
CDS
ATGGCATTAAGTGCTGGAACAATAATGGCAACATTGGCATTATCTACAAGCCCATTCAAAGCCTCCCTACAATCAGCTCAGAATGATTTAAAAACATTTGCCAATGGAAGTGAAAGTACTGGCAAACGTATAACAGCATTAGGAAGTGCGGCAAATACTGTTGGAGCTTCTTTAGCTAAACATGTAACCCTACCTATTGTGGGTGTTGGAGCGGCGGCAATAAAGATGTCAAGCGACTTTGATGCTCAAATGAGTAGGGTTCAATCTGTTGCAGGAGCATCAGGTTCACAAATAGAGACATTAAGAAAACAGGCTATTGATTTGGGTGCTTCTACATCTTTTAGTGCTAGTGAAGCGGCTGTAGGTATGGAAAACCTAGCCTCTGCTGGTTTCTCAGTAAATGAAATTACTAGTGCTATGCCAGGTATGATGAACTTAGCGGCGGCTTCAGGAGAGGATTTGGCAACAAGTTCAGATATAGCGGCTACTACATTAAGAGGATTTGGTTTGGCGGCATCAGAAGCTGGTCATGTTGCAGATGTATTAGCACAAAATGCTAATGCAACAAATGCTTCAGTAGCTAGTACTGGGGAAGCTATGAAATATGTTGCACCAATAGCCCATACAGCAGGGATGAGCATGGAAGAAGTTACAGCGGCTATAGGCGAAATGGCTGACCAAGGTATACAAGGTTCACAAGCGGGTACTACCCTAAGAAGTGCATTGAATAGTTTGTCTAACCCTTCTAAACAAGCGGCTGGGCTTATGAAACAAATTGGATTTAGTGCTTTTGATACTAATGGAAAAATGTTGCCACTAAATGAAATTATAGGTAAATTACAATCTTCTACTAAAGGAATGACTCAGCAACAAAAAGCATTAACGTTATCAACCATATTTGGTTCTGATGCTTTAAGTGGTATGCAGGTATTAATCGGTGATGGTCAAGATAAATTAAAAGGCTTAACTAAAGAACTCAAAAACTCAGATGGTGCGGCTAAAGCGGCGGCAAAAACTAACCAAGATAACTTGAAAGGTTCGATAGAAGGTTTAAAAGGTTCTCTGGAATCAGCCGCCCTAGCTATAGGGAAAACATTGACCCCTGCTATAAGAAGTATTACTGACCATTTAGGTAATTTAGTAAAGGCTTTTAATAAGATGTCACCAGCTTCGCAAACATTTATTGTGGCAGTAGGTGGAGTAGTAGCGGCAATTGGACCTGCATTATTAATATTTGCTAAAACAGTTAAAGCAATACAAAGTATACATCAAGCCTTTACTATAGTTAAGGATGTAAAAGCTGTATCTACTGCTATAAGTGGAATTGGTAAAGCATTTAATGTTTTAACTTTAGGGGCTAATCCAGTAATGATTGCTATATATGGGATTGCCATAGCGGCATTAATTATATATAAAAATTGGGATAAGTTAGCACCATATTTTAAAAAGGTATGGGCGGTTGTTACTGGAATATTTACAAGTGCTAAAAATACTATTATGGGTGCGTGGAGCGGTATTACTGGATTTTTTAGTGGAATCTGGAATGGTATAAAAGCCGGTGTGTATATTGCAATTGCTGGGATATCTACAGGTTGGACAGCGGCAGTAACTGGAATTAAAACTGCATTTTCTGCAATTGGTAATTTTTTTGCTGGTGTTTGGAATGGTATAAAGGCAACAAGTCTTGCGATATGGAATGCAATGAAGGTTGCTGGGCTTGCAGTATGGAATGGTTTAGTAGGGGGAATTAAATCTATATGGAATGGGTTAGTAGCGTTCTTTAAAGCTTTCCCGGGTGTAATGGCTAATATAGGTAAGAGTATATTTAACTTCTTTAAAAACGGGGCTATATCTCTTTTAACTAATGCTGTTGCTGGTATAAAGGCTTTGTGGAATGGGGCTGTTAATTTCTTTAGAAGTATGCCTACAGTATTTGCTAATACGGGTAGAAATATATTTAGCTTTTTTAAGAATGGTGCAATCAGTTTATTAACTGGGGCAGTTGCTGGAATTAAAAAGATGTGGAATGGTTTAGTTAACTGGTTTAAAAATTTCCCTAAAAACTTTGTAAATATTGGTAAGGATATCATGCAAGGTTTATGGAACGGATTAAAGGCTATTGGGGGTAAAGTTGTAGCTTTTGCTAAAAAATTAGCCAAGGATTTATTAACTGGTATGAAGAGGGTATTTGGTATTGCTTCACCATCCAAACAAACTTATGCAATGGGTGGCTACTTGATGAAAGGACTTGAAAATGCCCTGCTGAGTGGTGCCGGACATATTAAGGCAGTAGTACAGAAAGTATTCGGTGGTGCTATTAATTTTGCACATGGTATTGTTGGTAGTGCCAAAGTAGGTGCATGGCTTACAACTGCATTAGCACAATCAGGAAAGCCTCTAGCATGGTTGCCAGCATTGCAAACTTTAGTACAGAAAGAATCAGGTGGTAACCCATTATCTGTTAATAGTCAGGCAGTGGGAGGAGAGCATGCCACTGGGTTAATGCAGATGTTAGGTTCTACTTTTAACAGTTATGCGGCTAGTGGACATGGTAATATAATGAACCCCATTGACAATATAATGTCAGCCTTAAATTATATTAAAGCTAGATATGGTAGCCCATATAATATACCACATTTATTTAGTGGAAATTATGTTGGTTATGCAACCGGAACAGATAATGCAACCCCGGGAGCACATGCAGTAGGTGAAAAAGGACTTGAGGTTGTACTGGGAAATGCGTTAAAATGGTTCAGAGGAGGAGAAACAGTTTTAAGTAATATGCAAACTAATAATTTAGTTACGAATATGACTTCTGTTTTAAATACTCTACAAGGATTAGTAACTGGTGTACAAGCTGGAATTACTGGTATAAGTAATACATTTGCAACTGCAAATGGGCTTATTGGTAATAATAAAGTAGAATTAGCAAAGAAAGCTCAAGATAGTTTGAATACTACAGGTGGCACACCTGCCAAAGTAGAATTACACTTAGATGGTAAATATGCTTTCACCGATAGAGAATCAATTGATTATTTATCTACTGTAATATCAAGAAAAATAACTGGTAATGTGAGGAGGAGAAAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0001e07824_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7pwUg) rather than this protein.
PDB ID
7pwUg
Method AlphaFoldv2
Resolution 60.14
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50