Protein

Protein accession
D2KRB7 [UniProt]
Representative
5tUfk
Source
UniProt (cluster: phalp2_30529)
Protein name
Putative tape measure protein
Lysin probability
74%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MMATIKLSHMINPHFYPMWNTDKPYVICKGGRGSFKSSVISMRLVTKVKHWTMLGHKVNVVCVRENASYLHDSVYSQIRWALNMLHMDDEYHFYKSPLRITHKRTGSTFYFYGADDPMKLKSNIVDSVIAVWFEEAANFKGPDVFDQATPTFVRQKPDYVDHVTVYYSYNPPKNPYETLGKIISTIFSGIGDAFKPNGDAIGAITKPLRESSGFAKPLNSALKEIASHKTTLKAVGAAIGSIAAGFIAMKSVSSTISGIKLAVSALGRLKTAISGLSLLSNPVGIIVLSIVALGTAFVTAYKHSKTFRDGVNSVIKAISKTVGNVLKTVTKAFGGLWKSIKPSVDQIIKGFRDLFKTISPILKFIGIVWTTIWKANFVVVSTVFAMITKLVIAGLKAVWTNVKTYAKIIVDIWKLEWDIFKNAVIIVFKALATVVKTGVKVIKDVIDLVMNIITGNWKGAWNDIKDIFSSIWKGISKIGKDLFGSVRDMIKDVLGDIKGIWEDLWNGMAKFFSNIWDGIKDAAKAGLNGVIGFINTGVDGLNSVIHFFGGKKETITPIKKLAHGTSANDRDELALVNDEGGDTYQEAIVRANGKVEIPKNRNQLVFLNRGDEVIPAKKTAEMFGLSHYAKGKKGWLSAAWDNVKDWAGDTFEAIEDALKDPLSVLTGLFHKGKNTATAVWHDVGEGAANYLPKVGADWFKKELQKLEDALTPSNPSGSGAQRWKPYIEKAFKELHVNASEAKINKLLRQIQTESGGNPTIRQQISDKNSRSGNPAQGLLQFIPSTFNHWALKGHGQILNGYDQILAAINALEHGGEGGWGNVGNGHGWANGGWADRPSIFGEVKGQKEIAINPARPTSERHILEAIRARAAKSPSGFAAKLNQIITRQQMASHQIQPATPSANEVPTLGNGGRLNGNLTMNFVVDGTTMARVTYPKYKALMAHEITIRGAGGAVPVGQAIPVGGGF
Physico‐chemical
properties
protein length:966 AA
molecular weight:105429,4 Da
isoelectric point:9,66
hydropathy:-0,09
Representative Protein Details
Accession
5tUfk
Protein name
5tUfk
Sequence length
865 AA
Molecular weight
95069,37010 Da
Isoelectric point
9,60561
Sequence
LIKPSKGATEALKSIGLSTKDFTDKNGNMKSMSDIFKELNEHTKNLSKQEKGALFKAIFGATGESAAIILSDSASEMEKLNKQVEKSYKGQGYVQRLANKNMGSVKMETAQLKESGEAASLMIGKALLPALRDASTAMAKAFNSKDGQKGLKVIAKGVGDFAKVVVDLVIALGKHTTTIKVFGATLGTAFAIFKTMKLVNTIKMTVTTFKELTLATKAFKIAMAGGGIALIITGVVIALKELYKHNKKFRNFVNGLAKDAKKFAKDFGKAFKDLGTLIVKRTKETNKEIGNWWKSTKKSFADGWKDLKKKTGDGIDAVKRGWDKLSGETVRSAQQMFNKHKSTFQAGYKVIEDRTTTWHDLVSGRWDRLGEDTERTAQDMFKFNRKIFSDMYNKLNDMTGGRLGDMLKIWQDIFGKIQDAVGNAVGSVHRHFVDLVNGVLKPFKTMIDDVKGGINWILDKVGGSKIGGDFSISMPSYANGTNDTHPGGFAKVNDGLTAHYREMFMTKDGQVGMFPAKRNLILPLPKGTSVLDGERSYQLSRMFGMIPHYADGVGNAFSSLLSKVGDATDDILGMVDKIMSKPVEFMESVFQKFVHVSTPVKFAAELVKDVPVYIAKQMGNWIKKQFETLANPGGAGVERWRPYIIKAFKTLGVEATATKVSKLLKQIQTESGGNPTVPQKVWDINMANGNPAQGLLQFIPSTFNHWAIPGHKQILNGYDQILAAINALEHGGEGGWGNVGQGHGWANGGLISNHGVYEIAEKNMPEYVIPTDISRRSRAYQLLGEIVTRFRNDDPTLGHNLQSVGNSDRQSDALSHKLDELLSKFDILLRLSGDQVDAIKAQGSLDMQQLYKKEAKDARMRQLGF
Other Proteins in cluster: phalp2_30529
Total (incl. this protein): 11 Avg length: 909,2 Avg pI: 9,67

Protein ID Length (AA) pI
5tUfk 865 9,60561
1FVMM 542 8,99684
1jEIz 912 9,87818
1orzq 1107 9,75221
5RxjE 865 9,61889
5tTT9 704 9,96180
60Qs4 699 9,78606
7py3R 998 9,78748
7x94O 965 9,68813
A0A8S5M673 1378 9,63185
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23533
7euEk
5 32,9% 805 7.394E-136
2 phalp2_20979
7wDdb
23 29,8% 697 2.562E-131
3 phalp2_30711
7foWC
101 24,4% 1252 1.597E-99
4 phalp2_13297
3xFth
5 23,9% 936 1.885E-89
5 phalp2_18933
28wdb
2 26,1% 669 1.649E-76
6 phalp2_12326
7wR7Q
26 23,9% 894 3.209E-69
7 phalp2_380
7hjyf
2 26,0% 676 5.764E-69
8 phalp2_427
7zt3x
4 27,3% 585 3.338E-68
9 phalp2_10042
7lCpJ
2 26,2% 723 9.348E-62
10 phalp2_36879
7rXBb
1 25,9% 635 7.039E-55

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Lactobacillus phage phiPYB5
[NCBI]
438780 No lineage information
Host Lactobacillus fermentum
[NCBI]
1613 Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae > Lactobacillus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
GU323708 [NCBI]
CDS location
range 8231 -> 11131
strand +
CDS
ATGATGGCGACGATAAAGCTTAGTCACATGATCAATCCACACTTCTATCCAATGTGGAATACTGACAAGCCATACGTGATCTGTAAAGGTGGCCGTGGCTCGTTTAAATCGTCGGTAATTAGCATGAGACTAGTTACTAAGGTTAAGCATTGGACGATGCTGGGACACAAGGTCAACGTGGTTTGTGTGCGAGAGAACGCCAGCTATCTGCATGATTCGGTGTACAGCCAGATTAGGTGGGCATTGAACATGTTGCATATGGATGATGAGTATCATTTCTACAAGTCACCGTTGCGTATTACGCATAAGCGCACCGGGAGTACGTTTTACTTCTATGGTGCTGATGATCCAATGAAGCTTAAGTCCAACATTGTGGATAGCGTGATTGCTGTGTGGTTTGAGGAAGCGGCAAACTTCAAAGGCCCGGACGTATTTGACCAAGCTACCCCAACATTCGTTCGACAGAAACCAGATTATGTGGATCATGTAACGGTGTACTACTCATACAACCCGCCTAAAAATCCTTACGAGACGCTGGGTAAGATCATCAGTACAATCTTCAGTGGGATCGGTGACGCCTTTAAGCCAAATGGTGACGCAATCGGTGCCATCACTAAGCCGCTTCGCGAGAGTTCCGGCTTTGCCAAGCCATTAAATAGTGCTCTAAAAGAAATTGCTAGCCATAAGACTACATTAAAAGCGGTCGGGGCGGCGATTGGATCTATTGCTGCTGGCTTTATTGCGATGAAGTCGGTTAGTTCCACCATAAGTGGGATCAAGCTAGCAGTGAGTGCACTTGGTAGGCTTAAAACTGCCATCTCCGGACTATCGCTATTAAGTAACCCGGTTGGAATTATAGTTCTATCGATCGTGGCACTTGGAACGGCGTTTGTAACGGCGTACAAGCACTCTAAGACGTTTCGTGACGGTGTTAACAGCGTCATCAAGGCGATTAGCAAGACGGTTGGTAATGTGCTTAAGACGGTAACCAAAGCGTTCGGTGGGCTGTGGAAGAGCATTAAGCCTTCAGTTGATCAGATCATTAAGGGATTCCGTGATTTGTTTAAAACCATTAGCCCGATTTTAAAATTCATCGGTATCGTCTGGACAACTATTTGGAAGGCTAATTTTGTTGTGGTGTCGACCGTATTCGCTATGATTACGAAGCTGGTCATCGCCGGATTAAAGGCAGTCTGGACGAATGTTAAGACTTACGCCAAGATTATCGTGGATATCTGGAAGCTAGAATGGGACATTTTTAAGAATGCAGTTATTATCGTATTTAAGGCCTTAGCAACGGTGGTAAAAACCGGCGTAAAGGTCATTAAAGATGTGATTGACTTGGTAATGAATATAATTACCGGTAACTGGAAGGGTGCTTGGAACGACATTAAAGATATCTTCTCAAGTATCTGGAAGGGGATTTCCAAGATTGGCAAGGACCTGTTCGGTAGTGTCCGGGATATGATTAAAGATGTTTTAGGCGACATCAAAGGGATCTGGGAAGATCTCTGGAACGGGATGGCGAAATTCTTCTCAAACATTTGGGACGGAATTAAAGACGCTGCTAAAGCTGGCTTGAACGGTGTCATTGGTTTCATCAACACTGGTGTTGACGGGCTTAATTCCGTGATCCACTTCTTCGGTGGAAAGAAGGAGACCATTACCCCGATCAAAAAGCTTGCGCATGGGACGTCAGCTAACGATCGTGACGAACTAGCATTAGTCAATGACGAAGGTGGGGACACCTACCAGGAAGCGATTGTTCGAGCTAACGGTAAGGTTGAAATTCCGAAGAACCGTAACCAACTCGTTTTCTTAAATCGTGGCGATGAAGTCATTCCGGCTAAGAAGACGGCAGAAATGTTCGGTCTTAGTCATTATGCTAAGGGCAAAAAGGGCTGGCTATCGGCCGCATGGGACAACGTCAAGGATTGGGCCGGCGATACATTCGAGGCGATTGAAGACGCACTTAAGGATCCGCTAAGCGTACTTACTGGACTCTTCCACAAGGGTAAGAATACCGCAACTGCCGTATGGCATGATGTGGGTGAAGGAGCCGCTAACTACTTACCTAAAGTCGGTGCCGATTGGTTTAAAAAAGAACTTCAAAAATTGGAGGACGCTTTAACGCCATCGAACCCAAGTGGATCTGGTGCGCAACGGTGGAAGCCGTACATTGAGAAAGCCTTCAAAGAGCTTCATGTAAATGCCAGCGAAGCTAAGATCAACAAATTACTTCGCCAGATTCAAACGGAATCTGGTGGGAATCCAACCATTAGACAGCAAATCAGTGACAAAAACTCAAGATCCGGCAATCCGGCACAAGGGTTACTTCAATTTATCCCATCGACATTTAACCATTGGGCGCTCAAAGGACACGGTCAAATCCTTAATGGGTATGACCAGATCTTAGCCGCTATTAATGCGCTTGAACACGGTGGTGAAGGCGGTTGGGGTAATGTTGGTAATGGCCATGGTTGGGCTAACGGTGGTTGGGCTGATCGTCCAAGTATCTTTGGTGAGGTTAAAGGCCAAAAAGAGATTGCAATTAATCCCGCCCGGCCAACGTCTGAGCGTCATATTTTGGAAGCGATTCGGGCAAGAGCCGCTAAGTCACCTAGTGGGTTTGCCGCAAAACTCAATCAAATTATTACGCGCCAACAAATGGCAAGCCACCAAATTCAACCCGCAACTCCGTCCGCTAACGAAGTCCCAACACTAGGTAATGGCGGACGATTAAACGGAAACTTAACAATGAATTTCGTGGTTGATGGTACGACGATGGCCCGCGTAACTTACCCTAAGTACAAGGCCTTGATGGCCCATGAAATCACAATTCGTGGGGCTGGAGGAGCTGTTCCAGTTGGTCAAGCTATCCCAGTAGGAGGTGGATTCTAA

Gene Ontology

Description Category Evidence (source)
GO:0016020 membrane cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0001c0ad85_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5tUfk) rather than this protein.
PDB ID
5tUfk
Method AlphaFoldv2
Resolution 52.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50