Protein

Protein accession
E9LUR1 [UniProt]
Representative
5tZKW
Source
UniProt (cluster: phalp2_28908)
Protein name
Minor tail protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MATEKIQGYEFAINMDDGGMTRTLREIKNEAKLLKSGMQANFAEIRSGEGIMAAYAGKVKDAGRAIEAQRLVIERLKSEQNGLDQTTQKGREAYVKYENQINATKRSIASLEGQQERAQKSLDLQKSGVLQLKDAMEAQNKISANYIAKLKAEGKEDEALASQKDSLKDKIKSLKALYESESHQLIKMSEDSKVTAESYQQQKIRVSELTAELAKSKSELNSLNSPEQTHLKNMENLRGQSKLIEGSMAALVSRFKAEGNELEANKAKSNGLREAYDNLNKQLKAESEELNRIKSVSGESSNAYKQQSIRVNELGTKIAQTRTKMKELDEQLSKKPQSGLTSVISQLNRVNEHADKANHLFGKILGAHLVANGITSAFQSITSHIHEAISAGMEYEKEQQKMVATWLTLTGTVGKSNAMVKTINDLSVKTGQAVDVVNELEQGFYHLHSNKKESDELTKSMLNMADAVGLDSQQIQAVTQDMVNGLSRGKANAGMLNQISQYFPMFREQLAKYETQVNHGKEVTVADLAAMAKAGKISAKDIENTFNQLGSGKYDKAADNMLHTMVGMERTIKARVPALIGDIEKPILTAQNPIYGAVSKWVSDKRTDKEFDKVGVAAEKGISTITKAFAKAFDVKSAPKAMNDAMDNLAKGVTKASDSIARNAPEIVDFFKTVKNLGGLGFETLIESLKITNALLKPLLSMVGGHTETIAKFGAAWWLTSKAVKETSSVLSTFKKISDTVSWAEKVLGIKQETKALEEQNAVLKTNAELSMASEENVGTGYRRVKGRKAGNIGADLSSISVEAENTEKIAKSSKWSLLGRTIGTRIINGAGLAMTAWDAGSSIAKAVSSGKASDKYKATGKTAGTLIGGGIGAALGSVIPGAGTAAGAMLGASIGDGVGGTKTANTIVKRISDALKGKSIEAPKIKTESTKHSLSDLSKAYSSYYSKKQKQDLNDVNVLHKAGMLTDAEYKKQLASIKKNDSETNRFEKMSASDRNAIAKYYAQQKASIISKWNARERQTSSSWDAKIASDERRFGANSIIVQKDMSKKKAAIQAEENKKSAALDKLRIKSATETTAQEARLHTTLTGKIKSAANKQNDILRNLAKSKGKITREQANDAISQSNKEYKKTVSLANQEYKDRVSAAEKQHNKVIKAAERQASEAISQAKSQYSKTVDAAKNQYSGNSKYAEKQRAAIISKAKDQKQKSIDNALEQENKTEQHADRQYKHTTDDADKQRSQVVKHAKDQNSSVVDQAKSQSKGVLGHAVKQANGSMKAADKQGSGIHSIWKNITSFFSNLVKGFGIKPINVGAYQSGYNPVTIGAYASGGIVGTARALVGEGGVEAKIDRDNGKVSFLGMNGAEVVNVKPGDQILNAGDTAKLFNGGLGHTLPGYAKGTIDIASFLKKIKNGATSIFDSISDKAMDALSKITHPLKTLKSMALKTFDPTKTPGVGSIGHDLGKGLVDRALKGFAKAISDLADNFGGAGGSVGNPAGSSVSRWKPYVVRALKANGFGATASQVSAWMRVIARESNGNPRAINLWDSNAKKGIPSMGLVQTIRPTFEAYKFSGHGQIYNGYDDLLAGINYMKHIYGKGDSAFARVSGPEGYANGGFGNKAGVYKLFEGNLPEAIVPMDLSKRSRAYQIMQQIMAKFGAQDGANVINTGNDQIDSDEAFKQRVIASLDALVTGQGDVKAVVANSDVVNAVKSNTKKTSQYSQMMGY
Physico‐chemical
properties
protein length:1722 AA
molecular weight:185699,1 Da
isoelectric point:9,55
hydropathy:-0,52
Representative Protein Details
Accession
5tZKW
Protein name
5tZKW
Sequence length
381 AA
Molecular weight
41393,99020 Da
Isoelectric point
9,83312
Sequence
MRSKKQVLLVNGMLERQTSSSWDAKIASDERRFGANSIIVQKDMSKKKAAIQAEENKKSAALDKLRIKSATETTAQEARLHTTLTGKIKSAANKQNDILRNLAKSKGKITREQANDAISQSNKEYKKTVSLANQEYKDRVSAAEKQHNKVIKAAERQASEAISQAKSQYSKTVDAAKNQYSGNSKYAEKQRAAIISKAKDQKQKSIDNALEQENKTEQHADRQYKHTTDDADKQRSQVVKHAKDQNSSVVDQAKSQSKGVLGHAVKQANGSMKAADKQGSGIHSIWKNITSFFSNLVKGFGIKPINVGAYQSGYNPVTIGAYASGGIVGTARALVGEGGVEAKIDRDNGKVSFLGMNGAEVVNVKPGDQILNAGDTAKLFN
Other Proteins in cluster: phalp2_28908
Total (incl. this protein): 2 Avg length: 1051,5 Avg pI: 9,69

Protein ID Length (AA) pI
5tZKW 381 9,83312
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18624
3UBv
1 23,7% 358 2.994E-10

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Lactobacillus phage Sha1
[NCBI]
947981 No lineage information
Host Lactobacillus sp.
[NCBI]
1591 Firmicutes > Bacilli > Lactobacillales > Lactobacillaceae > Lactobacillus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
HQ141411 [NCBI]
CDS location
range 13799 -> 18967
strand +
CDS
ATGGCAACAGAGAAAATTCAAGGCTACGAATTCGCAATTAACATGGACGATGGTGGCATGACTCGCACGTTGCGAGAAATAAAGAATGAAGCAAAATTACTAAAATCTGGTATGCAAGCTAACTTTGCTGAAATCCGTTCGGGTGAAGGTATTATGGCGGCTTATGCGGGTAAAGTCAAAGATGCTGGCCGAGCTATTGAAGCACAACGATTAGTAATTGAGCGTCTCAAAAGCGAGCAAAACGGATTAGACCAAACCACTCAAAAAGGCCGAGAAGCTTATGTTAAATATGAAAATCAGATTAACGCTACCAAGCGCTCAATCGCCAGTTTAGAGGGGCAACAAGAACGAGCACAGAAGTCACTTGATCTGCAAAAAAGTGGTGTCTTACAATTAAAGGACGCCATGGAAGCACAAAATAAAATAAGTGCTAATTATATTGCCAAATTAAAGGCTGAAGGTAAAGAAGACGAAGCGCTCGCAAGTCAAAAAGATAGCCTTAAAGATAAAATAAAAAGTCTCAAGGCTCTTTATGAATCTGAATCGCATCAGCTGATAAAAATGTCAGAAGACTCCAAGGTAACTGCTGAATCATATCAACAACAAAAGATCAGAGTTAGTGAACTCACTGCTGAGTTAGCAAAATCCAAGTCAGAACTGAATAGTCTAAATTCACCAGAGCAAACCCACTTAAAAAATATGGAAAACTTGCGTGGGCAGAGCAAACTAATTGAAGGGTCTATGGCTGCTTTAGTGTCGAGATTTAAGGCCGAAGGAAATGAGTTGGAAGCAAATAAGGCTAAATCTAATGGCTTGCGAGAGGCCTATGATAACCTTAATAAACAATTAAAAGCAGAAAGCGAAGAGCTCAATAGAATTAAATCGGTCAGTGGCGAAAGCTCAAATGCATATAAGCAACAATCTATTCGAGTGAACGAATTAGGCACTAAAATTGCCCAAACTCGGACTAAGATGAAAGAGCTTGATGAGCAATTAAGCAAAAAGCCACAGTCAGGATTAACGTCAGTCATTAGCCAGCTAAATAGAGTAAACGAGCACGCAGATAAGGCCAATCATTTATTTGGCAAAATTCTGGGTGCTCATTTAGTTGCCAATGGTATTACGAGCGCTTTTCAATCAATTACTTCACATATTCACGAAGCTATTAGCGCTGGTATGGAATATGAAAAAGAGCAGCAGAAGATGGTGGCCACTTGGTTGACTTTAACTGGCACTGTTGGCAAATCTAACGCAATGGTTAAAACAATCAACGACTTATCTGTTAAGACCGGTCAAGCCGTAGATGTTGTAAATGAATTAGAGCAAGGTTTTTATCACTTACATTCCAATAAAAAAGAATCAGATGAACTAACCAAATCCATGCTGAACATGGCTGACGCTGTTGGTTTAGATAGCCAACAAATTCAGGCGGTTACCCAAGATATGGTCAACGGTCTGTCACGGGGAAAAGCAAATGCTGGCATGTTAAACCAAATTAGCCAGTATTTCCCGATGTTCCGTGAACAGTTAGCTAAGTATGAAACTCAAGTCAATCATGGTAAGGAAGTAACGGTTGCTGATTTAGCGGCCATGGCTAAAGCGGGCAAAATATCTGCTAAAGATATTGAAAATACGTTTAATCAACTTGGATCCGGAAAATACGATAAAGCCGCCGACAACATGTTACATACGATGGTTGGTATGGAACGTACAATCAAAGCGCGTGTTCCAGCTTTAATTGGTGACATTGAAAAGCCAATTTTGACCGCTCAAAATCCAATCTATGGTGCAGTTTCAAAATGGGTATCTGACAAACGGACTGACAAGGAGTTTGATAAGGTCGGTGTGGCGGCAGAAAAGGGTATTAGCACGATTACTAAAGCTTTTGCTAAAGCCTTTGATGTCAAGTCAGCACCAAAAGCAATGAATGATGCAATGGATAACTTGGCCAAGGGTGTCACCAAAGCTTCTGACTCTATTGCCAGAAATGCTCCGGAAATTGTTGATTTCTTCAAAACTGTCAAAAACTTGGGTGGTCTGGGCTTTGAAACGTTAATTGAATCGCTTAAAATAACCAATGCACTTTTAAAGCCATTACTCAGTATGGTTGGTGGGCACACAGAAACAATTGCAAAATTTGGCGCAGCATGGTGGTTAACAAGTAAAGCCGTCAAAGAGACTAGTTCAGTTCTGTCAACTTTTAAAAAAATCAGTGATACTGTTAGCTGGGCTGAAAAAGTTCTAGGTATTAAACAAGAAACTAAAGCTTTAGAAGAGCAAAACGCGGTTCTTAAAACTAATGCTGAACTAAGTATGGCCAGTGAAGAAAATGTTGGAACTGGTTATCGGAGAGTTAAAGGTAGAAAGGCTGGGAATATAGGCGCTGATTTAAGCTCTATATCAGTTGAAGCAGAAAACACTGAAAAAATTGCTAAAAGCAGTAAATGGTCATTGCTAGGAAGAACAATTGGTACAAGGATTATCAATGGTGCTGGATTAGCCATGACTGCTTGGGACGCTGGTAGTAGCATTGCGAAAGCAGTTAGCTCCGGTAAGGCGTCTGATAAATATAAAGCAACTGGTAAAACAGCTGGAACACTTATCGGGGGTGGCATTGGTGCAGCCCTTGGAAGTGTTATCCCGGGAGCAGGAACAGCTGCGGGAGCAATGTTAGGAGCAAGCATTGGTGATGGTGTTGGTGGTACTAAAACTGCAAATACGATTGTTAAAAGAATTAGTGATGCGCTAAAAGGGAAGAGCATTGAAGCTCCCAAGATTAAGACAGAGTCCACTAAGCACTCACTGAGTGATCTAAGTAAGGCGTATAGTTCCTATTATTCTAAAAAGCAGAAGCAAGATTTAAATGATGTGAACGTACTTCATAAAGCAGGTATGCTAACCGATGCGGAGTATAAAAAGCAATTAGCTTCAATTAAAAAGAATGATAGTGAGACAAATCGTTTTGAAAAAATGTCAGCTTCTGATCGCAACGCTATTGCGAAGTATTATGCGCAGCAAAAAGCAAGTATTATTAGTAAATGGAATGCTAGAGAGAGACAAACTAGTTCTAGCTGGGATGCTAAAATAGCATCTGACGAACGACGGTTTGGTGCCAACTCGATTATTGTTCAGAAAGACATGTCTAAAAAGAAAGCAGCTATTCAGGCTGAAGAAAACAAAAAGTCAGCCGCTCTTGATAAACTCCGGATTAAAAGTGCAACAGAAACTACTGCACAAGAAGCCCGTTTACACACAACTTTAACGGGAAAGATAAAGTCAGCTGCTAATAAGCAGAATGATATTTTGAGAAATCTTGCCAAGAGCAAGGGGAAAATCACTCGTGAACAAGCAAATGATGCTATTTCACAGTCGAATAAAGAGTACAAAAAGACAGTCTCACTGGCAAACCAAGAATACAAAGATCGTGTTTCTGCGGCTGAAAAGCAACACAATAAGGTTATAAAAGCAGCTGAAAGACAAGCTAGCGAGGCAATCAGCCAAGCAAAGAGCCAGTATAGTAAAACAGTTGATGCTGCTAAAAATCAATATTCTGGTAATTCTAAGTATGCCGAGAAGCAACGTGCAGCTATTATTAGTAAAGCTAAGGACCAAAAACAAAAGTCAATTGACAACGCTTTAGAGCAGGAAAACAAAACTGAACAACATGCGGATCGTCAGTACAAGCACACTACTGATGACGCAGATAAGCAAAGATCACAAGTTGTTAAACATGCTAAGGATCAAAACAGTTCGGTAGTTGATCAGGCAAAATCACAGTCAAAAGGTGTTTTGGGGCATGCTGTTAAGCAAGCCAATGGCTCCATGAAAGCTGCCGATAAGCAAGGCTCCGGTATTCATAGTATTTGGAAAAACATTACTAGTTTCTTTAGTAATCTAGTTAAAGGATTTGGTATTAAACCAATCAATGTTGGCGCTTATCAATCGGGATATAATCCAGTAACGATTGGAGCTTATGCTTCCGGCGGTATTGTTGGCACTGCTAGAGCTTTAGTTGGTGAAGGCGGTGTCGAGGCTAAAATTGATAGAGACAATGGGAAAGTGTCATTTCTTGGTATGAATGGTGCTGAAGTGGTTAATGTTAAACCTGGTGATCAGATTCTTAATGCTGGTGATACTGCTAAGCTTTTTAACGGTGGCCTAGGACATACGCTTCCTGGCTATGCTAAAGGCACTATTGATATCGCGTCGTTTTTAAAGAAAATTAAGAACGGTGCTACTTCTATTTTCGACAGCATTAGTGATAAAGCAATGGACGCATTGTCTAAGATAACTCACCCATTGAAAACTTTAAAGTCAATGGCTTTAAAGACATTTGATCCAACCAAAACTCCAGGAGTCGGTTCAATCGGTCATGATTTAGGCAAAGGACTAGTTGACCGAGCTTTAAAAGGATTTGCGAAAGCTATTTCTGATTTAGCTGACAACTTCGGTGGAGCTGGTGGCAGTGTAGGAAACCCTGCAGGTAGCTCGGTTTCACGATGGAAGCCATATGTTGTTCGGGCACTTAAAGCTAATGGTTTTGGTGCTACCGCTAGCCAAGTATCTGCTTGGATGCGTGTTATTGCACGTGAATCAAATGGTAATCCAAGAGCTATTAACTTGTGGGATTCTAACGCTAAAAAGGGTATTCCATCAATGGGCTTAGTTCAAACTATTCGGCCAACATTTGAAGCATATAAATTTTCAGGACATGGTCAGATTTATAATGGGTATGATGACTTATTAGCTGGGATTAACTATATGAAACATATATATGGTAAAGGCGACAGTGCATTCGCTAGGGTAAGCGGCCCTGAAGGATATGCAAATGGTGGTTTCGGAAACAAAGCGGGCGTTTACAAATTGTTTGAAGGCAACTTGCCAGAAGCCATAGTTCCGATGGACTTATCTAAGCGTTCAAGGGCTTACCAAATTATGCAACAGATAATGGCTAAGTTCGGAGCTCAAGATGGCGCTAATGTGATAAATACCGGTAACGACCAGATTGATTCCGACGAAGCATTCAAACAGCGGGTTATAGCTTCACTAGATGCTTTGGTCACTGGCCAAGGAGATGTTAAAGCAGTTGTTGCCAACTCTGACGTGGTTAATGCTGTCAAGTCAAATACCAAGAAGACGTCACAATATAGTCAAATGATGGGGTATTAG

Gene Ontology

Description Category Evidence (source)
GO:0098003 viral tail assembly biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0001f7368b_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5tZKW) rather than this protein.
PDB ID
5tZKW
Method AlphaFoldv2
Resolution 70.69
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50