Protein

Protein accession
S5MMQ8 [UniProt]
Representative
4E5Bh
Source
UniProt (cluster: phalp2_13435)
Protein name
Tail protein
Lysin probability
100%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MVVQRYKPNAEVTFFTEEGQLVARGVADPNSKVDNDIVAVYTNRDIGEDAPVFNITLTNRKPWHRWITANDMLIIKMCRPPEALAEVMFGLVDFAGKTVDANNDAPSRTISVKGRGFAKAFIQFDIGIVPEAQFNIEKLGWVQTLGITLDQATPDQLAKAAYDKIAKPFINYKWKGSKALFDILKTKFSARKDMKLLDTSGLMAWQGSLLGMYNAIAEKPFHEIFYEVENGSPTMVIRETPFNKDKWDKLPSVEIGDQDVVTDDTGRGDLETYTMFSVSAKTLMAPDDMFKTFGVRPYWYPPYKNKYGIRRLTVETSYLAVNGTPTGGGTGTGGTGVGAGGTTGAPLNPDPPGVGGNTGGNNGGTGTNPTNPTTPTDPSQPNAGNGSQTTTNQDGSTGTTPANGNGTGSVDLMKGLMEDLYNWNILNNHFYSGNLVVKGSNKYKVGTRLVYKSVEDNSTIEYYIKSVTQNFNTFGAWVTTLGVIRGCEPSKRFSPPVGKFEQYEGHGFLGDNSTIAEQKASGGLPNLNDLWSQIFGGMFGGGGLLGGLIPGLGGGLGVGIDGSAAQKVVAGAQSILQNGINGVRVRYTFGGGNPASGALDCSSFTQYVYKTYAGIDIGRVTGEQVKKGTEVSKQNLQPGDLVFFKNTYNSGYIYGVSHVGIYVGNGNFIENSSSKSVTLTALSNSYATAHWLMGRRVLAASTGGGGTGGGAGGAGNGQVSAGAGGTKFIATVYSSPNIDNYSPNTTTAVGAPTVEGVTIAVDPKVIPLHSQVQITCPSYPAVNGTYTAQDTGSAIKGNRIDIYWEGRPPRNAEAVKKAMNNFGKKEVFVKVLRYGKG
Physico‐chemical
properties
protein length:837 AA
molecular weight:88997,9 Da
isoelectric point:8,67
hydropathy:-0,31
Representative Protein Details
Accession
4E5Bh
Protein name
4E5Bh
Sequence length
447 AA
Molecular weight
48387,89020 Da
Isoelectric point
5,80920
Sequence
MAIQRYKPNAEVTFFTEEGQLVARGVADPNSKIDNDIVAVYTNRDIGEDAPVFNITLTNRKPWHRWITANDMLIIKMCRPPEALAEVMFGLVDYAGKTVDANNDAPSRTVSVKGRGFAKAFIQFDIGIVPEAQFNIEKLGWVQTLGITLDQATPDELAKAAYEKIAKPFINYKWKGSKALFDILKTKFSARKDMKLLDTSGLMAWQGSLLGMYNAIAEKPFHEIFYEVEGGSPTMVIRETPFNKDKWDKLPSVSVGDQDVVTDDTGRGDLETYTMFSVSAKTLMAPDDMFKTFGVRPYWYPPYKNKYGIRRLTVETSYLAVNGTPSGGAGAGGTGIGADGTTGAPLNPDPPGVGGNTGGNNGGTGTNPTNPTAPTDPSQPNSGNGNQGTTNQDGSTGITPANGNGTGSVDLMKGLMEDLYNWNILNNHFYSGNIVVKGSNKYKVGTR
Other Proteins in cluster: phalp2_13435
Total (incl. this protein): 16 Avg length: 706,6 Avg pI: 8,15

Protein ID Length (AA) pI
4E5Bh 447 5,80920
4EqLp 498 5,83830
A0A217EQL1 722 8,29568
A0A068EMN7 838 8,42790
A0A7G5CGJ0 722 8,67520
A0A7G5CH64 722 8,67520
A0A7T3N4C4 722 8,47071
B6V2P9 722 8,29961
S5MM68 837 8,66547
A0A9Y1YTI2 722 8,30754
A0A9Y1YTS9 722 8,30754
A0A9Y1YUM0 629 8,33874
A0AAE9YD03 722 8,78287
A0AAE9YIR9 722 8,29568
A0AAU8BES1 722 8,59907
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_33582
8LHfr
135 28,0% 331 2.911E-42

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Bacillus phage Shanette
[NCBI]
1296656 Herelleviridae > Siminovitchvirus > Siminovitchvirus shanette
Host Bacillus cereus
[NCBI]
1396 Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus > Bacillus cereus group

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KC595513 [NCBI]
CDS location
range 56998 -> 59511
strand +
CDS
ATGGTAGTTCAACGATATAAACCTAATGCCGAGGTCACTTTCTTTACTGAAGAGGGTCAGCTAGTAGCAAGGGGTGTAGCTGACCCTAATAGTAAGGTAGACAATGACATTGTGGCGGTCTACACTAATAGAGATATAGGGGAAGATGCTCCTGTATTCAATATAACCCTCACTAACCGAAAGCCTTGGCATAGGTGGATTACTGCAAATGATATGTTAATTATCAAAATGTGCAGACCACCTGAAGCCTTGGCTGAAGTTATGTTTGGACTAGTAGACTTTGCAGGCAAGACTGTAGACGCTAACAACGATGCGCCGTCTCGTACTATTTCAGTAAAGGGCAGAGGGTTTGCTAAAGCATTCATACAGTTTGATATTGGTATCGTGCCTGAAGCACAGTTCAATATTGAAAAGCTAGGTTGGGTTCAGACTCTTGGTATTACACTAGACCAAGCAACCCCAGACCAGCTAGCAAAAGCAGCATACGATAAGATAGCCAAGCCTTTCATAAACTACAAGTGGAAAGGCTCTAAAGCTTTATTCGACATACTGAAAACTAAGTTTAGTGCAAGAAAAGATATGAAATTACTAGACACGTCTGGTTTAATGGCATGGCAAGGTAGTTTGCTAGGTATGTACAATGCTATAGCAGAAAAGCCTTTCCATGAGATATTCTACGAGGTAGAGAATGGCTCCCCTACTATGGTTATTAGAGAGACTCCTTTCAATAAGGACAAGTGGGATAAGTTGCCTTCTGTAGAAATTGGAGACCAAGACGTAGTGACGGACGATACAGGCAGAGGAGACCTAGAGACTTACACAATGTTCTCTGTAAGTGCCAAGACGTTAATGGCTCCAGATGATATGTTTAAGACCTTTGGGGTTCGTCCATACTGGTATCCACCATATAAGAATAAATATGGTATCAGACGCTTGACAGTAGAGACTTCTTATCTAGCAGTAAATGGTACCCCTACTGGAGGAGGAACGGGTACAGGTGGTACAGGTGTAGGTGCTGGAGGTACTACAGGTGCTCCCCTTAATCCTGACCCTCCAGGTGTGGGCGGTAATACAGGAGGTAACAACGGAGGTACTGGTACTAACCCAACCAATCCAACTACTCCTACTGACCCTAGCCAACCAAATGCTGGTAACGGTAGCCAAACTACTACAAACCAAGATGGTTCTACGGGTACAACACCTGCTAATGGTAACGGTACTGGTTCTGTTGACTTGATGAAAGGTCTTATGGAAGACCTATATAATTGGAATATTCTTAATAACCATTTCTATAGTGGAAATCTCGTTGTAAAAGGCAGTAACAAATACAAGGTGGGAACAAGACTTGTATATAAATCGGTAGAAGACAATTCCACTATTGAATATTACATTAAGTCAGTCACCCAGAATTTCAATACCTTTGGTGCATGGGTTACAACTCTAGGGGTGATTAGAGGGTGTGAACCATCTAAACGCTTTAGCCCACCAGTAGGCAAGTTCGAACAGTACGAAGGTCATGGCTTCTTAGGTGACAACAGTACTATTGCAGAGCAAAAAGCTAGTGGCGGTTTACCTAACCTCAATGACCTGTGGAGTCAGATATTTGGAGGTATGTTCGGTGGAGGCGGTCTACTAGGGGGCTTAATACCTGGACTAGGTGGTGGATTAGGAGTAGGTATAGATGGTAGTGCTGCACAGAAGGTAGTAGCAGGTGCCCAGAGTATCCTACAAAATGGTATCAATGGTGTGAGAGTTCGCTATACGTTCGGTGGAGGTAACCCTGCATCAGGTGCTCTAGACTGTTCATCCTTCACTCAATACGTTTACAAGACTTATGCAGGTATAGATATTGGCAGGGTTACTGGGGAACAGGTTAAGAAAGGTACTGAAGTATCTAAGCAAAATCTACAACCAGGTGACTTAGTATTCTTCAAGAATACTTACAATAGTGGATACATCTATGGGGTTTCTCACGTAGGTATTTATGTAGGTAATGGTAACTTTATTGAGAACTCTAGTTCTAAGTCAGTTACCCTAACTGCTTTAAGTAATTCCTATGCTACTGCCCACTGGCTGATGGGAAGGCGTGTACTAGCAGCCTCCACTGGTGGAGGAGGTACAGGCGGTGGTGCTGGAGGTGCAGGTAATGGTCAAGTATCTGCTGGTGCTGGTGGTACTAAGTTCATAGCGACTGTGTATAGTTCACCAAACATTGACAATTACTCACCAAACACTACTACTGCTGTAGGTGCTCCTACTGTAGAAGGTGTTACTATAGCAGTTGACCCTAAAGTCATACCACTACACAGTCAGGTACAGATTACTTGTCCTTCCTATCCAGCAGTCAATGGTACCTATACTGCACAGGATACAGGTAGTGCAATCAAAGGTAACCGTATTGATATCTACTGGGAAGGTAGACCACCTAGGAATGCGGAAGCAGTTAAGAAGGCTATGAATAACTTCGGTAAGAAGGAAGTCTTTGTTAAGGTACTAAGATACGGGAAAGGGTGA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds molecular function None (UniProt)
GO:0008234 cysteine-type peptidase activity molecular function None (UniProt)
GO:0009254 peptidoglycan turnover biological process None (UniProt)
GO:0019867 outer membrane cellular component None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00035ab025_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4E5Bh) rather than this protein.
PDB ID
4E5Bh
Method AlphaFoldv2
Resolution 78.47
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50