Protein

Protein accession
B5BTX9 [UniProt]
Representative
4CJ0A
Source
UniProt (cluster: phalp2_30334)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAALRQRLAVGVLTASLAGLAFITSGEKREYRAYADPALGWKVPTICDGHTGPDVYRGQRANDQMCDAWRAKDAEVSIKAIRRCSGDAKLTQYEFDALVSLVHNIGPTAYCGSTMSRLIREGKLDQVPGQFDRWVYSGGRKLRGLVNRRQSERNLWERGDYGTVR
Physico‐chemical
properties
protein length:165 AA
molecular weight:18424,7 Da
isoelectric point:9,57
hydropathy:-0,48
Representative Protein Details
Accession
4CJ0A
Protein name
4CJ0A
Sequence length
207 AA
Molecular weight
22330,21500 Da
Isoelectric point
4,86459
Sequence
MTITLAQAEDMLRAEVERFEKAVNRLVEVPLSQNQFDALVSFAYNVGDGALARSTLLRLLNQGDYDAVPGQLARWTHGGGRVLPGLVKRRRMEGELWMSPAAQIYEIEEPMAQDVTEEPVGSNVMLEMIIGLARHLVTGAGGLTAASSGGDASAPATWLGLTIFVAGVAMSVFDKIKREGDASILNVVRESIDAIHDRLDEKDGPKA
Other Proteins in cluster: phalp2_30334
Total (incl. this protein): 8 Avg length: 170,0 Avg pI: 8,82

Protein ID Length (AA) pI
4CJ0A 207 4,86459
A0A2K9VI88 165 8,92857
A0A6J5P5R3 163 9,77097
A0A9E7QXY8 165 9,35051
A0A9E7QYB1 165 9,35051
A0A9E8G7G4 165 9,35051
A0AAF0CXD4 165 9,35051
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13857
7s4x2
6 44,2% 140 2.028E-26
2 phalp2_14618
5vvs4
11 40,2% 154 2.390E-20
3 phalp2_4127
1EJtW
7 34,6% 205 8.243E-20

Domains

Domains [InterPro]
GH24
Disordered region
Representative sequence (used for alignment): 4CJ0A (207 AA)
Member sequence: B5BTX9 (165 AA)
1 207 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Ralstonia phage RSB1
[NCBI]
551790 Autographiviridae > Higashivirus > Higashivirus RSB1
Host Ralstonia solanacearum
[NCBI]
305 Proteobacteria > Betaproteobacteria > Burkholderiales > Burkholderiaceae > Ralstonia >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
AB451219 [NCBI]
CDS location
range 40992 -> 41489
strand +
CDS
ATGGCCGCGCTCCGTCAACGCCTTGCCGTGGGTGTGCTCACGGCCTCGCTGGCGGGGCTGGCATTCATCACGTCCGGCGAGAAGCGGGAGTATCGGGCATACGCCGACCCCGCCCTCGGCTGGAAGGTGCCCACCATCTGCGATGGGCACACCGGCCCGGACGTGTATCGAGGCCAGCGTGCCAACGACCAGATGTGCGACGCATGGCGGGCCAAGGATGCCGAGGTGTCCATCAAGGCTATCCGTCGCTGCTCCGGCGACGCTAAGCTAACCCAGTACGAGTTCGACGCTCTCGTGTCGCTCGTGCATAACATCGGACCCACCGCCTACTGCGGGAGCACCATGAGTCGCTTGATCCGCGAGGGCAAGCTGGATCAGGTACCCGGCCAGTTCGACCGCTGGGTGTACTCAGGAGGTAGGAAGCTGCGCGGCCTCGTAAATCGACGCCAGAGCGAACGAAACCTGTGGGAGAGGGGTGACTATGGGACTGTTCGATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi00017f650c_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4CJ0A) rather than this protein.
PDB ID
4CJ0A
Method AlphaFoldv2
Resolution 73.77
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50