Protein

Protein accession
A0A7S5RBI2 [UniProt]
Representative
4K52y
Source
UniProt (cluster: phalp2_33109)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MANRLQKGSAAAAMAVALVGSFEGLRQNAYPDPATQGQPWTICYGSTNGVKPGDYKTVAQCKALLSLELQKYADGIEKCVTVPLPDSRFVALTSFSYNVGVKAACGSSAVKLINQGKTAEGCEALLKWNRAAGIVFPGLTRRRQKERQFCLEGL
Physico‐chemical
properties
protein length:154 AA
molecular weight:16474,8 Da
isoelectric point:9,34
hydropathy:-0,12
Representative Protein Details
Accession
4K52y
Protein name
4K52y
Sequence length
225 AA
Molecular weight
24554,09450 Da
Isoelectric point
7,64396
Sequence
MSNVELDTQVTKINPKGRGGWVAVTVTVLVLFEGMTNVHGGKSVGEHQSFDPKGVNTVCYGHIEDVHIGDTYTKIQCEEMLAKDLPRYEKMVERCIHVPMPGYRHAAILSFTYNIGGGGLCKSSVARYINAGSPEKGCDALLLYDKAAGRVLPGLVKRRQAERKMCYNVGEPPLPDAEVMKEAHVNVDQATAVKEAKKIDLTEPVVKPATPAPANSGVLHWLMEK
Other Proteins in cluster: phalp2_33109
Total (incl. this protein): 18 Avg length: 203,7 Avg pI: 8,85

Protein ID Length (AA) pI
4K52y 225 7,64396
34mnR 163 9,55900
3LWBQ 192 8,58153
4Fogc 219 9,05937
4GQVO 231 8,45517
4IEkF 199 8,92869
4IJiB 208 9,07136
4JAmW 201 9,29842
4KIHP 219 8,23869
6DPs3 208 9,30010
6UiVx 212 7,66300
6W4xi 224 8,81884
6WSJj 238 9,18083
6WtP3 235 9,20649
729kk 163 9,03036
86XP5 222 8,45614
A0A7S5UTE5 154 9,45379
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_4305
8aoFg
2477 45,3% 150 1.195E-46
2 phalp2_3556
4E0oc
53 45,5% 158 1.182E-43
3 phalp2_26525
83EoO
66 38,6% 145 3.963E-37
4 phalp2_11002
4Mz75
307 42,0% 145 3.019E-32
5 phalp2_21229
1fv51
17 32,7% 174 2.369E-30
6 phalp2_22286
7A6Fj
714 34,5% 168 2.094E-29
7 phalp2_21319
1KsyR
184 30,9% 155 3.901E-29
8 phalp2_16574
7xiDE
15 35,4% 158 2.221E-27
9 phalp2_4532
3QBcV
9 34,0% 141 2.335E-25
10 phalp2_2176
3Yhf4
930 33,3% 147 4.342E-25

Domains

Domains [InterPro]
GH24
Disordered region
Representative sequence (used for alignment): 4K52y (225 AA)
Member sequence: A0A7S5RBI2 (154 AA)
1 225 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Rhizobium phage RHph_Y25
[NCBI]
2509588 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MN988538 [NCBI]
CDS location
range 17097 -> 17561
strand +
CDS
ATGGCAAACCGCCTTCAGAAGGGTAGCGCTGCGGCTGCAATGGCCGTGGCGCTGGTCGGCAGCTTCGAAGGGCTTCGGCAGAACGCCTATCCCGATCCGGCCACACAGGGCCAGCCGTGGACGATCTGCTACGGCAGCACGAACGGCGTGAAGCCGGGAGACTACAAGACTGTCGCGCAGTGCAAGGCGCTTCTGTCGCTAGAACTGCAAAAGTACGCCGATGGCATCGAGAAGTGCGTCACGGTCCCGCTGCCGGATTCGCGCTTCGTCGCCCTCACGTCATTTTCCTACAACGTCGGCGTCAAGGCTGCATGTGGCTCCAGCGCGGTCAAGCTCATCAATCAGGGCAAGACGGCGGAAGGCTGCGAAGCGCTGTTGAAATGGAACCGCGCCGCTGGGATCGTCTTCCCCGGCCTGACCCGCCGCCGGCAGAAGGAACGCCAGTTCTGCCTTGAGGGCCTTTGA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4K52y) rather than this protein.
PDB ID
4K52y
Method AlphaFoldv2
Resolution 73.51
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50