Protein

Protein accession
A0A509EQQ2 [UniProt]
Representative
4LOCD
Source
UniProt (cluster: phalp2_24503)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MANRRIGQAGLALIKQFESCRLIAYQCSAGVWTIGYGHTVGVYKGMKITQKKAEAYLLQDVAKFEKYINNPSYVPFTAQLNQNQFDALVSFAFNLGQGNVKKLCTGRVMNQIPSAMQRYCKAAGKTLPGLQRRRKAEAALYNKKVESCTGATTTTVKESEDYSMNTIKKGSKGNAVKVWQIIIGTTADGIFGSGTENMTKTWQKNHGLMADGIVGKNSWKTGLESL
Physico‐chemical
properties
protein length:226 AA
molecular weight:24811,3 Da
isoelectric point:9,76
hydropathy:-0,35
Representative Protein Details
Accession
4LOCD
Protein name
4LOCD
Sequence length
153 AA
Molecular weight
17182,25270 Da
Isoelectric point
4,60540
Sequence
MVITQEQAEQMLANDLGIYEAHVNNPALVPFTEQLNQNQFDALVSFCYNCGSGNLRGLCEGISIDEVPNVLPLYNKAGAKTLTGLVRRRQAEVDLYVKEVEPVLDKGVYQTIYNTWLKPSWDEAFAAKDTAQCDYIHWLAMELRKAAGVEEPE
Other Proteins in cluster: phalp2_24503
Total (incl. this protein): 3 Avg length: 201,7 Avg pI: 8,02

Protein ID Length (AA) pI
4LOCD 153 4,60540
A0A8S5N9H9 226 9,68820
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13104
8kSiJ
1 46,8% 96 8.126E-25
2 phalp2_37101
1bvx6
877 50,0% 100 3.927E-21
3 phalp2_33221
5jbqV
447 48,1% 106 4.835E-20
4 phalp2_4451
31DIk
4919 43,2% 104 9.056E-20
5 phalp2_2632
6RhYr
14867 47,6% 105 3.175E-19
6 phalp2_39267
4724r
247 44,6% 103 7.293E-18
7 phalp2_31105
1YRHZ
456 46,4% 99 7.293E-18
8 phalp2_14757
6QRn1
45 42,4% 99 1.221E-16
9 phalp2_12161
6q0L8
1 41,8% 98 7.977E-16
10 phalp2_23209
7Cg0p
1 37,8% 103 1.091E-15

Domains

Domains [InterPro]
GH24
Unannotated
Disordered region
Representative sequence (used for alignment): 4LOCD (153 AA)
Member sequence: A0A509EQQ2 (226 AA)
1 153 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Roseburia phage Jekyll
[NCBI]
2570912 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
LR596902 [NCBI]
CDS location
range 1874 -> 2554
strand -
CDS
ATGGCAAATAGAAGAATCGGACAGGCCGGTCTTGCACTTATTAAACAGTTTGAGAGTTGCCGGCTGATAGCCTATCAGTGTTCTGCAGGTGTGTGGACCATTGGATACGGTCATACAGTGGGCGTATACAAAGGGATGAAGATCACACAGAAAAAGGCGGAAGCCTATCTGTTGCAGGACGTGGCGAAGTTTGAAAAGTATATTAATAATCCGTCCTATGTCCCGTTTACGGCTCAGCTTAATCAGAATCAGTTTGATGCACTGGTCAGCTTTGCTTTCAACTTGGGACAGGGCAATGTGAAAAAACTGTGTACAGGCAGAGTAATGAATCAGATACCGTCTGCAATGCAGCGGTACTGTAAGGCTGCCGGTAAAACATTACCAGGATTACAGCGGAGAAGAAAAGCCGAGGCAGCTCTTTATAATAAGAAAGTAGAGAGTTGCACCGGTGCAACCACTACCACAGTGAAAGAAAGTGAGGATTACAGTATGAATACAATTAAAAAAGGCAGCAAGGGCAATGCAGTTAAGGTATGGCAGATCATCATCGGTACGACGGCGGATGGCATTTTCGGTAGCGGCACGGAGAACATGACAAAGACTTGGCAGAAGAACCATGGACTGATGGCTGATGGAATTGTTGGAAAGAACTCTTGGAAAACGGGGTTAGAGTCGTTATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
A0A509EQQ2
Method SMR
Resolution
Chain position
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi00018a194e_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4LOCD) rather than this protein.
PDB ID
4LOCD
Method AlphaFoldv2
Resolution 94.80
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50