Protein

Protein accession
A0A2K9VI88 [UniProt]
Representative
4CJ0A
Source
UniProt (cluster: phalp2_30334)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSKVGTAVRAAVATLAISALGIGALQKEEGTVNRVYLDSIGKPTVCTGHMDTRMKVGTYYSDADCALLLSQDTASAAATVRRLVKVPLYQHEFDALVLFCFNVGDGNCRSATLFKLLNNGDYNAVPGQLLRWRYAGGLDCKVRSNNCYGVYLRRQHEGDIWRNGY
Physico‐chemical
properties
protein length:165 AA
molecular weight:18074,5 Da
isoelectric point:8,93
hydropathy:-0,09
Representative Protein Details
Accession
4CJ0A
Protein name
4CJ0A
Sequence length
207 AA
Molecular weight
22330,21500 Da
Isoelectric point
4,86459
Sequence
MTITLAQAEDMLRAEVERFEKAVNRLVEVPLSQNQFDALVSFAYNVGDGALARSTLLRLLNQGDYDAVPGQLARWTHGGGRVLPGLVKRRRMEGELWMSPAAQIYEIEEPMAQDVTEEPVGSNVMLEMIIGLARHLVTGAGGLTAASSGGDASAPATWLGLTIFVAGVAMSVFDKIKREGDASILNVVRESIDAIHDRLDEKDGPKA
Other Proteins in cluster: phalp2_30334
Total (incl. this protein): 8 Avg length: 170,0 Avg pI: 8,82

Protein ID Length (AA) pI
4CJ0A 207 4,86459
B5BTX9 165 9,56861
A0A6J5P5R3 163 9,77097
A0A9E7QXY8 165 9,35051
A0A9E7QYB1 165 9,35051
A0A9E8G7G4 165 9,35051
A0AAF0CXD4 165 9,35051
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13857
7s4x2
6 44,2% 140 2.028E-26
2 phalp2_14618
5vvs4
11 40,2% 154 2.390E-20
3 phalp2_4127
1EJtW
7 34,6% 205 8.243E-20

Domains

Domains [InterPro]
GH24
Disordered region
Representative sequence (used for alignment): 4CJ0A (207 AA)
Member sequence: A0A2K9VI88 (165 AA)
1 207 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Pseudomonas phage PollyC
[NCBI]
2079290 Autographiviridae > Pollyceevirus > Pollyceevirus pollyC
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MG775261 [NCBI]
CDS location
range 38195 -> 38692
strand +
CDS
ATGAGTAAGGTAGGTACGGCAGTTCGTGCTGCTGTAGCTACCCTTGCGATCAGTGCGCTCGGCATTGGCGCCCTTCAAAAAGAAGAAGGGACCGTCAACCGAGTGTATCTCGACAGCATCGGTAAGCCCACGGTGTGTACCGGGCACATGGATACCCGGATGAAGGTCGGGACGTACTACTCCGACGCTGATTGTGCGCTGCTGTTGAGCCAAGACACTGCTAGTGCTGCCGCTACGGTACGCCGACTGGTTAAGGTTCCACTGTACCAGCACGAATTCGACGCCCTTGTGTTGTTCTGCTTCAATGTTGGCGACGGTAACTGCCGATCCGCCACGCTGTTCAAGCTCCTTAACAACGGGGACTATAACGCTGTACCGGGTCAACTGCTTCGCTGGCGTTATGCTGGTGGACTTGACTGCAAGGTACGCTCCAACAACTGCTACGGTGTGTATCTTCGTCGTCAACACGAGGGTGACATCTGGAGGAACGGGTACTGA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi000cdbb2bd_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4CJ0A) rather than this protein.
PDB ID
4CJ0A
Method AlphaFoldv2
Resolution 73.77
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50