Protein

Protein accession
I7K3G5 [UniProt]
Representative
6k2xA
Source
UniProt (cluster: phalp2_23419)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSAIRQSMIYCSVAAVIGIVTTVYPNDLQTSRAGLELIASYENCVSCTYKDSIGKNTIGIGSTRGLDGKPVPNNQLLSNDDIARLLVRDIKESEECVIKYFNGQKMPQPVFDSVVSLVYNNGCYGTRWNKSANRPTFISRYAVSGDWNNVCYRFSDFVNAGGVRSKGLVNRRTAEQKLCLQYRQ
Physico‐chemical
properties
protein length:184 AA
molecular weight:20488,1 Da
isoelectric point:9,10
hydropathy:-0,27
Representative Protein Details
Accession
6k2xA
Protein name
6k2xA
Sequence length
273 AA
Molecular weight
30341,36000 Da
Isoelectric point
9,43561
Sequence
MKTSNVGIELIQKYEGCMLTAYRCPAGVLTIGYGHTGKVDGQEIYRGMKITKAKAIELLKSDLSKFEKHVMKYDSIYHYNQNQFDALVSFSFNIGRIDQLTANGTRSISQISSKIPAYNKAGGKVLAGLVKRRNEEKLLFDSSISITTQETSVGTTYTVVNGDTLSGIGHKVGMNWRKIAEINSIQSPYTIHPGQVLKLTETTANNNGSKFPYKIIVATEKDNLNVRSGPGLNYKKVKSLKKGSENSIIEEENGWGRLEDKSGWVCLEYIKRI
Other Proteins in cluster: phalp2_23419
Total (incl. this protein): 3 Avg length: 213,7 Avg pI: 9,21

Protein ID Length (AA) pI
6k2xA 273 9,43561
A0A1L7DQJ6 184 9,09592
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10187
xiY3
70 51,1% 180 1.169E-39
2 phalp2_34008
81fvs
10 44,2% 183 1.595E-39
3 phalp2_28113
7zmZV
50 45,5% 193 7.272E-34
4 phalp2_40316
3fQk2
50 37,1% 277 1.840E-33
5 phalp2_14481
4Lj5N
23 34,2% 216 1.419E-23
6 phalp2_24849
7vCQq
8 36,4% 203 2.211E-22
7 phalp2_20210
8eGjW
19 33,6% 214 1.809E-16
8 phalp2_19412
2Foe6
6 33,1% 208 2.441E-16
9 phalp2_38353
ATmM
19 29,3% 184 1.090E-11
10 phalp2_19992
6T980
5 33,8% 183 1.090E-11

Domains

Domains [InterPro]
Representative sequence (used for alignment): 6k2xA (273 AA)
Member sequence: I7K3G5 (184 AA)
1 273 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959, PF01476, PF08239

Taxonomy

  Name Taxonomy ID Lineage
Phage Yersinia phage phi80-18
[NCBI]
1206559 Autographiviridae > Pokrovskaiavirus > Pokrovskaiavirus pv8018
Host Yersinia enterocolitica
[NCBI]
630 Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Yersinia >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
HE956710 [NCBI]
CDS location
range 40926 -> 41480
strand +
CDS
ATGAGTGCTATCAGACAGTCGATGATTTATTGTAGCGTAGCAGCCGTTATAGGCATTGTTACGACCGTTTATCCCAATGACCTTCAAACATCAAGAGCAGGTCTTGAGCTTATAGCAAGTTATGAGAATTGTGTAAGCTGTACGTACAAAGACAGCATCGGTAAGAATACCATTGGGATAGGCAGTACTCGCGGCCTGGATGGTAAGCCGGTTCCAAACAACCAACTACTATCCAATGACGACATCGCCCGGTTACTGGTGCGTGATATCAAAGAATCTGAAGAATGTGTAATCAAGTACTTTAATGGTCAGAAGATGCCACAGCCAGTATTTGATAGTGTGGTTTCGTTAGTGTACAACAATGGCTGCTATGGCACCCGTTGGAACAAGAGTGCCAATCGCCCGACATTCATCTCCAGGTATGCAGTCTCTGGTGACTGGAATAATGTATGCTACCGGTTCAGCGACTTCGTTAATGCCGGTGGTGTACGTAGTAAGGGTTTAGTTAATCGACGTACAGCGGAGCAGAAGCTATGTCTACAGTACCGGCAGTAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi00026dade1_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6k2xA) rather than this protein.
PDB ID
6k2xA
Method AlphaFoldv2
Resolution 87.92
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50