Protein

Protein accession
A0A2D2W2U6 [UniProt]
Representative
1KOZ4
Source
UniProt (cluster: phalp2_29484)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNLTAAEKLVRHFEGLKLESYLCPAGIWTIGYGQTKGIKKGMVWTKAQAEADMSATVRRFAEEVRQLVRKDTTDAELSAFTSLAYNIGTPNFKTSSALRWHNARESASKVCAGIRMWNKATVNGKKIILKGLVRRREAECSVYTTGDFNE
Physico‐chemical
properties
protein length:150 AA
molecular weight:16765,1 Da
isoelectric point:9,68
hydropathy:-0,37
Representative Protein Details
Accession
1KOZ4
Protein name
1KOZ4
Sequence length
197 AA
Molecular weight
21780,03530 Da
Isoelectric point
9,18360
Sequence
MRYPNAVGAINIINVSLLIALGIAVTAPTVQPDQFVTYAPPPALEVVWPTRELIVYPEVATVSGVDLIKRFEGLRLKAYKCTAKRWTIGYGRAYGVKSGQVITQAEADRMLSVDYAKTKSAVLSAVKADISDQQIEVLVSFAYNVGITRFKESTMLKMINAGDMRGARDQFRFWIHVDGQPIDGLINRRAEEAALFI
Other Proteins in cluster: phalp2_29484
Total (incl. this protein): 4 Avg length: 175,0 Avg pI: 9,19

Protein ID Length (AA) pI
1KOZ4 197 9,18360
1KwAg 197 9,32266
A0A8S5P2Z8 156 8,55852
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28113
7zmZV
50 50,0% 140 1.115E-47
2 phalp2_2632
6RhYr
14867 48,8% 135 1.126E-44
3 phalp2_6830
8n5Jv
121 46,8% 143 2.904E-41
4 phalp2_4451
31DIk
4919 47,5% 141 2.617E-40
5 phalp2_33221
5jbqV
447 45,9% 137 3.227E-39
6 phalp2_126
5jCCA
637 47,3% 133 1.550E-38
7 phalp2_26378
1q5mp
207 43,3% 143 3.576E-37
8 phalp2_14481
4Lj5N
23 44,0% 134 6.697E-37
9 phalp2_4532
3QBcV
9 43,0% 144 1.127E-35
10 phalp2_15045
7iQxx
18 42,7% 138 1.542E-35

Domains

Domains [InterPro]
Disordered region
GH24
Representative sequence (used for alignment): 1KOZ4 (197 AA)
Member sequence: A0A2D2W2U6 (150 AA)
1 197 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Escherichia phage PGT2
[NCBI]
2047782 Autographiviridae > Ermolevavirus > Ermolevavirus PGT2
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MG201401 [NCBI]
CDS location
range 41242 -> 41694
strand +
CDS
GTGAATCTGACCGCAGCGGAGAAGTTAGTTCGACATTTTGAGGGTCTGAAGTTGGAATCGTACCTGTGTCCTGCTGGTATATGGACTATCGGGTACGGGCAGACCAAGGGTATTAAGAAAGGCATGGTGTGGACCAAGGCTCAGGCCGAGGCAGATATGTCTGCCACTGTCCGGCGCTTTGCCGAAGAAGTACGTCAACTGGTCCGCAAGGATACGACCGACGCTGAACTGAGCGCATTCACATCTCTGGCCTATAACATCGGTACTCCGAACTTTAAGACGTCCTCTGCTCTGCGTTGGCATAATGCCCGTGAGTCCGCGTCCAAGGTATGTGCTGGTATCCGTATGTGGAACAAGGCCACCGTCAATGGCAAGAAGATTATCCTGAAAGGTCTGGTACGTCGCCGAGAGGCTGAATGCTCCGTATATACAACTGGAGACTTTAATGAATAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi000c4417d0_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1KOZ4) rather than this protein.
PDB ID
1KOZ4
Method AlphaFoldv2
Resolution 79.69
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50