Protein

Protein accession
I3UMC3 [UniProt]
Representative
5Ij4M
Source
UniProt (cluster: phalp2_4883)
Protein name
Lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MITIDQYKKEERKQTIHRSINTALSVVIVVFGFFFIKAQVTPMYEQLTGITEQKQINKMIYSEIVQINADEGYRRCVYNDTEGYPTVGWGHKLLPHEKFKCIDPTYAVALLRKDYEYAKRDVKSRYPWANDDLQLVLINMTFNMGAIGLSKFEDTLAFLKAEEYDLAAGELLDSKYARKLPRRASSMAVRIMRLQQ
Physico‐chemical
properties
protein length:196 AA
molecular weight:22793,1 Da
isoelectric point:8,91
hydropathy:-0,33
Representative Protein Details
Accession
5Ij4M
Protein name
5Ij4M
Sequence length
192 AA
Molecular weight
22070,23610 Da
Isoelectric point
6,22099
Sequence
MFTKEEYEKAERSQTLHRSMNTILLVLVLILGFFFVKSEVQPLYQEITGVTEQRAIESEIVQIVEDEGYKRCVYKDSLGLPTVGFGHLMLPTETFKCIDSHYAVKLLRKDYTDAQLDVETRYPWADSDVKLVLINLTFNMGSTRLSKFTKTLSHLKDGEHDLAAGELLDSVYARQVPRRASMMAARIMRLQQ
Other Proteins in cluster: phalp2_4883
Total (incl. this protein): 25 Avg length: 194,6 Avg pI: 7,03

Protein ID Length (AA) pI
5Ij4M 192 6,22099
2BfTs 194 6,23827
2LkXV 176 5,39012
4CAIB 197 6,45420
4CPAe 180 6,95956
4GxJD 181 8,95403
4JiBF 178 6,32080
6MZsn 181 9,13860
7Pxyb 196 8,91090
7VfEQ 195 7,73268
81nxd 199 5,85347
86fnX 197 6,84816
8Cfot 197 6,84816
8DWDv 180 9,41852
8Eglk 195 7,73268
8HiGH 196 5,68267
8kZ1p 230 8,71640
8wSDn 211 5,98477
8xRyv 211 5,78174
8xcWP 194 5,87837
WQEn 179 8,95403
eOAU 195 6,19564
qJyl 220 4,99083
A0A1W5PTQ1 196 5,68267
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_17008
8jNgs
3526 41,4% 128 5.434E-31
2 phalp2_27968
RI1z
5083 41,8% 122 2.847E-28
3 phalp2_32797
2yPO8
5731 36,7% 128 3.892E-28
4 phalp2_16834
1uDRT
24 37,0% 127 9.948E-28
5 phalp2_21661
31ima
875 34,5% 133 4.750E-27
6 phalp2_30135
3hDmC
1 32,1% 168 1.659E-26
7 phalp2_9780
80F6A
154 33,9% 165 2.267E-26
8 phalp2_19234
42usC
219 38,3% 133 3.098E-26
9 phalp2_24334
3Qw2p
463 33,8% 139 9.617E-25
10 phalp2_2924
QDRo
8927 35,3% 130 2.453E-24

Domains

Domains [InterPro]
Representative sequence (used for alignment): 5Ij4M (192 AA)
Member sequence: I3UMC3 (196 AA)
1 192 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Colwellia phage 9A
[NCBI]
765765 Franklinbayvirus > Franklinbayvirus fv9A
Host Colwellia psychrerythraea 34H
[NCBI]
167879 Proteobacteria > Gammaproteobacteria > Alteromonadales > Colwelliaceae > Colwellia >
Host Colwellia
[NCBI]
28228 Proteobacteria > Gammaproteobacteria > Alteromonadales > Colwelliaceae >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
HQ317390 [NCBI]
CDS location
range 22214 -> 22804
strand -
CDS
ATGATAACCATAGACCAGTATAAGAAAGAGGAACGTAAGCAAACTATTCACAGAAGCATTAATACTGCACTTTCTGTAGTTATAGTGGTCTTTGGGTTTTTCTTTATCAAAGCACAAGTAACACCAATGTATGAACAGCTTACAGGTATTACAGAACAAAAGCAAATCAATAAAATGATTTATAGTGAGATTGTACAGATAAATGCTGATGAAGGTTATCGCAGGTGTGTCTATAACGATACTGAGGGCTATCCAACAGTTGGTTGGGGTCACAAGCTATTACCTCATGAAAAGTTTAAATGTATTGATCCTACGTATGCTGTAGCCCTTCTTAGAAAAGACTATGAGTATGCAAAGCGAGATGTTAAAAGTAGATACCCTTGGGCGAATGATGATTTACAATTAGTATTAATCAACATGACCTTTAATATGGGTGCTATTGGATTATCCAAATTTGAAGATACTTTAGCTTTCCTTAAGGCTGAAGAGTATGATTTAGCTGCAGGTGAATTATTAGATAGTAAATACGCCCGTAAACTACCCCGTAGAGCCTCTTCGATGGCAGTACGCATAATGAGGCTACAACAATGA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016020 membrane cellular component None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
RuleBase:RU003788

Tertiary structure

PDB ID
I3UMC3
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi000263373a_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5Ij4M) rather than this protein.
PDB ID
5Ij4M
Method AlphaFoldv2
Resolution 93.84
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50