Protein

Protein accession
Q5G7N2 [UniProt]
Representative
4B1BD
Source
UniProt (cluster: phalp2_13430)
Protein name
Lysozyme
Lysin probability
100%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MSFTKVIAFEEGFRAKPYKDSLGYPTIGYGIKLANKNANIDHFECEIPEPVARMWLESHALKEEAKLCQYAWFIDQPKDVQDILVSMCYQLGLKGLLGFKKMIAALCVNDTETAAKEALDSKWARSDSPNRAQRHARVIRGESIDDVYGGLI
Physico‐chemical
properties
protein length:152 AA
molecular weight:17130,5 Da
isoelectric point:6,30
hydropathy:-0,28
Representative Protein Details
Accession
4B1BD
Protein name
4B1BD
Sequence length
163 AA
Molecular weight
18258,41100 Da
Isoelectric point
8,93927
Sequence
MGTRQEEGFRKKIYTDPSDKGNKAVGTGFNLDDPVTAAMVPKEVREGKRGITRAEDAVTYKNRMDLAKKDAASYLGSKTYKALDQDRKDVINDMSYNMGLPSLSGFTDLKAAIVAKDYDTAAKEILDSDYARKDVPARARRNAEKMRGKERELETEILRQAIE
Other Proteins in cluster: phalp2_13430
Total (incl. this protein): 32 Avg length: 158,6 Avg pI: 7,77

Protein ID Length (AA) pI
4B1BD 163 8,93927
16G8h 153 5,06642
1K459 149 7,77048
1fTg0 153 5,24655
1mF6V 151 7,88895
1mvUz 150 8,91644
1nNNh 181 9,15498
2S8ev 159 5,43622
2aodK 149 8,51745
2vche 170 5,87769
35hgc 175 9,62044
3lwzm 137 5,33914
3ndpu 161 9,47442
3nf5Q 164 9,13396
4Hsjy 161 8,94114
4NHwH 137 7,84446
4Nq3K 149 8,53241
5F4JZ 158 9,51910
5HtR 191 7,65669
5dOTV 159 7,68670
5nHxd 152 7,89114
6AJVg 153 5,73860
6DcKe 181 9,34432
6Ixj3 149 8,54820
7Y9sx 156 8,96112
8iqiM 165 5,56808
8nZEC 159 8,99355
94gb 170 6,78137
d2fg 147 9,14408
kxDc 170 6,60875
A0A9X9JSG8 152 8,35512
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_27968
RI1z
5083 38,5% 148 6.542E-41
2 phalp2_32797
2yPO8
5731 34,9% 146 2.093E-39
3 phalp2_17008
8jNgs
3526 37,5% 144 1.011E-38
4 phalp2_9780
80F6A
154 31,0% 145 3.566E-38
5 phalp2_6928
2KMCh
358 30,0% 133 4.016E-36
6 phalp2_7697
6I6lz
232 33,9% 156 1.282E-34
7 phalp2_24213
2KXlJ
83 30,7% 140 1.756E-34
8 phalp2_8442
12q1i
28 26,5% 147 2.984E-33
9 phalp2_6283
6Psj3
6 32,1% 143 3.699E-32
10 phalp2_1386
1iVBH
20 30,4% 138 9.507E-32

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4B1BD (163 AA)
Member sequence: Q5G7N2 (152 AA)
1 163 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Listonella phage phiHSIC
[NCBI]
310539 No lineage information
Host Vibrio pelagius
[NCBI]
28169 Proteobacteria > Gammaproteobacteria > Vibrionales > Vibrionaceae > Vibrio >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
AY772740 [NCBI]
CDS location
range 26214 -> 26672
strand -
CDS
ATGAGCTTTACAAAAGTTATCGCATTCGAAGAAGGTTTTCGAGCTAAGCCATACAAGGATTCGCTAGGTTATCCAACTATTGGATACGGAATCAAATTGGCAAATAAAAACGCCAATATCGACCACTTTGAATGCGAAATTCCTGAGCCAGTAGCGCGCATGTGGCTTGAGTCTCACGCGCTCAAGGAAGAAGCCAAACTTTGCCAGTACGCATGGTTTATCGACCAACCCAAAGACGTTCAGGACATTCTAGTTTCAATGTGCTACCAGTTGGGCTTAAAAGGCTTACTGGGCTTCAAAAAGATGATTGCTGCTCTATGTGTTAACGATACTGAAACAGCCGCTAAGGAGGCGTTAGATTCGAAATGGGCGCGCAGTGATTCGCCAAACCGCGCACAACGTCACGCACGAGTAATTAGAGGAGAGTCTATTGATGATGTATATGGCGGTCTTATTTGA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0031640 killing of cells of another organism biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
RuleBase:RU003788

Tertiary structure

PDB ID
upi00004c741e_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4B1BD) rather than this protein.
PDB ID
4B1BD
Method AlphaFoldv2
Resolution 91.58
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50