Protein

Protein accession
Q3ZFI3 [UniProt]
Representative
8Fk8i
Source
UniProt (cluster: phalp2_33574)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MAKVQFTKRQETSQFFVHCSATKANMDVGVREIRQWHKEQGWLDVGYHFIIRRDGTVEAGRDQDAVGSHVKGYNSTSVGVCLVGGIDAKGNPEANFTPQQMSALNGVLHELRGTYPKAVIMAHHDVAPKACPSFDLQRWVKTGELVTSDRG
Physico‐chemical
properties
protein length:151 AA
molecular weight:16710,7 Da
isoelectric point:8,48
hydropathy:-0,44
Representative Protein Details
Accession
8Fk8i
Protein name
8Fk8i
Sequence length
237 AA
Molecular weight
26664,30600 Da
Isoelectric point
4,97514
Sequence
mqllalanklikwfaetrhfgtptttpvklnqnnmaniilngltkskrsidriflhcsatpedrdvsgedirqwhknkgwrdigyhyvvrldgtieegrsvnldgahargynqgsiavcyigglnkdfdpkdtrtepqlyalhnlllglteiypkatlhghnefsnkacpsfnvqeqyatlinkteqlmiddfqtdiqdheegtdsfadfvndltedkanenavcgiddddceacgs
Other Proteins in cluster: phalp2_33574
Total (incl. this protein): 28 Avg length: 153,9 Avg pI: 7,71

Protein ID Length (AA) pI
8Fk8i 237 4,97514
1Y6k4 139 7,00588
1ufGg 143 6,46455
23lAU 197 5,13423
28rHO 143 6,36951
3aimN 145 8,95867
4rX3r 143 5,96658
4v0Uo 193 5,32129
53tFF 144 9,37353
7GpZ8 192 5,23376
7IVxG 143 7,78554
7UPqF 139 7,08830
7Upg 141 9,26161
Pwu0 139 8,42036
A0A481S247 151 8,45595
A0A2P1CL13 151 8,51455
K4NXC9 151 8,51320
Q6WYE8 151 8,80820
B3VCZ3 151 8,48225
A0A192Y7W2 151 8,44370
A0A5J6TE17 151 8,45595
A0A649WKP5 151 8,45595
A0A7D7F716 151 8,52957
A0A7S6R7F0 151 8,45595
A0A8X8M5L2 151 6,75261
A0A9X9JN87 151 8,80820
A0A9E8G680 109 9,42181
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23766
13ajC
287 55,4% 146 2.377E-63
2 phalp2_37328
8le1e
14 44,0% 177 2.622E-49
3 phalp2_30941
13o5z
7 39,4% 147 2.574E-30
4 phalp2_12261
71VXz
7 29,2% 178 1.216E-29
5 phalp2_39380
4H4oF
116 39,4% 147 7.366E-24
6 phalp2_27876
8EUCV
18 33,5% 179 1.365E-23
7 phalp2_26819
3QMwQ
3 33,5% 158 1.181E-22

Domains

Domains [InterPro]
Representative sequence (used for alignment): 8Fk8i (237 AA)
Member sequence: Q3ZFI3 (151 AA)
1 237 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Enterobacteria phage K11 (Bacteriophage K11)
[NCBI]
532077 Autographiviridae > Przondovirus > Przondovirus K11
Host Klebsiella sp. 390
[NCBI]
1497796 Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Klebsiella >
Host Klebsiella
[NCBI]
570 Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
AY871790 [NCBI]
CDS location
range 1 -> 456
strand +
CDS
ATGGCCAAGGTTCAATTCACTAAGCGACAGGAGACCTCTCAGTTTTTCGTTCACTGTTCCGCCACCAAGGCAAACATGGACGTAGGCGTCCGTGAGATTCGCCAGTGGCACAAAGAGCAGGGCTGGCTGGATGTAGGGTATCACTTCATCATCCGTCGTGACGGTACCGTTGAGGCGGGCCGCGACCAAGACGCTGTGGGTTCACACGTCAAGGGATACAACTCGACCTCTGTCGGTGTGTGTCTGGTAGGTGGTATCGACGCCAAGGGTAACCCCGAGGCAAACTTCACGCCTCAGCAGATGAGCGCACTGAATGGGGTGCTGCACGAGCTGAGGGGGACCTACCCCAAGGCTGTCATTATGGCGCACCACGATGTAGCGCCGAAGGCTTGTCCTAGCTTCGACCTGCAACGTTGGGTAAAGACTGGCGAGCTGGTCACTTCTGACCGTGGGTAA

Gene Ontology

Description Category Evidence (source)
GO:0008270 zinc ion binding molecular function None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0032897 negative regulation of viral transcription biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.5.1.28 None Hydrolyzes the link between N-acetylmuramoyl residues and L-amino acid residues in certain cell-wall glycopeptides. match to sequence model evidence used in automatic assertion
ECO:ECO:0000255
HAMAP-Rule:MF_04111

Tertiary structure

PDB ID
Q3ZFI3
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi00005aa1f0_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (8Fk8i) rather than this protein.
PDB ID
8Fk8i
Method AlphaFoldv2
Resolution 73.13
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50