Protein

Protein accession
I6PBN2 [UniProt]
Representative
4Lj5N
Source
UniProt (cluster: phalp2_14481)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MVAAIMLVLAGGNVRTNQEGLELIGNAEGCRVNPYVCPAGVLTDGIGNTHGVRYGKSMEQIAKDWERNILEAERCVIRYGAGDRLTENAFSAAVSLTFRVGCGKVKNSTLFRMFRQGEQRATCDQFLRWRYAAGKELPGLVTRSQKERALCLKE
Physico‐chemical
properties
protein length:154 AA
molecular weight:17015,4 Da
isoelectric point:9,17
hydropathy:-0,24
Representative Protein Details
Accession
4Lj5N
Protein name
4Lj5N
Sequence length
217 AA
Molecular weight
23851,41960 Da
Isoelectric point
6,59039
Sequence
MKVSQQGLKLTEASEGCRLKAYQDTGGVWTIGYGHTNGVRPNQVITQEVAEHLLQLDQQTAENCVNKFALPCTQGQFDALVDFVFNVGPSQFLSSHLYKYHKAGEYDKAAAEFPKWKYDNGKVIQGLVTRRLAERALYSRQEPAEDHQQTDVPERSASSAVGSVLASQETGVVQSSPVQPVPTSNTVPASNNAQSVFEFRFGVLLRKLQGIASQLVH
Other Proteins in cluster: phalp2_14481
Total (incl. this protein): 23 Avg length: 202,6 Avg pI: 7,25

Protein ID Length (AA) pI
4Lj5N 217 6,59039
4KCrY 156 6,90516
4KxWx 212 5,30742
4LIn5 212 5,14861
4LjMw 180 5,86274
4Lp7y 212 5,70490
4Ltba 212 5,21079
4MBeL 216 5,37779
4MS1n 212 5,12218
4MSa4 216 5,24808
4gcgi 233 8,54698
6Eukg 232 9,75918
6Jc0g 244 5,07432
87sF2 232 9,21835
87uds 232 9,09638
8efhr 239 5,27184
H9C0Q4 178 8,64142
A0A3Q9GFM1 179 9,05937
A0A5Q2F220 180 8,40663
A0A6G5YJT8 180 9,21558
A0A6J5T8A1 135 9,67788
A0AAU9EEF5 196 9,19824
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_33221
5jbqV
447 54,1% 144 4.565E-59
2 phalp2_2498
5GMvl
297 49,6% 141 1.060E-57
3 phalp2_2632
6RhYr
14867 53,1% 141 1.988E-57
4 phalp2_29738
1pBjt
830 48,0% 156 7.811E-55
5 phalp2_126
5jCCA
637 55,0% 140 1.465E-54
6 phalp2_28113
7zmZV
50 48,1% 160 2.479E-53
7 phalp2_26378
1q5mp
207 42,1% 171 2.017E-51
8 phalp2_10966
4MkR6
70 45,3% 152 5.176E-51
9 phalp2_6830
8n5Jv
121 49,6% 147 2.490E-50
10 phalp2_2176
3Yhf4
930 46,1% 154 5.756E-49

Domains

Domains [InterPro]
GH24
Disordered region
Representative sequence (used for alignment): 4Lj5N (217 AA)
Member sequence: I6PBN2 (154 AA)
1 217 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Cronobacter phage phiES15
[NCBI]
1168280 No lineage information
Host Cronobacter sakazakii ES15
[NCBI]
1138308 Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Cronobacter >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
JQ780327 [NCBI]
CDS location
range 14924 -> 15388
strand +
CDS
GTGGTGGCGGCGATCATGTTGGTGCTGGCTGGTGGGAACGTCCGAACCAACCAGGAAGGGCTTGAGCTGATCGGTAATGCCGAGGGGTGTCGCGTTAATCCATACGTCTGCCCGGCTGGCGTGCTTACCGACGGGATCGGCAACACTCACGGCGTGCGCTACGGCAAATCGATGGAGCAGATCGCCAAAGACTGGGAGCGCAACATCCTGGAGGCTGAACGCTGCGTAATTCGCTATGGCGCTGGTGACAGGCTGACCGAAAACGCATTCAGCGCTGCCGTATCTCTGACGTTTCGCGTGGGCTGCGGCAAAGTGAAAAACTCGACGCTGTTTCGCATGTTCCGGCAGGGCGAACAGCGCGCCACCTGCGATCAGTTCCTGAGATGGCGTTACGCCGCAGGTAAAGAGCTGCCAGGGCTGGTTACCAGGAGCCAGAAAGAGCGTGCGCTCTGCCTCAAGGAGTGA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
upi00025f67cb_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4Lj5N) rather than this protein.
PDB ID
4Lj5N
Method AlphaFoldv2
Resolution 76.97
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50