Protein

Protein accession
A0A2I7RNI5 [UniProt]
Representative
4XbYy
Source
UniProt (cluster: phalp2_12036)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKLSKKILCSVVAAIGLITGGNSLYGDEFTTSVGQVVIEGEAQGELRVSPKALDITGNAEGCRLEPYTCPAGLITNGIGNTHGVPDQPITLDQVAKDWVVNLQGAEQCIESAEKAAKRPMSQGQFDAFTSFSFNTGCSRFMKNHDGSATRIFTYIKQGDYERACKELPKWVYGGGEKLPGLMTRRGLEYARCMEVD
Physico‐chemical
properties
protein length:196 AA
molecular weight:21236,9 Da
isoelectric point:5,78
hydropathy:-0,27
Representative Protein Details
Accession
4XbYy
Protein name
4XbYy
Sequence length
109 AA
Molecular weight
12238,72550 Da
Isoelectric point
9,16155
Sequence
GQQITIEKVAIDWSKNIEDAQSCLSNTTDITSLTQGQLDAFVSFVFNVGCSRYRHNADGSETRIYKKIKTGQFEAACHELKHWVYAGGKKLNGLVKRRQRETALCLSSK
Other Proteins in cluster: phalp2_12036
Total (incl. this protein): 7 Avg length: 184,6 Avg pI: 7,90

Protein ID Length (AA) pI
4XbYy 109 9,16155
A0A7L5GQC0 218 8,13315
A0A7D7ESD0 196 8,84740
R9TQ00 196 6,58516
A0AAE7V8C0 173 8,65986
A0AB39C9Q2 204 8,13522
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8099
6AkO5
4 34,6% 104 3.124E-17
2 phalp2_9410
8Hj1q
5109 38,1% 97 1.106E-16
3 phalp2_37101
1bvx6
877 44,4% 72 4.914E-15
4 phalp2_2632
6RhYr
14867 33,9% 106 4.490E-14
5 phalp2_7764
77796
37 43,2% 74 5.131E-12
6 phalp2_19285
8enOZ
460 48,5% 70 5.131E-12
7 phalp2_33221
5jbqV
447 33,6% 107 9.648E-12
8 phalp2_8653
440Ju
41 39,7% 73 8.004E-10
9 phalp2_24173
2nDwl
15 36,2% 69 4.818E-08
10 phalp2_39360
4E4Jt
2 38,0% 71 9.044E-08

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4XbYy (109 AA)
Member sequence: A0A2I7RNI5 (196 AA)
1 109 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage 1.202.O._10N.222.45.E8
[NCBI]
1881262 Peduoviridae > Canoevirus > Canoevirus canoe
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MG592573 [NCBI]
CDS location
range 21751 -> 22341
strand +
CDS
ATGAAACTAAGTAAAAAAATTCTGTGTTCGGTGGTAGCGGCCATTGGCTTAATTACCGGAGGCAATAGCCTTTATGGTGATGAGTTCACAACGTCAGTCGGCCAAGTGGTGATCGAGGGTGAAGCCCAAGGTGAATTGCGTGTTAGCCCTAAAGCGTTGGACATTACAGGCAATGCAGAAGGGTGCCGACTTGAGCCCTACACCTGCCCTGCGGGATTAATCACAAACGGAATCGGCAACACTCACGGTGTGCCAGACCAACCCATCACGCTTGATCAAGTGGCAAAAGATTGGGTGGTTAACCTGCAAGGTGCTGAGCAGTGCATCGAGTCGGCAGAAAAGGCCGCTAAGCGACCCATGAGCCAAGGGCAGTTTGATGCGTTCACCTCGTTTAGTTTTAACACTGGCTGTTCACGGTTCATGAAAAACCATGATGGTAGCGCCACGCGGATTTTTACTTACATCAAACAAGGCGACTACGAACGAGCCTGCAAAGAGTTACCCAAATGGGTATATGGCGGCGGTGAAAAGTTACCCGGTTTAATGACAAGGCGAGGGCTTGAATATGCTCGCTGCATGGAAGTGGATTAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

PDB ID
A0A2I7RNI5
Method AlphaFoldv2
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50
PDB ID
upi000c825c14_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4XbYy) rather than this protein.
PDB ID
4XbYy
Method AlphaFoldv2
Resolution 89.73
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50