Protein

Protein accession
A0AAE7V8C0 [UniProt]
Representative
4XbYy
Source
UniProt (cluster: phalp2_12036)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKHIKKIVCSVGAAIALIVTGVVPNNVIVSPAGLELIGNAEECRLTPYICPAGLITNGIGNTHGVTDKPITIEQVAKDWAQNIESAQDCLAATTNVAALSQGQIDAFTSFIFNVGCTRYRHNSDGSETRIYKKLKAGYYTEACQELKFWVYGGGKKLNGLIKRRGKEMELCFS
Physico‐chemical
properties
protein length:173 AA
molecular weight:18741,4 Da
isoelectric point:8,66
hydropathy:-0,02
Representative Protein Details
Accession
4XbYy
Protein name
4XbYy
Sequence length
109 AA
Molecular weight
12238,72550 Da
Isoelectric point
9,16155
Sequence
GQQITIEKVAIDWSKNIEDAQSCLSNTTDITSLTQGQLDAFVSFVFNVGCSRYRHNADGSETRIYKKIKTGQFEAACHELKHWVYAGGKKLNGLVKRRQRETALCLSSK
Other Proteins in cluster: phalp2_12036
Total (incl. this protein): 7 Avg length: 184,6 Avg pI: 7,90

Protein ID Length (AA) pI
4XbYy 109 9,16155
A0A2I7RNI5 196 5,78316
A0A7L5GQC0 218 8,13315
A0A7D7ESD0 196 8,84740
R9TQ00 196 6,58516
A0AB39C9Q2 204 8,13522
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8099
6AkO5
4 34,6% 104 3.124E-17
2 phalp2_9410
8Hj1q
5109 38,1% 97 1.106E-16
3 phalp2_37101
1bvx6
877 44,4% 72 4.914E-15
4 phalp2_2632
6RhYr
14867 33,9% 106 4.490E-14
5 phalp2_7764
77796
37 43,2% 74 5.131E-12
6 phalp2_19285
8enOZ
460 48,5% 70 5.131E-12
7 phalp2_33221
5jbqV
447 33,6% 107 9.648E-12
8 phalp2_8653
440Ju
41 39,7% 73 8.004E-10
9 phalp2_24173
2nDwl
15 36,2% 69 4.818E-08
10 phalp2_39360
4E4Jt
2 38,0% 71 9.044E-08

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4XbYy (109 AA)
Member sequence: A0AAE7V8C0 (173 AA)
1 109 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage ST2-1pr
[NCBI]
2856177 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MZ496293 [NCBI]
CDS location
range 15027 -> 15548
strand -
CDS
ATGAAGCACATAAAAAAGATTGTTTGCTCAGTTGGTGCAGCCATCGCGCTCATTGTTACGGGTGTCGTTCCTAACAATGTCATCGTTTCACCTGCAGGCTTGGAGTTGATTGGTAACGCTGAAGAGTGCCGATTAACTCCATACATTTGTCCTGCTGGTCTTATTACTAACGGGATAGGTAACACACACGGTGTTACAGATAAGCCCATCACCATAGAACAAGTCGCTAAAGACTGGGCTCAAAACATTGAGTCCGCACAGGATTGTCTTGCTGCCACCACGAATGTTGCTGCATTGAGTCAGGGCCAGATTGATGCCTTCACATCATTTATATTTAACGTGGGCTGCACACGATATCGGCATAACTCGGACGGTAGCGAGACACGCATCTATAAAAAGCTTAAAGCCGGTTACTACACCGAAGCTTGCCAAGAACTTAAATTCTGGGTTTATGGCGGTGGTAAGAAGTTGAATGGCTTGATTAAGCGACGGGGAAAGGAGATGGAACTTTGCTTTTCGTAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4XbYy) rather than this protein.
PDB ID
4XbYy
Method AlphaFoldv2
Resolution 89.73
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50