Protein

Protein accession
A0A6J5NWJ1 [UniProt]
Representative
6H4ds
Source
UniProt (cluster: phalp2_20859)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MSTLEQAFALVAKFEGCRLKAYQDIVGVWTIGYGETLGVVPGMVWTQEQADQALRRRLGYFMAGVLARCPGLSGNRLVACVSLAYNIGLGAFGASSVCRLAKRGDYAQAAEKFLLWNKAGGRVVKGLTIRRTAERDLFLRLD
Physico‐chemical
properties
protein length:142 AA
molecular weight:15506,9 Da
isoelectric point:9,69
hydropathy:0,16
Representative Protein Details
Accession
6H4ds
Protein name
6H4ds
Sequence length
76 AA
Molecular weight
8314,60540 Da
Isoelectric point
6,16558
Sequence
MTAALSEPGALELALELIRRFEGCRLRAYQDVAGVWTIGWGETLGVAPGMVWSQPQADAELARRVAQFLLGVLKRC
Other Proteins in cluster: phalp2_20859
Total (incl. this protein): 4 Avg length: 94,3 Avg pI: 7,85

Protein ID Length (AA) pI
6H4ds 76 6,16558
5wo8U 77 7,87999
6DrcC 82 7,65714
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28762
4NrrK
6 54,5% 55 4.819E-17
2 phalp2_35600
1fedL
106 49,1% 57 2.360E-16
3 phalp2_31530
48EM7
20 50,7% 63 2.020E-14
4 phalp2_31138
26ZNc
11 49,1% 57 4.855E-13
5 phalp2_4289
84eDR
22 47,4% 59 9.170E-13
6 phalp2_11283
6ScB5
20 47,6% 65 3.035E-11
7 phalp2_18112
49WMU
11 52,9% 51 4.172E-11
8 phalp2_24269
3bvr0
10 37,9% 58 3.596E-09
9 phalp2_17749
7tNIh
28 40,6% 59 3.596E-09
10 phalp2_29425
1kDQa
2 39,5% 48 4.944E-09

Domains

Domains [InterPro]
Representative sequence (used for alignment): 6H4ds (76 AA)
Member sequence: A0A6J5NWJ1 (142 AA)
1 76 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured Caudovirales phage
[NCBI]
2100421 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
LR796746 [NCBI]
CDS location
range 3219 -> 3647
strand -
CDS
GTGAGCACCTTAGAACAAGCGTTCGCCCTCGTCGCTAAGTTCGAGGGTTGCCGCCTTAAGGCTTACCAAGACATCGTCGGCGTGTGGACGATAGGTTACGGCGAAACGCTCGGCGTCGTGCCTGGCATGGTTTGGACGCAAGAACAAGCCGACCAGGCATTGCGCCGTCGGCTTGGCTATTTCATGGCGGGGGTCTTGGCGCGTTGCCCTGGGCTCTCGGGCAATCGCCTTGTCGCTTGCGTGTCGCTTGCGTACAACATCGGCTTGGGTGCTTTTGGCGCGTCGTCGGTGTGTCGTTTGGCGAAGCGGGGCGACTATGCGCAAGCGGCGGAAAAGTTCTTGCTGTGGAATAAGGCGGGCGGGCGAGTCGTTAAGGGTTTGACGATCCGCCGCACCGCCGAGCGCGATTTATTCTTACGGCTCGACTAA

Gene Ontology

Description Category Evidence (source)
GO:0003796 lysozyme activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0016998 cell wall macromolecule catabolic process biological process None (UniProt)
GO:0030430 host cell cytoplasm cellular component None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)
GO:0044659 viral release from host cell by cytolysis biological process None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.2.1.17 None Hydrolysis of (1->4)-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues in a peptidoglycan and between N-acetyl-D-glucosamine residues in chitodextrins. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00000632

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6H4ds) rather than this protein.
PDB ID
6H4ds
Method AlphaFoldv2
Resolution 90.58
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50