Protein

Protein accession
W8CPL1 [UniProt]
Representative
5sQyS
Source
UniProt (cluster: phalp2_14613)
Protein name
N-acetylmuramoyl-L-alanine amidase
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MGKLKYLVIHCTATPEGREITKQDIEQWHLKERGWSRVGYSDMIHLDGSLENLIEFDQDDNVDSWEISNGARGFNGISRHVVYTGGAEKTKPSWSKYYPPKDTRTEAQKKTLLTYVKFMILRHPDIKVIGHNNISNKACPSFDVVAWLKNENIPFENIGL
Physico-chemical properties
Protein length: 160 AA
Molecular weight: 18378.6 Da
Isoelectric point: 7.01
Hydropathy: -0.61
Representative Protein Details
Accession
5sQyS
Protein name
5sQyS
Sequence length
253 AA
Molecular weight
28770.20460 Da
Isoelectric point
9.56841
Sequence
MLEHKLNALATVLGVVLVNMIPNWLWLILSSIATGACIWIGQALARWFVTWCRRRLMKDTGNSDLPEHMPGSKKGKIKTKLHSLLMKKFIFMAALAFLVIHCTATPAGREVSRQDIEQWHLVQRGWHQVGYADMIHLNGTIENLVPYNRDNNVDSWEITNGVAGINYKSRHVVYVGGCNAKMEPLDTRTDKQKQALYLYVLATIKEHPNIVIAGHNQFDKKACPSFDVQLWLRNNGIPERNIYTPPKTTTKKS
Other Proteins in cluster: phalp2_14613
Total (incl. this protein): 2; Avg length: 206.5 AA; Avg pI: 8.29

Protein ID  Length (AA)  pI
5sQyS       253          9.56841
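As a quick consistency check, the cluster averages can be reproduced from the two member records: this protein (W8CPL1, 160 AA, pI 7.01) and the representative (5sQyS, 253 AA, pI 9.56841). The sketch below assumes a plain arithmetic mean with the pI rounded to two decimals; the dictionary layout and function name are illustrative and not part of the database.

```python
from statistics import mean

# Values copied from this record; the structure and names are illustrative only.
members = {
    "W8CPL1": {"length": 160, "pI": 7.01},
    "5sQyS": {"length": 253, "pI": 9.56841},
}

def cluster_averages(records):
    """Return (mean length, mean pI rounded to 2 decimals), assuming an arithmetic mean."""
    lengths = [r["length"] for r in records.values()]
    pis = [r["pI"] for r in records.values()]
    return mean(lengths), round(mean(pis), 2)

avg_length, avg_pi = cluster_averages(members)
print(avg_length, avg_pi)
```

Under these assumptions the computed values agree with the reported average length and average pI for the cluster.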
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment length  E-value
1  phalp2_7758 (71AR5)   15         58.1          153               1.958E-58
2  phalp2_3660 (7J5qp)   4          29.8          181               5.483E-16
3  phalp2_14660 (5KVev)  6          27.7          173               2.749E-14
4  phalp2_25586 (3TI73)  8          29.1          161               9.992E-13
5  phalp2_23354 (5DkBI)  3          25.6          160               5.091E-10
6  phalp2_16228 (4NSsh)  1          25.6          164               1.042E-04

Domains

Domains [InterPro]
Representative sequence (used for alignment): 5sQyS (253 AA)
Member sequence: W8CPL1 (160 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis (1 to 253 AA).
[Domain diagram; track labels: Disordered region, Unannotated, Unannotated. Legend: EAD, CBD, Linker, Disordered, Unannotated]

Taxonomy

       Name                                     Taxonomy ID  Lineage
Phage  Croceibacter phage P2559Y [NCBI]         1327037      No lineage information
Host   Croceibacter atlanticus HTCC2559 [NCBI]  216432       Bacteroidetes > Flavobacteriia > Flavobacteriales > Flavobacteriaceae > Croceibacter >

Coding sequence (CDS)

CDS Source ID
CDS Source
KC688701 [NCBI]
CDS location
range 22504 -> 22986
strand +
CDS
ATGGGAAAATTAAAATATTTAGTAATACATTGTACTGCAACTCCTGAAGGTAGAGAGATCACAAAACAGGACATTGAACAATGGCATTTAAAGGAGCGAGGTTGGTCCAGAGTAGGGTACTCAGATATGATACACTTGGACGGTTCGTTGGAAAATTTAATCGAATTCGACCAAGACGATAACGTGGACAGTTGGGAAATCTCAAACGGTGCCAGAGGTTTTAACGGTATTTCGAGACACGTCGTTTATACTGGGGGTGCTGAGAAAACCAAACCGAGTTGGTCGAAATATTACCCACCAAAAGACACACGTACAGAGGCCCAGAAAAAGACTCTTTTAACGTATGTAAAATTTATGATACTGAGACACCCGGATATAAAAGTAATCGGACACAACAACATTTCCAACAAAGCATGTCCATCTTTTGACGTGGTGGCCTGGTTGAAAAATGAAAATATACCATTTGAAAATATAGGATTATAA
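The coordinates and lengths reported for this record can be cross-checked with simple arithmetic: read as an inclusive range, 22504 to 22986 spans 483 nt, or 161 codons, which matches the 160 AA protein plus one stop codon (the CDS ends in TAA). A minimal sketch, assuming 1-based inclusive coordinates and using hypothetical helper names:

```python
def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1

def protein_length(cds_nt: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of cds_nt nucleotides."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_nt // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons

nt = cds_length(22504, 22986)
aa = protein_length(nt)
print(nt, aa)
```

The same check applied to an exclusive range (482 nt) would fail the multiple-of-3 test, which is why inclusive coordinates are assumed here.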

Gene Ontology

GO ID       Description                                  Category            Evidence (source)
GO:0001897  symbiont-mediated cytolysis of host cell     biological process  None (UniProt)
GO:0008745  N-acetylmuramoyl-L-alanine amidase activity  molecular function  None (UniProt)
GO:0009253  peptidoglycan catabolic process              biological process  None (UniProt)
GO:0042742  defense response to bacterium                biological process  None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5sQyS) rather than this protein.
PDB ID
5sQyS
Method AlphaFoldv2
Resolution 84.80
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50