Protein

Protein accession
A0A8S5MXH1 [UniProt]
Representative
41rm9
Source
UniProt (cluster: phalp2_24003)
Protein name
N-acetylmuramoyl-L-alanine amidase
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MATRQQFVQTAVSYLGAVRGSAKHRRLIDIFNQHKPDGWPMNYVAPWCAAAVSAWAYELGIGNLIPVSANCGTMVSKAKQMGIWIESDSYTPSPGDLILYDWQDSGYGDNVGGPDHVGVVVSVGGGMITVIEGNKGASSVVGYRSVPINGRYIRGFVRPNFDGVSTAPPSASSGNYGLYKVNSSAGLNVRKGPGTNYARITTLSNGTPLRIVEMSGNWGRSVGAGGWVCMDYLTKSGATSAPTYTPSNTSAYAVGRTYQLISDMRVRTGPGTGYRQRAYSELTADGKRHALAGSLACLRAGTQVTCLEMRGDWMRIPSGWICTRQGSKVYIK
Physico‐chemical
properties
protein length:332 AA
molecular weight:35506,8 Da
isoelectric point:9,59
hydropathy:-0,21
Representative Protein Details
Accession
41rm9
Protein name
41rm9
Sequence length
708 AA
Molecular weight
78460,86820 Da
Isoelectric point
6,73937
Sequence
MKKIIDVSEHQKRIDWQAVKASGAVDGVIIRIGYGDDDVSQDDDYIKYNLDECERLGIPHEVYLYSYADTDKHIQSEIAHIKRLISGKDVRVWFDLEYRPAKAFWRRAVEAFLKAFPTGGVYSWEWVFTDVLDGINCPRWICAYGSNSGKPEYNYKPALDCHGWQYTSKARIPGITGDVDVSEWYGDFAEASVEPATASAAPVVGRNAIINQMLAWEGYDEKNGKFKAIIDGYNAYLQTAVKAGCANYAVKYSDEWCATCASNAYIACGIGYLFPVECSCPRMITLAKSMGIWQENDGYVPDPADAVLYDWEDSGKGDNVGTPDHIGIVIAVDKSKGTFVVMEGNKNEAVGRRTMSIDGRYIRGFITPKFNDSVPVPAKKEEVKTVAKTYEISKTGTPNKTTCIKWGLLKNNQKVTARLQPDESAKPCSFSPVQPLSRIEVYDYITTAKRWAYCKVGGKYGFILASSISAYLRTPKQPTDKAVDWVLADDFGKNATRENALKALGYDADAVQKAVNDKLGYKAPEKADSPRIRVWPVWFFEKDEGQFGDCTAILEYAADNETVAHCVLIDTAKATAASVVISKLQKAGVKRIDAVVISHAHGDHYGGLNNIMSKIPVGHIYAPDTAGLDKYQKSYASAIRSQAKKAPSTTLKAGSAFQIGSIKCQCIYQVPTDKLKEHDNHHFVNNQSVVLRFTLGDWTFHTAGDLQN
Other Proteins in cluster: phalp2_24003
Total (incl. this protein): 7 Avg length: 529,3 Avg pI: 7,45

Protein ID Length (AA) pI
41rm9 708 6,73937
1cBz6 820 8,62427
3WDW3 435 6,52906
3WG6M 565 5,93566
3ZS6H 522 8,41656
A0A8S5SCN4 323 6,31648
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8619
23dU9
33 39,5% 480 6.931E-74
2 phalp2_34762
3ij83
364 24,2% 602 5.987E-22

Domains

Domains [InterPro]
GH25
Unannotated
Unannotated
Lactamase_B
Representative sequence (used for alignment): 41rm9 (708 AA)
Member sequence: A0A8S5MXH1 (332 AA)
1 708 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00753, PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Podoviridae sp. ctlSr7
[NCBI]
2826573 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
BK015014 [NCBI]
CDS location
range 37340 -> 38338
strand +
CDS
ATGGCAACTAGGCAGCAGTTCGTACAGACGGCAGTTAGTTACCTCGGAGCGGTTAGAGGTTCAGCTAAACACCGCCGCCTTATAGACATTTTTAACCAGCACAAACCAGATGGCTGGCCAATGAACTATGTTGCGCCGTGGTGCGCTGCAGCTGTGTCCGCTTGGGCGTACGAGTTAGGAATTGGAAATCTAATCCCGGTAAGCGCAAATTGTGGAACAATGGTTTCTAAAGCCAAGCAGATGGGCATATGGATTGAAAGCGATTCATACACCCCGAGTCCTGGAGACCTAATCCTGTATGATTGGCAGGACTCCGGATACGGTGATAACGTGGGCGGACCTGACCATGTAGGCGTGGTCGTATCTGTTGGTGGCGGAATGATTACCGTCATTGAGGGGAACAAGGGCGCATCGTCCGTTGTAGGATATAGAAGTGTACCTATTAACGGTAGATATATAAGAGGGTTTGTGAGACCAAATTTTGATGGAGTAAGCACAGCACCGCCAAGCGCTAGCTCAGGGAATTACGGATTATACAAAGTCAATTCATCCGCAGGTCTTAATGTTAGAAAAGGACCTGGAACAAACTATGCAAGGATAACTACGCTGTCGAACGGTACGCCGCTTCGAATTGTAGAAATGAGTGGCAACTGGGGTAGATCTGTAGGAGCTGGCGGCTGGGTTTGTATGGACTATCTCACGAAATCAGGAGCAACATCAGCACCGACATACACACCTAGCAACACGAGCGCATACGCTGTAGGCAGGACATATCAATTAATTTCTGATATGCGCGTAAGAACTGGTCCAGGAACAGGATATAGGCAGCGTGCATATTCTGAACTAACCGCAGATGGCAAGAGGCATGCACTCGCTGGAAGTCTAGCTTGCTTGCGCGCAGGTACTCAGGTAACCTGTTTAGAAATGCGAGGAGATTGGATGCGCATTCCGTCCGGATGGATTTGCACTCGCCAGGGAAGCAAGGTGTACATCAAATGA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0003824 catalytic activity molecular function None (UniProt)

Enzymatic activity

EC Number Entry Name Reaction Catalyzed Classification Evidence Source
3.5.1.28 None Hydrolyzes the link between N-acetylmuramoyl residues and L-amino acid residues in certain cell-wall glycopeptides. match to sequence model evidence used in automatic assertion
ECO:ECO:0000256
ARBA:ARBA00001561

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (41rm9) rather than this protein.
PDB ID
41rm9
Method AlphaFoldv2
Resolution 83.84
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50