Protein

Protein accession
A0A0B5GYD7 [UniProt]
Representative
6QgEm
Source
UniProt (cluster: phalp2_18510)
Protein name
N-acetylmuramoyl-L-alanine amidase
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKICLSVGHSEKEQGAVNQTYGVTEFQFNSALAPLIKEQLHKLGHEVVLVWRESLKDLPKKINATGADVAVELHCNAFNKIASGTEVLHYPSSKRGAKLAQFIQDSIIDILDLRDRGVKPSERELILRATSMPCVILEPGFIDNDVDYLLMENHKEELALAIASGIHEYSLTLR
Physico‐chemical
properties
protein length:174 AA
molecular weight:19377,1 Da
isoelectric point:5,79
hydropathy:-0,11
Representative Protein Details
Accession
6QgEm
Protein name
6QgEm
Sequence length
126 AA
Molecular weight
14203,25440 Da
Isoelectric point
9,54185
Sequence
MKKVALIIGHSERSQGAVNKKHGITEFMFNESLARKVNYYLSKHGIDCEIVYRNSYTLLPTKVNKTNADIAVSMHCNAFNEKPNGTEVLYYKGSQEGAALAISLQRKFIRCLNLKDRGVNPRALIE
Other Proteins in cluster: phalp2_18510
Total (incl. this protein): 6 Avg length: 158,2 Avg pI: 6,97

Protein ID Length (AA) pI
6QgEm 126 9,54185
CTW9 128 8,61215
R9R4A9 174 6,15824
A0A1W6JV24 173 5,75759
A0A9X9P471 174 5,97016
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21852
4t88M
271 52,4% 122 4.565E-41
2 phalp2_21590
2ns10
22 44,3% 124 2.295E-30
3 phalp2_6329
79LaP
54 46,7% 122 2.094E-29
4 phalp2_7941
wIlS
251 32,0% 128 6.162E-27
5 phalp2_11016
4Prl3
67 36,2% 127 3.101E-23
6 phalp2_5086
1FA9p
33 33,0% 127 1.876E-21
7 phalp2_37883
7CjiR
3 49,3% 79 3.208E-20
8 phalp2_1943
4rYA7
14 31,4% 127 8.265E-20
9 phalp2_11495
wgKr
11 26,0% 119 1.412E-18
10 phalp2_38356
BDVO
5 28,2% 92 1.596E-16

Domains

Domains [InterPro]
Representative sequence (used for alignment): 6QgEm (126 AA)
Member sequence: A0A0B5GYD7 (174 AA)
1 126 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01520

Taxonomy

  Name Taxonomy ID Lineage
Phage Vibrio phage phi 1
[NCBI]
1589297 Schitoviridae > Pacinivirus > Pacinivirus phi1
Host Vibrio cholerae
[NCBI]
666 Proteobacteria > Gammaproteobacteria > Vibrionales > Vibrionaceae > Vibrio >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KP280062 [NCBI]
CDS location
range 9203 -> 9727
strand -
CDS
ATGAAAATTTGTCTTTCTGTAGGACATTCAGAAAAGGAACAAGGAGCAGTAAACCAAACATATGGTGTTACTGAGTTTCAATTTAATAGTGCACTTGCACCATTGATTAAAGAACAACTACATAAACTTGGTCATGAAGTTGTATTGGTTTGGCGGGAAAGTCTGAAAGACTTACCTAAGAAAATTAATGCTACTGGTGCAGATGTAGCTGTAGAGCTTCACTGTAATGCATTCAATAAGATTGCTTCTGGTACTGAAGTACTTCATTACCCTTCAAGTAAACGTGGTGCTAAGCTTGCACAGTTCATTCAGGATTCAATTATAGATATTCTAGATTTACGTGATAGGGGTGTTAAACCAAGTGAACGTGAACTAATCCTACGAGCTACTAGTATGCCTTGTGTAATTCTAGAACCGGGATTCATTGACAATGATGTAGACTATCTACTTATGGAAAACCATAAAGAAGAATTAGCATTAGCTATTGCTAGCGGTATTCATGAATACAGTTTAACTCTCCGCTAG

Gene Ontology

Description Category Evidence (source)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi00059906e0_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6QgEm) rather than this protein.
PDB ID
6QgEm
Method AlphaFoldv2
Resolution 94.54
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50