Protein

Protein accession
A0A7G8LDX5 [UniProt]
Representative
7jwQQ
Source
UniProt (cluster: phalp2_29146)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAYVSLSRPGSNFAGKPITRAIADRYRAAGVDIGAIFQYGKPQGTAPSDWTTGFEGGKRMAAQALRIARDAGMPGWCPIYFAVDENITLQQWNDTAVHFFRGAGEAIGVEWVGIYGHSRVCHWAIEDKVIGRSPKPGSGLWAWVTRAWSDDTGTGYAALYQRIIDTPSNPGPLVDGIRVDVNDVYAADWGQWSVDRTPNAGQPAPAPTAPGGPADMARIGYGVTHTIPAGSDGPRRADDYVGVHTQEGGKGDAIGLANYCKTAGVSYNDAVDDVYTVRMMAPGNAPWAAVQANAVGYHIVLAGSFVSWSRDRWLSKDASDGLDEDAMLTRAARCVAAACQEFGIPVEWVGNNGATGWPQKRGICGHKDFGARGGGHTDPYPNFPIDEFMRRVRAFFAPPSPNLIDAEAKVAARWIGKRITGAATPATTDDETPLFLDGKKVGAFARFESGHVYWRLGASAAYAIPAGGLFEAYAARDWERGLGFPVLRHQVVTTPISKVTAGVQSFEGGVLFTPVGGPVEGFVVHGEIGKRYAAMDWEQGPLGLPTSDERPVPNTDLIEQRFEFGKLTYVPTGVLVELVTN
Physico‐chemical
properties
protein length:581 AA
molecular weight:62307,2 Da
isoelectric point:5,82
hydropathy:-0,22
Representative Protein Details
Accession
7jwQQ
Protein name
7jwQQ
Sequence length
233 AA
Molecular weight
25497,14060 Da
Isoelectric point
5,92003
Sequence
MTRTLIDSGARIEAAGIKRAGHVGSFGYLSTSRPGTFFSGKPIRQDWVNDMNSLGLMIVSCWQFGKNATADWRRGFNGGVADAKAAARKHRELGGPDDAPIYFAVDDDVNLSDYRLFVRPYLQGVNSVIGVNRTGVYGHDECVTWAATDQLLGKTRDGSGKTFGWQTRAWSEGDISRYACVYQRIVDTASTPGPPIDGNPTDVNDMLCDDIGQWNYGGGKWNDILTQMLGTKD
Other Proteins in cluster: phalp2_29146
Total (incl. this protein): 36 Avg length: 479,6 Avg pI: 5,85

Protein ID Length (AA) pI
7jwQQ 233 5,92003
1NTEj 237 5,39450
1ep7d 243 9,31763
7PKiT 233 5,70688
7jPrh 233 5,50028
7qzf5 261 5,53648
8LhIn 233 5,36392
A0A0K0NKU4 233 5,92003
A0A3G3M912 606 5,93765
A0A514DEN0 606 5,62015
A0A345KUB7 606 5,78976
A0A5P8D4W7 606 5,76321
A0A3G3MAW0 606 5,78976
A0A0E3T822 233 5,70688
A0A0E3T7X5 233 5,50028
A0A3G3M7Z6 605 5,98613
A0A3G3MAD2 606 5,84807
A0A1B3B0M0 233 5,36392
A0A385DTM7 603 5,76083
A0A142K9U3 581 5,82318
A0A2L0HJL6 582 5,82318
A0A3G3M9B3 581 5,82318
A0A3G3MAC6 603 5,94066
A0A4Y6EN22 606 5,84807
A0A5P8D607 603 5,84807
A0A7G3WHV3 606 5,94066
A0A7G8LKJ3 606 5,78976
A0A8F2IWP6 603 5,84807
A0A8T8JCW5 606 5,76321
A0A9E7DXH8 606 5,84949
A0A9E7MCW9 230 5,67420
A0A9E7QTW1 606 5,76083
A0A9X9P677 606 5,84807
A0AA96V889 606 5,85012
A0AAE7VDD0 606 5,76321
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3898
6TrTC
3 43,1% 167 6.281E-68
2 phalp2_31904
4LoIq
28 32,4% 216 1.429E-42
3 phalp2_10918
4EKvI
27 34,0% 226 3.432E-36
4 phalp2_8820
2pMjJ
148 32,7% 211 1.450E-34
5 phalp2_15053
7qmsB
8 31,7% 217 6.894E-34
6 phalp2_35595
1dXu3
19 31,0% 216 1.756E-33
7 phalp2_6406
boMu
55 32,5% 203 1.375E-31
8 phalp2_31044
1EaUP
5 32,1% 190 8.892E-31
9 phalp2_26371
1oyu6
8 32,7% 211 2.260E-30
10 phalp2_18219
4N93g
8 28,6% 227 1.460E-29

Domains

Domains [InterPro]
Representative sequence (used for alignment): 7jwQQ (233 AA)
Member sequence: A0A7G8LDX5 (581 AA)
1 233 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08924

Taxonomy

  Name Taxonomy ID Lineage
Phage Gordonia phage Paries
[NCBI]
2762413 Vividuovirus > Vividuovirus paries
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MT639640 [NCBI]
CDS location
range 37534 -> 39279
strand +
CDS
ATGGCGTACGTGTCGCTGTCCCGTCCTGGCAGCAACTTCGCGGGAAAGCCGATCACACGAGCTATCGCGGATCGCTACCGCGCCGCAGGTGTGGACATCGGGGCCATATTCCAGTACGGCAAACCGCAGGGCACCGCGCCGTCGGACTGGACGACCGGGTTTGAGGGTGGCAAACGCATGGCCGCGCAGGCGCTGCGTATCGCCCGCGACGCCGGCATGCCCGGCTGGTGCCCGATCTATTTCGCCGTCGACGAGAACATCACTCTGCAGCAGTGGAACGACACGGCCGTGCATTTCTTCCGCGGCGCCGGCGAGGCGATCGGCGTCGAGTGGGTCGGCATCTACGGGCACTCGCGTGTGTGTCACTGGGCGATCGAGGACAAGGTCATCGGCCGGTCACCGAAGCCGGGATCCGGGCTGTGGGCGTGGGTCACCCGCGCGTGGTCGGATGACACCGGCACCGGCTACGCGGCGCTGTATCAGCGCATCATCGACACGCCGTCGAATCCGGGGCCGCTCGTCGACGGGATCCGCGTCGACGTCAACGACGTCTATGCCGCCGATTGGGGGCAGTGGTCGGTCGACCGCACACCGAACGCCGGGCAGCCGGCACCCGCACCAACAGCACCGGGAGGGCCGGCAGATATGGCACGCATCGGATACGGCGTCACGCACACGATCCCCGCCGGGTCGGACGGGCCGCGCCGCGCGGACGACTACGTCGGCGTGCACACGCAGGAAGGCGGGAAGGGCGACGCGATCGGACTAGCGAACTACTGCAAGACCGCGGGCGTGTCCTACAACGACGCCGTGGACGATGTGTACACCGTCCGCATGATGGCACCCGGCAACGCGCCGTGGGCTGCTGTGCAGGCGAACGCCGTCGGCTACCACATCGTGCTAGCAGGCAGTTTCGTGTCGTGGTCGCGTGACCGGTGGCTGTCGAAGGATGCATCGGACGGACTCGACGAAGACGCGATGCTGACCCGCGCCGCCCGCTGTGTCGCCGCAGCATGTCAAGAGTTCGGGATCCCCGTCGAGTGGGTCGGGAACAACGGGGCGACCGGGTGGCCGCAGAAGCGCGGTATCTGCGGGCACAAGGATTTCGGGGCGCGCGGCGGCGGCCACACCGACCCGTACCCGAACTTCCCGATCGACGAGTTCATGCGCCGGGTGCGTGCATTCTTCGCGCCGCCGTCACCGAACCTGATCGACGCCGAGGCGAAGGTCGCCGCCCGCTGGATCGGAAAGCGCATCACCGGCGCCGCGACACCGGCGACCACCGACGACGAAACACCGCTGTTCCTCGACGGTAAGAAGGTCGGCGCGTTCGCGCGGTTCGAGTCCGGTCACGTGTACTGGCGGCTCGGCGCGTCGGCGGCGTACGCGATCCCCGCCGGCGGGCTGTTCGAGGCGTACGCCGCCCGCGACTGGGAACGCGGTCTAGGGTTCCCGGTGCTGCGGCACCAGGTCGTCACCACACCGATCAGCAAGGTCACCGCCGGCGTGCAGTCGTTCGAGGGCGGCGTGTTATTCACACCTGTGGGCGGACCTGTGGAGGGATTCGTCGTGCACGGCGAGATCGGCAAACGGTACGCGGCGATGGACTGGGAACAGGGACCACTCGGTCTACCCACATCGGACGAACGACCGGTGCCGAATACCGACCTGATCGAGCAACGGTTCGAGTTCGGGAAACTGACCTACGTGCCGACGGGCGTACTCGTCGAGCTGGTCACCAACTAG

Gene Ontology

Description Category Evidence (source)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7jwQQ) rather than this protein.
PDB ID
7jwQQ
Method AlphaFoldv2
Resolution 94.19
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50