Protein

Protein accession
A0A649VDG1 [UniProt]
Representative
1Mw6C
Source
UniProt (cluster: phalp2_1467)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MVTRANVEATKEFIRARLGNPYVYGGALSNNVRQGTDCSEVWQTVLEMAHGRWVPGRQAEGATTESYRYVKVGEVGPFGTIAVAHWSHIPKDAAARIAFHHGPGGGANSHMWGELDGMRIESGGSKGLVTGDRALAVESNYATAWAYLPGPIGGTASLGIGSTGPAVITLQAKLNERGAGLEVDGEFGPLTGAAVARAQQEMHVVGDPPGIAGEATLRALGLLNDEGAVPANPDTLAAEYLAAATGLSVARAQQILPAARDGLKRSDATNVNRIAMWLAQIGHESAGFNATEEYEKGDGGATERWKYLGRTWIQLTWLSNYLGFSKWCFERGLVPTPTYFGDNPKELADLKWAGLGAAYYWTVARPDINALSDRRDLDTVTRRINGGTNGIDDRRNRYNRALAVGDNLLRIIADSDIDEWDALMADNTLYPSQSPYRDSDTGGFTLRQYVRNLDAMEHARFIEDAAIQGYVPAIEVVARLAKGEGPGATDKWAVAKAQNVIRIIKANLAAQTGASA
Physico‐chemical
properties
protein length:516 AA
molecular weight:55351,3 Da
isoelectric point:5,71
hydropathy:-0,28
Representative Protein Details
Accession
1Mw6C
Protein name
1Mw6C
Sequence length
436 AA
Molecular weight
47373,67620 Da
Isoelectric point
6,52429
Sequence
MTVLRSNVEATKAFIRARVGNPYVYGGALSNNVRQGTDCSEVWQTVLEMVLGRWRQGRQSEGATTESYRYIPVGGVGPFGTIRVANWRDIPGNAVARLAFHHGPGGGANSHMWGDLDGMRIESAGSKGLVTGGNARAIEDGYAHAWCYLPGPIDGGGAVPVESTEDLQRRLNTEGAGLEVDGEYGPLTASALAAYDKRVPYARQVKAEGERRGITPRGIQIAFSVVFVESGWKMYANSKVPESLKLPHDAVGSDHDSVGLFQQRCPMWGPANVLMDPTKSAGLFYDRLAKLDYNNQSRPPGDYAADVQRPAAQYRGRYQERMGHAVALYNRITTTPEPSDDLEALLMSDELFASRSIYRTSDQKTMSARDAIFGADACAHMTWIEQKALAGDSWAIGLVADLATGKLPASKDPWAVDRARHIISIIQNAIAAQETK
Other Proteins in cluster: phalp2_1467
Total (incl. this protein): 71 Avg length: 492,0 Avg pI: 5,88

Protein ID Length (AA) pI
1Mw6C 436 6,52429
2LVy0 426 5,36216
7dyXr 454 6,56885
ugp5 433 5,00782
A0A7D5JQW9 526 5,94743
A0A5J6T2E0 525 5,71814
G8I7N0 526 5,93083
A0A1C9M1F8 526 5,81499
A0A0F6YQU5 526 5,89599
G1D3T6 472 5,93316
A0A2D1GDJ1 472 5,63254
A0A5J6T3K6 466 6,21622
Q854L4 473 5,57638
A0A385DV82 472 5,60196
A0A0K2CPS6 526 5,81687
A0A142F1U1 466 6,21622
A0A142K4R9 526 5,93083
A0A1D8EY32 526 5,81687
A0A1D9CAG5 526 5,81499
A0A1S6L2C0 465 6,21622
A0A2P1A0V6 526 5,93083
A0A2P1A2H9 526 5,81687
A0A2P1A2Y7 526 5,93083
A0A2S1PBB3 472 5,60196
A0A2Z5H0T0 465 6,21622
A0A2Z5H107 466 6,21622
A0A2Z5H6C2 526 5,81687
A0A345BM58 472 5,86956
A0A345L5Y8 466 6,21622
A0A385D1K5 472 5,51130
A0A385D246 472 5,60196
A0A385UI13 472 5,60196
A0A3G8FI38 526 5,71172
A0A481VR61 526 5,93083
A0A481VV02 472 5,60196
A0A481VWN3 466 6,21622
A0A481VXL8 466 6,21622
A0A481VXN4 466 6,29955
A0A481VYH2 466 6,21622
A0A481W0B0 472 5,60196
A0A481W2D8 472 5,60196
A0A481W320 516 5,67892
A0A481W7R5 526 5,77299
A0A482JES0 518 5,91645
A0A4P8GHX0 472 5,60196
A0A4Y6EPL8 526 5,81499
A0A515MG35 472 5,60196
A0A5B8WR32 472 5,60196
A0A5J6T8K7 517 5,52494
A0A5J6TCR7 516 5,90929
A0A5J6TN95 526 5,74508
A0A5J6TRS6 516 6,15199
A0A5J6TY85 526 5,81687
A0A5Q2WHV6 526 5,93083
A0A5Q2WS95 516 6,03666
A0A649V7M7 518 5,91645
A0A649V8X9 526 5,81687
A0A649VGJ1 466 6,21622
A0A649VPV9 473 5,64243
A0A6B9J644 466 6,21622
A0A6B9SZN9 465 6,21622
A0A7T1KRD6 473 5,64243
A0A7T3KBC0 466 6,21622
G1BR17 526 5,81499
A0AAE8BVC2 472 5,60196
A0AAE8YB30 472 5,60196
A0A976U9C0 526 5,81681
A0A977KLW9 466 6,21622
A0A977KQK3 466 6,21622
A0AA48Y4F3 527 6,29892
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23524
75PiN
82 41,3% 532 1.583E-146
2 phalp2_3134
870ZT
3 32,8% 429 1.242E-42

Domains

Domains [InterPro]
Unannotated
Unannotated
Unannotated
Unannotated
Unannotated
Representative sequence (used for alignment): 1Mw6C (436 AA)
Member sequence: A0A649VDG1 (516 AA)
1 436 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage SheaKeira
[NCBI]
2656603 Yeceytrevirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
MN585994 [NCBI]
CDS location
range 4377 -> 5927
strand +
CDS
ATGGTTACCCGCGCCAACGTCGAAGCGACGAAAGAGTTCATCAGGGCGCGGCTCGGAAACCCCTACGTCTACGGCGGCGCTCTCTCCAACAACGTCAGGCAGGGCACAGACTGCTCTGAGGTCTGGCAGACCGTACTCGAAATGGCTCATGGCCGTTGGGTTCCCGGACGCCAGGCTGAGGGAGCCACCACCGAGTCCTACCGCTACGTCAAGGTCGGCGAAGTCGGCCCGTTCGGCACTATCGCCGTCGCACACTGGAGCCACATCCCGAAGGATGCTGCCGCTCGCATCGCGTTCCACCACGGCCCCGGAGGCGGGGCCAACTCCCACATGTGGGGGGAACTGGACGGCATGCGTATCGAGTCTGGCGGCTCGAAGGGTCTGGTCACTGGAGACCGCGCTCTCGCGGTCGAGTCCAACTACGCCACGGCCTGGGCCTACCTGCCCGGTCCCATCGGCGGTACAGCCTCTCTCGGCATCGGTTCCACCGGCCCCGCAGTAATCACCCTGCAGGCCAAGCTCAACGAGCGAGGAGCCGGCCTTGAGGTCGACGGTGAGTTCGGTCCTCTGACCGGCGCTGCTGTCGCCAGGGCTCAGCAGGAGATGCATGTCGTCGGTGACCCTCCGGGTATCGCTGGTGAAGCCACGCTCCGGGCTCTGGGACTACTGAACGACGAGGGAGCCGTCCCGGCCAACCCCGACACGCTCGCTGCCGAGTACCTCGCCGCGGCCACCGGCCTCAGCGTTGCCCGTGCTCAGCAGATCCTCCCCGCTGCGCGGGACGGTCTCAAGAGGTCCGACGCCACCAACGTCAATCGCATCGCGATGTGGCTGGCTCAGATCGGTCACGAATCGGCAGGCTTCAACGCCACCGAGGAGTACGAGAAGGGCGACGGTGGAGCCACCGAGCGGTGGAAGTACTTGGGCCGCACATGGATTCAGCTCACTTGGCTCTCGAACTACTTGGGCTTCTCGAAGTGGTGCTTCGAACGCGGGCTCGTCCCGACGCCGACGTACTTCGGGGACAACCCCAAGGAACTCGCTGATCTGAAGTGGGCCGGCTTGGGTGCCGCCTACTACTGGACCGTCGCACGGCCAGACATCAACGCACTGAGCGACCGTCGCGATCTCGACACAGTCACCCGCCGAATCAACGGCGGTACCAACGGGATTGACGACCGGCGTAACCGATACAACCGCGCACTAGCCGTGGGAGACAACCTCCTTCGGATCATCGCGGACTCCGACATAGACGAATGGGATGCCCTCATGGCTGACAACACGCTCTACCCGTCGCAGTCTCCCTACCGGGACAGCGACACCGGAGGCTTCACCCTCCGACAGTACGTCCGCAACCTCGACGCGATGGAGCACGCGAGGTTCATCGAAGACGCCGCGATCCAGGGCTACGTCCCTGCCATCGAGGTGGTCGCACGACTGGCTAAGGGCGAAGGCCCAGGAGCCACCGACAAGTGGGCAGTAGCTAAGGCCCAGAACGTCATTCGAATCATCAAGGCCAACCTGGCCGCTCAGACAGGAGCATCTGCATGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1Mw6C) rather than this protein.
PDB ID
1Mw6C
Method AlphaFoldv2
Resolution 87.24
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50