Protein

Protein accession
A0A2U8UMJ2 [UniProt]
Representative
7uhSc
Source
UniProt (cluster: phalp2_12317)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTHTNDTYAREILRAGNDLGITPRGIVIAFATVFVESDWYMWANAKVPESLRLPHERVGNDGRSVGLFQQQVVWGNGAWWWGDAATCMDPYKSARLFFERLKTRDYSTGDPGAHAQAIQRSAYPDRYGQRMSEAQNYYNQLAGEDANVGFSGDPVWLEDVLREALGDRLVVAQADWKERGTGGVMGDIWGVMIHHTGNDRETVAGIRDGRPDLRGPLSQCLITPDGKCHLIAVGPCNHAGTGSYPGVGTNNGNQRLIGFECAWPTIRPDGSFDPEQRWPDAQIITMRDATAAVLKRLGHDSKHVIGHKEWAGATQGKWDPGNLDMNWFRGEVQKDLDGFVFPGEQPSAPQPGPALPPDYDKEVWDQLRILWPQLGHRTLVDAVAAIGAKLGIEGCYDVKGKS
Physico-chemical properties
protein length: 402 AA
molecular weight: 44402,2 Da
isoelectric point: 5,58
hydropathy: -0,50
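The hydropathy figure above can in principle be recomputed from the protein sequence. The following is a minimal sketch, assuming the value is a GRAVY score (mean Kyte-Doolittle hydropathy per residue); the record does not name the scale it uses, and the function name `gravy` is hypothetical.

```python
# Kyte-Doolittle hydropathy scale.
# Assumption: the record does not state which scale it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    # Drop whitespace so sequences pasted with line breaks still work.
    residues = [aa for aa in sequence.upper() if not aa.isspace()]
    if not residues:
        raise ValueError("empty sequence")
    unknown = sorted(set(residues) - set(KYTE_DOOLITTLE))
    if unknown:
        raise ValueError(f"unsupported residue codes: {unknown}")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)
```

Passing the 402-residue sequence listed under "Protein sequence" to `gravy` gives a value that can be compared with the reported hydropathy; a mismatch would suggest the page uses a different scale or averaging window.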
Representative Protein Details
Accession
7uhSc
Protein name
7uhSc
Sequence length
391 AA
Molecular weight
43081,14760 Da
Isoelectric point
6,28329
Sequence
VPTKDDYAREIIRAGRDLGITDRGIVIAFATVSVECDWIMYANAKVPESLKLPHERVGSDGKSVGLFQQQVVWGNNAWWWGDAATCMDPYKSARLFFERLKKRDYNNGDPGAHAQAIQGSAFPDRYGQRMAQAQAYFNQLTNQENPMAVSGDPIWLEDVLRDAIGDLLIVEPGWKERGAGGFMGVIWGSMWHHTGNVNETVATIRDGVQQPSGWLPGPLSQGLIKPDGTCHLIAVGPCNHAGAGNWKDLTDGNRQSIGFECAYSGSGPWPQKQIITMRNIAAAISKHIGKRADDSVCGHKEYAKPQGRKVDPGNMDMNWFRAEVQKDIDGFVFPGETPGAPEPPKVKRFPDDWTERELMIEVLRQLRGPALNGWPQLGGKSVVDYLGKGNG
Other Proteins in cluster: phalp2_12317
Total (incl. this protein): 56; Avg length: 435,0 AA; Avg pI: 5,77

Protein ID Length (AA) pI
7uhSc 391 6,28329
1p4lY 531 8,70344
1roS3 426 5,71609
3NS0x 430 4,84418
6RKkY 511 6,70907
6RP1H 525 8,30013
7AqhM 422 5,96851
7ArDU 398 5,70029
7AsOe 403 5,51278
7g8WY 430 4,93183
7gnZJ 421 5,06449
7hlH0 431 4,80815
7juLm 459 5,85484
7kFyg 430 4,93797
7m9GL 417 5,04551
7pQmP 430 4,88641
7pSvK 430 4,89341
7pV9H 430 4,89341
7pW7z 430 4,98696
7qJyO 430 5,17453
7qdhK 431 4,84089
7qgn6 430 4,90722
7qxpp 411 5,94265
7ubw4 430 4,99174
8KwEU 402 5,57849
8KwEY 424 5,97062
8MARL 398 5,62839
8MQdt 402 5,67614
8MWcG 423 6,22941
8MZf5 398 5,87001
fQj4 531 8,66109
A0A249XT90 403 5,51278
A0A023ZWD2 459 5,85484
A0A0A0RKX1 402 5,57849
A0A7G9UXF8 402 5,67614
A0A0K2FPQ1 402 5,67858
A0A192YBS4 459 5,85484
A0A2L1J041 459 5,93862
A0A345MGX5 402 5,67858
A0A385DZN0 402 5,67858
A0A385UK50 459 5,94129
A0A385UKB6 459 6,02677
A0A3G8FHF6 459 5,93862
A0A3S9U966 459 6,02677
A0A411AYD0 459 5,85671
A0A482MF21 459 6,02677
A0A5P8D580 459 5,93862
A0A5P8D8G7 459 6,02677
A0A5P8DD55 459 5,93862
A0A5P8DF31 459 5,93862
A0A649VD64 402 5,67858
A0A649VIH8 402 5,67858
B5U576 402 5,67858
G1BNC1 459 5,93862
A0A9E8M3R0 459 5,93862
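The cluster averages can be checked against the member table. The sketch below assumes each row has the whitespace-separated form `ID length pI` with a decimal comma in the pI column, as in the table above; the function names `parse_member_row` and `summarize` are hypothetical. The table lists 55 rows, and the query protein A0A2U8UMJ2 (402 AA, pI 5,58) is not among them, so it would need to be added to reproduce a 56-member total.

```python
from statistics import mean


def parse_member_row(row: str) -> tuple[str, int, float]:
    """Split one 'ID length pI' row; the pI uses a decimal comma."""
    protein_id, length, pi = row.split()
    return protein_id, int(length), float(pi.replace(",", "."))


def summarize(rows: list[str]) -> tuple[int, float, float]:
    """Return (member count, mean length in AA, mean pI) for the rows."""
    parsed = [parse_member_row(row) for row in rows if row.strip()]
    if not parsed:
        raise ValueError("no member rows supplied")
    lengths = [length for _, length, _ in parsed]
    pis = [pi for _, _, pi in parsed]
    return len(parsed), mean(lengths), mean(pis)


# Two rows copied from the table, for illustration only.
count, avg_length, avg_pi = summarize(["7uhSc 391 6,28329", "1p4lY 531 8,70344"])
```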
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_40628 (7RbAk) 146 37,6% 377 1.782E-126

Domains

Domains [InterPro]
Domain architecture (in listed order): Unannotated, Ami2, Unannotated
Representative sequence (used for alignment): 7uhSc (391 AA)
Member sequence: A0A2U8UMJ2 (402 AA)
Axis: 1 to 391 AA (representative)
Domain positions follow the representative sequence; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage Byougenkin [NCBI] 2182394 Cheoctovirus > Cheoctovirus byougenkin
Host No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MH155866 [NCBI]
CDS location
range 26387 -> 27595
strand +
CDS
GTGACACACACCAACGACACCTACGCGCGCGAAATCCTACGCGCCGGAAACGACCTCGGCATCACCCCGCGCGGAATCGTCATCGCGTTCGCAACCGTGTTCGTAGAGTCCGACTGGTACATGTGGGCCAACGCGAAAGTCCCAGAATCACTGCGCCTTCCTCATGAACGCGTAGGTAACGACGGGCGCAGCGTCGGCCTATTCCAGCAGCAAGTGGTGTGGGGCAACGGCGCATGGTGGTGGGGCGACGCAGCCACCTGCATGGACCCCTACAAGTCCGCCCGACTGTTCTTCGAACGCCTCAAGACCCGCGACTACAGCACCGGCGACCCGGGAGCGCACGCCCAAGCCATTCAACGCTCGGCATACCCAGATCGATACGGACAGCGCATGTCCGAAGCGCAGAACTACTACAACCAGCTCGCAGGAGAGGACGCAAACGTGGGATTCAGCGGAGATCCCGTCTGGCTCGAAGACGTTCTACGAGAAGCCCTCGGCGACCGACTCGTAGTCGCCCAGGCCGACTGGAAAGAACGCGGGACCGGCGGCGTAATGGGCGACATCTGGGGCGTCATGATCCACCACACCGGCAACGACCGAGAAACCGTCGCCGGAATCCGTGACGGCCGCCCCGACCTGAGAGGCCCACTATCGCAATGCCTCATCACCCCCGACGGGAAATGCCACCTGATCGCCGTCGGCCCATGCAACCACGCTGGGACCGGCTCGTATCCCGGCGTCGGCACCAACAACGGCAATCAGCGGCTCATTGGCTTCGAGTGCGCCTGGCCCACCATCCGCCCCGACGGCTCGTTCGATCCCGAGCAGCGCTGGCCCGACGCCCAGATCATCACCATGCGTGACGCCACCGCGGCGGTGCTGAAACGACTCGGCCATGACTCCAAGCACGTCATCGGCCATAAGGAATGGGCCGGTGCCACACAGGGCAAGTGGGACCCCGGCAACCTCGACATGAACTGGTTCCGCGGCGAAGTCCAGAAAGACCTGGACGGGTTCGTGTTCCCCGGTGAGCAGCCGTCGGCACCGCAGCCTGGCCCGGCCTTGCCGCCCGACTACGACAAAGAGGTGTGGGATCAGTTGCGCATCCTGTGGCCGCAGCTCGGACACCGCACCCTCGTTGACGCCGTGGCGGCGATCGGCGCGAAGCTCGGCATCGAAGGCTGCTACGACGTCAAGGGCAAGTCCTGA
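The CDS coordinates can be cross-checked against the protein length. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention, not stated on this page) and that the CDS includes its stop codon; the function names are hypothetical. Under those assumptions, 26387 to 27595 spans 1209 nt, which is 403 codons, or 402 residues plus a stop, consistent with the listed protein length of 402 AA.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides for 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_nt: int, has_stop: bool = True) -> int:
    """Number of residues encoded by a CDS of cds_nt nucleotides."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = cds_nt // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


def ends_with_stop(cds: str) -> bool:
    """True if the last three bases form a standard stop codon."""
    return cds[-3:].upper() in STOP_CODONS


nt = cds_length(26387, 27595)
residues = expected_protein_length(nt)
```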

Gene Ontology

GO ID Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7uhSc) rather than this protein.
PDB ID
7uhSc
Method AlphaFoldv2
Resolution 91.57
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
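The confidence bands in the legend can be expressed as a small lookup. This is a sketch only: the legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so the code assumes they fall into the lower band. The function name `plddt_band` is hypothetical. If the 91.57 listed under "Resolution" is a mean pLDDT, which the record does not confirm, it would fall in the "Very high" band.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"
```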