Protein

Protein accession
A0A2H4GSX2 [UniProt]
Representative
7C7wA
Source
UniProt (cluster: phalp2_30754)
Protein name
Lysin A
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSFTWFRPEGPLRTREQVAREVHAVSQARGLDELATVIALMTISTEVGTGIGDDRKWWCPANDRVPATKNYPHDSRSDDNRSSGYFQQQPGPNGEPWWGTPENMMTLPQAANTFLERLSDDYRRAANNPRLAGEFAQRVQQSEFPDRYADKWDEAWSVLRRALNETTPEVPMPENRPPYNEFPIWSENHYNTKRTANDIDAFLLHTSEGFVGRDDAAEALSLWYQPRSRQVAYHYAVSQASDGGVTVVDNVDTDYASWSALSANGRSINLCFAGTRAAWSRNEWLGKFGNAIDVAAYLAVQDCKKYNIPTKVIAPPYTGRLPGITDHRYVTQILKDGTHTDVGDGFPWDYFTERVNHWAAGGKTEPEPPKVKRFPDDWTDRELAVETLRQQRGYALDGWPQLGGRTVVDVLGAIGEKLGVEGCYDVKDKS
Physico‐chemical
properties
protein length:430 AA
molecular weight:48483,1 Da
isoelectric point:5,27
hydropathy:-0,71
Representative Protein Details
Accession
7C7wA
Protein name
7C7wA
Sequence length
535 AA
Molecular weight
58967,97550 Da
Isoelectric point
5,39797
Sequence
MSFTRFDPQPLLSRVEIARRVHKVSLARGLDELATVIALMTISTEVGAGDGTARHWWCPANRKDAASLNFPHDSESDDSRSVGYYQQQNSAVGVDDWWGTMRSRMTLELATDVFLDRLADDYGRAAGNPKLAGQFAQRVQGSAFPDRYAQRWDEAWAVLREALRTEPAPTPTPEPTTPEVPVGKPAYREIDRMGNSRSPRRGARVRNFLLHTQEGDGTAESLAAYLNNSANQVSYHYTLRGGVVVDVVDTDYASWSVLDANPYTVNLCFAGSRAGWSRAQWLAIGDDIRVAAWLAVEDGRKYGFAADVLAPPYRVADGFSDHKYVTQALGIGNHTDVGPNFPWDVFAAAVREFTTTTTAPPKPVVNMIDQYAAAHADLVGERIDKGELATPDGRGRFAGFANGWVYWTPETGAVFVPRRMMEAWAVYKWETGPLGYPTVGATQLADGWVQAFEKGVLYWRDGAAIPFYVTGMIGARWARDGYERGSLGWPISNETIRGDGSVRQDFEHGSILWAPDGTVVLANDDGATHVRTVEH
Other Proteins in cluster: phalp2_30754
Total (incl. this protein): 28 Avg length: 491,4 Avg pI: 6,07

Protein ID Length (AA) pI
7C7wA 535 5,39797
1MfeS 528 6,57385
1Nbfi 488 5,41513
5BmpU 455 4,47268
6Qnbv 506 5,65715
6VgQS 487 5,32180
75tuH 506 5,44849
7UT5K 545 6,84531
7qGYb 493 6,18939
87KQJ 531 6,23265
8Lmdg 531 6,77296
8Lmen 533 6,18393
8hXOB 540 8,42758
F1B2S6 535 5,39797
A0A6G6XK17 432 5,41746
G1JWF2 463 5,57962
A0A160DGW5 591 7,26189
A0A4Y6EMI2 436 5,50812
R9RWA5 433 5,42246
A0A159B6I5 351 8,80253
A0A0K1LS76 463 5,57962
A0A345KTX9 591 6,95939
A0A514TZ74 591 6,95962
M9MUE0 370 6,08395
A0A8A1VE48 430 4,89721
A0A9E8G3P2 533 6,57396
B5A6N9 433 5,34113
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28062
7ebmV
12 55,4% 402 6.010E-156
2 phalp2_35332
7m5Au
49 59,6% 364 4.742E-151
3 phalp2_39054
7o3xD
152 46,5% 350 1.067E-106
4 phalp2_36228
7u9Qz
32 45,3% 377 9.998E-104
5 phalp2_30716
7i7AZ
2 40,4% 383 3.710E-87

Domains

Domains [InterPro]
Unannotated
Unannotated
LGFP
Representative sequence (used for alignment): 7C7wA (535 AA)
Member sequence: A0A2H4GSX2 (430 AA)
1 535 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08310

Taxonomy

  Name Taxonomy ID Lineage
Phage Mycobacterium phage SimranZ1
[NCBI]
1933771 Cheoctovirus >
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
KY385384 [NCBI]
CDS location
range 26526 -> 27818
strand +
CDS
ATGAGCTTCACCTGGTTCCGGCCCGAAGGTCCACTCCGCACCCGCGAACAAGTCGCCCGCGAAGTCCACGCCGTCTCCCAAGCCCGTGGCCTCGATGAACTCGCCACCGTCATCGCCCTGATGACCATCTCCACCGAGGTCGGCACCGGCATCGGCGATGACCGCAAGTGGTGGTGCCCCGCCAACGACCGCGTGCCTGCCACGAAGAACTACCCCCACGACTCCCGCAGTGACGACAACCGCTCATCGGGGTACTTCCAGCAGCAACCCGGACCCAACGGCGAACCCTGGTGGGGCACACCCGAAAACATGATGACCCTGCCACAAGCAGCCAACACGTTCCTCGAACGACTCTCCGACGACTACAGGCGCGCCGCCAACAACCCCAGGTTGGCCGGCGAGTTCGCGCAACGAGTCCAGCAATCCGAATTCCCTGACCGCTACGCCGACAAATGGGACGAAGCCTGGTCGGTGCTGCGACGTGCCCTCAACGAGACCACACCGGAGGTTCCCATGCCCGAAAACCGCCCGCCGTACAACGAATTCCCCATCTGGTCGGAGAACCACTACAACACCAAACGCACCGCCAACGACATCGACGCCTTCCTGCTGCACACCAGTGAGGGATTCGTCGGCCGCGACGACGCCGCCGAAGCCCTATCGCTGTGGTACCAGCCCAGGTCCCGCCAGGTGGCCTACCACTACGCCGTCTCCCAGGCCTCCGACGGTGGTGTGACCGTCGTGGACAATGTTGACACCGACTACGCCTCCTGGTCTGCGCTATCCGCCAACGGCCGCAGCATCAACCTCTGCTTCGCAGGCACCCGCGCCGCATGGTCGCGCAACGAGTGGCTCGGCAAGTTCGGAAACGCCATCGACGTCGCCGCGTACCTCGCCGTGCAGGACTGCAAGAAGTACAACATCCCCACCAAGGTCATTGCGCCACCGTACACTGGCCGACTCCCCGGAATCACCGACCACCGGTATGTCACCCAGATACTCAAGGACGGCACTCACACCGACGTCGGCGACGGATTCCCGTGGGACTACTTCACCGAACGCGTCAACCACTGGGCGGCTGGCGGCAAGACCGAACCCGAACCGCCGAAGGTGAAACGCTTCCCGGACGACTGGACCGACCGCGAACTCGCCGTGGAGACCTTGCGTCAGCAGCGCGGCTACGCGCTGGATGGTTGGCCGCAGCTCGGCGGCCGCACAGTGGTGGACGTACTGGGCGCGATCGGCGAAAAGCTCGGCGTCGAAGGCTGCTACGACGTCAAGGACAAATCCTGA

Gene Ontology

Description Category Evidence (source)
GO:0001897 symbiont-mediated cytolysis of host cell biological process None (UniProt)
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity molecular function None (UniProt)
GO:0009253 peptidoglycan catabolic process biological process None (UniProt)
GO:0042742 defense response to bacterium biological process None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000ca1e0a3_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (7C7wA) rather than this protein.
PDB ID
7C7wA
Method AlphaFoldv2
Resolution 87.90
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50