Protein
- Protein accession
- A0A166YGS6 [UniProt]
- Representative
- 1jszc
- Source
- UniProt (cluster: phalp2_8504)
- Protein name
- Lysin A
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MSQFKADRDMLSPNDSGVRAAPPSLVIVHTNEGDPNGRADDLAAYLQKPSSQASYTLIVDRTGRIVRSNDDEFVPWAAGSPANERGLHLCFVGRAVQSTAEWLAQPRQLDAAARAVADWCRRYNIPATWLTGDRMRAGARGIGGHDTTVFAWHATDHTDPGAGFPRAQFVALVNKHLTGTSQQEDDDMAFTDEDRRKLDYIYGQLRPWPQLGTNDKGEPLTLIDAVAELKANDR
- Physico‐chemical
properties -
protein length: 234 AA molecular weight: 25771,0 Da isoelectric point: 5,49 hydropathy: -0,58
Representative Protein Details
- Accession
- 1jszc
- Protein name
- 1jszc
- Sequence length
- 283 AA
- Molecular weight
- 31684,44460 Da
- Isoelectric point
- 5,39023
- Sequence
-
MSNAYKCDHDILTSNDDGRRQGLIKRIYIHTFEGSDLDAVAMARYQLSPAAGGSYHMVIDLDGMTARENDDEYIPWSAGWTANRDGIHISLAGRAAFSREFWLARTKQLDKLADVVAAYCKAYGVPAIYRNAEDLKAGRWGISTHADAAIAWRETDHTDPGKGFPMDVLVDKVNALLHPTPVPPPPPAPVEPGVKYPSLIDGRPLRYSEFIRFIDEKVTRIYDDMILDPKDMNVAALYTGIQGEAYPSYVDPSKSFTLDQYVRLIDFKIDAIRKELDKQSVQS
Other Proteins in cluster: phalp2_8504
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_32230
7h6JD
|
19 | 50,2% | 215 | 1.008E-79 |
| 2 |
phalp2_18569
7cxQw
|
45 | 43,7% | 176 | 4.787E-53 |
| 3 |
phalp2_24741
6EXtT
|
52 | 35,7% | 190 | 8.299E-35 |
| 4 |
phalp2_1243
oSGY
|
19 | 36,0% | 183 | 8.543E-33 |
| 5 |
phalp2_31178
7Vvn5
|
47 | 37,0% | 181 | 2.193E-30 |
| 6 |
phalp2_558
HtN7
|
28 | 34,6% | 182 | 1.390E-27 |
| 7 |
phalp2_16608
eqCG
|
19 | 32,3% | 204 | 8.284E-23 |
| 8 |
phalp2_28942
5F55F
|
4 | 30,3% | 201 | 1.088E-17 |
| 9 |
phalp2_23125
4E9CJ
|
94 | 28,9% | 176 | 2.675E-17 |
| 10 |
phalp2_7436
4Ksn7
|
7 | 28,4% | 190 | 3.609E-17 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Gordonia phage McGonagall [NCBI] |
1838072 | Mcgonagallvirus > Mcgonagallvirus macgonagall |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
KU998255
[NCBI]
CDS location
range 3248 -> 3952
strand +
strand +
CDS
GTGAGCCAGTTCAAGGCAGACCGAGACATGTTGTCTCCCAACGATTCCGGCGTACGCGCCGCGCCGCCGTCGCTGGTCATCGTCCACACAAACGAGGGTGACCCGAACGGCCGCGCCGACGACCTGGCCGCGTACTTGCAGAAGCCGAGCAGCCAAGCGTCGTACACGCTCATCGTGGACCGCACCGGCCGCATCGTCCGCAGCAACGACGACGAGTTCGTTCCGTGGGCTGCTGGCAGCCCGGCGAACGAGCGCGGTCTGCACCTGTGTTTCGTCGGCCGCGCCGTTCAGTCCACCGCGGAATGGCTCGCGCAGCCCCGGCAGCTCGACGCCGCCGCGCGCGCCGTCGCCGACTGGTGCCGCCGGTACAACATCCCGGCGACCTGGCTCACCGGCGACCGTATGCGCGCCGGTGCGCGCGGCATCGGCGGCCACGACACCACCGTGTTCGCCTGGCACGCCACCGACCACACAGACCCCGGCGCAGGGTTCCCGCGCGCACAGTTCGTGGCGCTGGTGAACAAGCACCTGACCGGCACATCACAGCAGGAGGACGATGACATGGCATTCACCGACGAGGACCGCCGGAAGCTCGACTACATCTATGGGCAGCTTCGGCCGTGGCCGCAGCTCGGCACCAACGACAAGGGCGAGCCGCTGACGCTGATCGACGCCGTCGCCGAGCTCAAGGCCAACGACCGATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0001897 | symbiont-mediated cytolysis of host cell | biological process | None (UniProt) |
| GO:0008745 | N-acetylmuramoyl-L-alanine amidase activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
| GO:0042742 | defense response to bacterium | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000203147c_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(1jszc)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50