Protein
- Protein accession
- A0A222ZES2 [UniProt]
- Representative
- 7s5Uq
- Source
- UniProt (cluster: phalp2_16559)
- Protein name
- Lysin A
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MTYQLHEEYSCDFTGPERTPLGIAIHWWDDPKNKPTFWNIVNLLLERSRQRSASVNFVAEAGLVACLVSPNVVAWAQGDGASGWGNNNLISIECNPRCTPEDRETVAELIADQRLLWGSLPLYPHNKFTATQCPGVWEQWLPWLDRRANEIMAAKRGQAAPVVPASAPAPAAPAPAPAADNRIHWVVERGDTLGGIAAHYGGPSAAQIAAYNGIPDANRIEVGQRIYIPGPLVWIVDPGDTLGAIGAHYGMSAETVAANNGLNVNAPIFPGQRLRIID
- Physico‐chemical
properties -
protein length: 278 AA molecular weight: 30092,6 Da isoelectric point: 5,37 hydropathy: -0,17
Representative Protein Details
- Accession
- 7s5Uq
- Protein name
- 7s5Uq
- Sequence length
- 244 AA
- Molecular weight
- 27277,26150 Da
- Isoelectric point
- 9,02520
- Sequence
-
MAYRYETQYDSPNYEPGREGNRIQGITIHHWGYDGQYHDDVIRFLCRPGGKTSAHYVVSGGRVTCIISPENTAYHAGTWAANLRQIGIECRPECSRADMETVAELIADLRKTYGYLPLSPHKRFFNTACPGRWESKLGWLDKRAEEIRNGKSTPTQNTGAGGEDRRKKKMRNVGIYWKASDGSVKCAILNTSSGFWAAHSNGNKFLASYNNPLAIAFDTPSWAEVTEAHAAVLQARCEQVRKGA
Other Proteins in cluster: phalp2_16559
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_1064
7bl2o
|
2 | 46,9% | 196 | 9.847E-71 |
| 2 |
phalp2_13682
65xN2
|
64 | 46,9% | 162 | 4.750E-52 |
| 3 |
phalp2_13827
7cMO8
|
12 | 50,3% | 155 | 4.750E-52 |
| 4 |
phalp2_25234
40uRC
|
5 | 49,0% | 161 | 5.831E-51 |
| 5 |
phalp2_31988
59Np4
|
130 | 45,1% | 175 | 4.686E-49 |
| 6 |
phalp2_12749
88ep5
|
5 | 40,6% | 160 | 1.175E-45 |
| 7 |
phalp2_7901
8LrgT
|
13 | 44,6% | 159 | 5.619E-45 |
| 8 |
phalp2_13806
716PV
|
1 | 39,0% | 164 | 1.242E-40 |
| 9 |
phalp2_26331
1cKvm
|
65 | 34,8% | 181 | 2.221E-25 |
| 10 |
phalp2_7058
8iEHQ
|
9 | 35,1% | 179 | 4.117E-25 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Arthrobacter phage Abidatro [NCBI] |
2015853 | Galaxyvirus > Galaxyvirus abidatro |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MF140397
[NCBI]
CDS location
range 18061 -> 18897
strand +
strand +
CDS
ATGACTTACCAACTGCACGAGGAGTACTCCTGCGACTTCACCGGGCCGGAGCGGACGCCGCTCGGCATCGCTATTCACTGGTGGGATGATCCAAAGAACAAGCCGACGTTTTGGAACATTGTCAACCTGCTACTCGAGCGGTCCCGGCAGCGGTCCGCCTCGGTCAACTTCGTGGCTGAGGCCGGCCTCGTGGCTTGCCTTGTGTCTCCGAACGTGGTGGCCTGGGCGCAAGGTGACGGGGCCTCCGGCTGGGGCAACAACAACCTGATAAGCATCGAGTGCAACCCTCGGTGCACTCCGGAGGACCGGGAGACGGTCGCGGAGCTTATCGCGGATCAGCGTCTCCTCTGGGGGAGCCTGCCGCTCTACCCTCACAACAAGTTCACGGCGACTCAATGCCCGGGCGTGTGGGAGCAGTGGCTTCCGTGGCTGGACCGGCGGGCCAACGAGATTATGGCGGCTAAGCGCGGGCAGGCGGCTCCGGTAGTTCCGGCCTCCGCTCCTGCTCCGGCCGCTCCGGCTCCTGCCCCTGCGGCCGATAACCGGATCCACTGGGTTGTCGAGCGGGGCGACACGCTCGGCGGGATCGCGGCGCACTACGGCGGCCCGTCTGCGGCGCAGATCGCGGCGTACAACGGCATCCCGGACGCTAACCGGATCGAGGTCGGCCAGCGCATCTACATTCCTGGGCCGCTGGTCTGGATCGTGGACCCTGGCGACACGCTGGGCGCGATCGGGGCGCACTACGGCATGAGCGCGGAGACGGTCGCGGCCAACAACGGCCTGAACGTCAACGCTCCTATCTTCCCGGGGCAGCGGCTCCGGATTATCGACTAA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0008745 | N-acetylmuramoyl-L-alanine amidase activity | molecular function | None (UniProt) |
| GO:0009253 | peptidoglycan catabolic process | biological process | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000b94cbd1_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(7s5Uq)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50