Protein
- Protein accession
- A0AA51U7Z8 [UniProt]
- Representative
- 3nSHh
- Source
- UniProt (cluster: phalp2_1834)
- Protein name
- Peptidase M15A C-terminal domain-containing protein
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 98% (predicted by ML model) - Protein sequence
-
MSQDPRAIQLTTNFKLSEFLHGDDAIPAPWILDNIYRLANRLQVIRDLLGKPIIINSGYRSKAHNLAVGGASHSQHLNGMAADIVVSGMPAKAVQEFLRHWSGGMGCYQHYTHLDIRPTPVRWS
- Physico‐chemical
properties -
protein length: 124 AA molecular weight: 13842,7 Da isoelectric point: 9,30 hydropathy: -0,21
Representative Protein Details
- Accession
- 3nSHh
- Protein name
- 3nSHh
- Sequence length
- 173 AA
- Molecular weight
- 19098,86030 Da
- Isoelectric point
- 9,83912
- Sequence
-
LATYTRGKAIKLSSNFYLREFECKCGKCKKTVVDAKLVSSLQKIRDHYNKEVIINSGYRCAAHNRAVGGAAFSQHLKGRAADIVVKGVAPKDVAAYAASIGVNGIGVYKTFTHIDTRANKSYWGIEFKQNNRAIAEQVIDGLWGVGAARKRKLMAAGYDYKEIQALVNELLRK
Other Proteins in cluster: phalp2_1834
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_29133
79Li3
|
3905 | 56,3% | 119 | 2.059E-41 |
| 2 |
phalp2_3181
8qBIB
|
378 | 58,6% | 121 | 7.376E-38 |
| 3 |
phalp2_11062
4US9Y
|
716 | 51,6% | 122 | 1.564E-32 |
| 4 |
phalp2_34760
3hjNi
|
68 | 40,7% | 113 | 2.645E-31 |
| 5 |
phalp2_14040
85wjf
|
1353 | 51,8% | 110 | 1.742E-30 |
| 6 |
phalp2_17462
4UpQn
|
1 | 47,2% | 125 | 4.957E-28 |
| 7 |
phalp2_37118
1hl31
|
667 | 38,1% | 110 | 7.494E-26 |
| 8 |
phalp2_20618
4JRfo
|
7 | 38,5% | 122 | 2.625E-25 |
| 9 |
phalp2_15919
4GXbY
|
30 | 44,3% | 106 | 1.721E-24 |
| 10 |
phalp2_33636
DFQX
|
10 | 38,8% | 108 | 2.882E-23 |
Domains
Domains [InterPro]
1
173 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Microcystis phage MaAM05 [NCBI] |
2812902 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MW495066
[NCBI]
CDS location
range 207847 -> 208221
strand -
strand -
CDS
ATGAGTCAAGATCCACGGGCTATTCAGCTCACCACAAACTTCAAATTGTCCGAATTTTTACACGGGGACGATGCCATTCCCGCTCCCTGGATTCTGGACAATATTTACCGACTGGCCAATCGCCTGCAAGTCATCCGGGATTTGTTGGGCAAGCCCATCATCATCAATTCCGGCTACCGCAGCAAGGCCCACAATCTGGCGGTGGGCGGGGCTTCTCACAGTCAGCACCTGAACGGCATGGCCGCAGACATTGTGGTCAGTGGCATGCCCGCCAAAGCGGTACAGGAATTTTTGAGGCACTGGTCTGGCGGGATGGGCTGTTATCAGCACTACACGCATTTGGACATTCGCCCCACCCCAGTCCGCTGGTCGTAA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative
(3nSHh)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50