Protein
- Protein accession
- A0A223W052 [UniProt]
- Representative
- 7csIv
- Source
- UniProt (cluster: phalp2_15037)
- Protein name
- TtsA-like Glycoside hydrolase family 108 domain-containing protein
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MTDRFEICHAITAKWEGGWSDHPADPGGKTMYGITEKRWHEYQDKLKVKRTPVRNVTKAQALAFYRSEFWLACGADKLFPGVDLAVHDGSVNSGVSRGRKWLLASAGSNDHSETVKKICRARLSFMQSLAIWKTFGNGWGRRVADIEARGVAMALAAMGLSPSQVSGKIKTEAAKSAQQASSAKKAATTSATAASAPAAAPVVEPSTVTDATTVWILVAIVAAGAVATVIFIAKKRAADARVEAYNEVAA
- Physico‐chemical
properties -
protein length: 250 AA molecular weight: 26620,1 Da isoelectric point: 9,68 hydropathy: -0,10
Representative Protein Details
- Accession
- 7csIv
- Protein name
- 7csIv
- Sequence length
- 86 AA
- Molecular weight
- 9739,94010 Da
- Isoelectric point
- 8,93069
- Sequence
-
MSTAKFRRCHDVTKAWEGGWSDHPADPGGKTMYGLTEAVFHAWLRQQRKPVRPVRQITAAEAEQIYFEQYWVPSGGPTISPPTMPP
Other Proteins in cluster: phalp2_15037
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_16609
esP7
|
11 | 33,8% | 68 | 2.959E-17 |
| 2 |
phalp2_11557
12S8B
|
2 | 37,9% | 58 | 7.627E-13 |
| 3 |
phalp2_15722
3iCcu
|
63 | 36,1% | 72 | 1.227E-10 |
| 4 |
phalp2_21282
1uMBM
|
118 | 33,3% | 66 | 1.686E-10 |
| 5 |
phalp2_18063
3IzUl
|
9 | 25,3% | 75 | 7.044E-08 |
| 6 |
phalp2_33765
15NtY
|
387 | 28,4% | 88 | 2.510E-07 |
| 7 |
phalp2_19635
4kzEN
|
2 | 26,7% | 86 | 5.547E-05 |
| 8 |
phalp2_3878
6ORZl
|
656 | 25,0% | 84 | 5.547E-05 |
| 9 |
phalp2_15477
82FB3
|
2 | 33,3% | 54 | 3.724E-04 |
Domains
Domains [InterPro]
1
86 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Agrobacterium phage Atu_ph08 [NCBI] |
2024265 | Roslyckyvirus > Roslyckyvirus ph08 |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
MF403009
[NCBI]
CDS location
range 40486 -> 41238
strand +
strand +
CDS
ATGACTGACCGATTCGAGATTTGCCACGCCATCACGGCTAAGTGGGAAGGTGGATGGAGTGACCATCCCGCGGACCCCGGCGGCAAAACCATGTATGGCATCACCGAAAAGCGCTGGCACGAATATCAGGACAAGCTGAAGGTCAAGCGGACGCCGGTGCGCAACGTCACCAAGGCGCAAGCCCTCGCGTTCTACCGCAGCGAATTCTGGCTCGCCTGCGGAGCTGACAAGCTATTCCCCGGTGTTGATCTGGCTGTACACGACGGGTCGGTAAACTCCGGTGTTTCTCGTGGTCGCAAATGGCTGCTTGCATCCGCCGGCAGTAACGATCACAGCGAGACGGTGAAGAAAATCTGCCGCGCTCGCCTTTCCTTCATGCAGTCACTCGCGATCTGGAAAACGTTCGGCAATGGCTGGGGGCGTCGTGTTGCTGATATCGAAGCGCGTGGCGTTGCCATGGCGCTCGCAGCGATGGGGCTTTCTCCTTCGCAGGTCAGCGGGAAGATCAAGACAGAAGCGGCCAAATCGGCCCAGCAGGCCAGCTCGGCAAAGAAAGCGGCCACCACAAGCGCCACCGCGGCGTCAGCGCCAGCCGCCGCGCCCGTTGTCGAGCCTTCCACCGTGACGGACGCAACAACCGTCTGGATCCTCGTCGCCATTGTGGCGGCCGGTGCCGTTGCCACCGTTATCTTTATCGCCAAGAAGCGCGCCGCCGATGCCCGCGTTGAGGCCTACAACGAGGTGGCAGCATGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0016020 | membrane | cellular component | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi000bb9f1a4_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(7csIv)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50