Protein
- Protein accession
- D6PH71 [UniProt]
- Representative
- 80wC7
- Source
- UniProt (cluster: phalp2_9770)
- Protein name
- Glycoside hydrolase family 19 catalytic domain-containing protein
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 98% (predicted by ML model) - Protein sequence
-
MQYGLKILLEFGYSNKVFTDLYYKKMTINLSKAAKWYKEESHQLAAWNWLESQLTDDQIDEFACMYRAGPANPTGRVITPDIMQQLTGYAANKFDNTFCGDFNKLLMITKFDQHKGAMCMLIANLMHETGNFRWMKEIADGTAYNNRADLGNGPYDGPKYKGTGVLMLTGKYNYTRLAAELQDPLIVERGCEYVADHYPFRSALTWIKDNDLLYLCLTKGFDDCCYRINGGWNGYEDRLEKYKICKKVFNVL
- Physico‐chemical
properties -
protein length: 252 AA molecular weight: 29106,1 Da isoelectric point: 6,88 hydropathy: -0,42
Representative Protein Details
- Accession
- 80wC7
- Protein name
- 80wC7
- Sequence length
- 66 AA
- Molecular weight
- 7282,34720 Da
- Isoelectric point
- 6,70839
- Sequence
-
MATGFDKHREAMCMLIANLLHETGNFRWMSEIADGSAYEMRADLGNVYPGDGKKFKGAGVLMLTGR
Other Proteins in cluster: phalp2_9770
| Total (incl. this protein): 2 | Avg length: 159,0 | Avg pI: 6,79 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 80wC7 | 66 | 6,70839 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_19257
826kG
|
1 | 54,3% | 46 | 1.005E-11 |
Domains
Domains [InterPro]
1
66 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend:
EAD
CBD
Linker
Disordered
Unannotated
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
uncultured phage MedDCM-OCT-S04-C348 [NCBI] |
743545 | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
GU943056
[NCBI]
CDS location
range 1364 -> 2122
strand +
strand +
CDS
ATGCAGTATGGCCTCAAAATCCTGTTAGAGTTTGGTTATAGTAATAAAGTTTTTACCGACCTATATTATAAAAAAATGACAATCAATCTCAGTAAAGCTGCAAAGTGGTACAAGGAAGAATCACACCAGCTAGCTGCCTGGAACTGGCTAGAATCGCAACTTACAGACGATCAAATTGATGAATTTGCTTGTATGTATCGTGCAGGTCCAGCAAATCCAACAGGTCGCGTTATTACACCGGATATTATGCAGCAATTAACGGGTTATGCTGCTAACAAATTTGATAATACATTTTGTGGCGACTTTAATAAGTTGTTGATGATAACAAAGTTTGATCAACATAAAGGAGCCATGTGCATGTTAATTGCAAACCTTATGCATGAAACAGGTAATTTCCGTTGGATGAAAGAAATTGCAGATGGGACTGCATATAATAATCGTGCTGATTTAGGTAACGGTCCCTACGACGGACCTAAATATAAAGGTACAGGAGTACTAATGCTGACTGGTAAATACAACTACACACGCCTAGCTGCTGAATTGCAGGATCCTCTCATCGTAGAACGTGGTTGTGAATATGTAGCCGATCACTACCCTTTTCGTTCTGCATTAACTTGGATTAAAGATAATGATCTTCTATACCTTTGTCTTACCAAGGGTTTTGACGACTGTTGTTACCGCATTAATGGAGGATGGAACGGATATGAAGATCGCCTTGAAAAATACAAAATCTGCAAAAAGGTATTTAATGTCCTCTGA
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0001d1c846_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(80wC7)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50