Protein

Protein accession
D6PH71 [UniProt]
Representative
80wC7
Source
UniProt (cluster: phalp2_9770)
Protein name
Glycoside hydrolase family 19 catalytic domain-containing protein
Lysin probability
100%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MQYGLKILLEFGYSNKVFTDLYYKKMTINLSKAAKWYKEESHQLAAWNWLESQLTDDQIDEFACMYRAGPANPTGRVITPDIMQQLTGYAANKFDNTFCGDFNKLLMITKFDQHKGAMCMLIANLMHETGNFRWMKEIADGTAYNNRADLGNGPYDGPKYKGTGVLMLTGKYNYTRLAAELQDPLIVERGCEYVADHYPFRSALTWIKDNDLLYLCLTKGFDDCCYRINGGWNGYEDRLEKYKICKKVFNVL
Physico‐chemical
properties
protein length:252 AA
molecular weight:29106,1 Da
isoelectric point:6,88
hydropathy:-0,42
Representative Protein Details
Accession
80wC7
Protein name
80wC7
Sequence length
66 AA
Molecular weight
7282,34720 Da
Isoelectric point
6,70839
Sequence
MATGFDKHREAMCMLIANLLHETGNFRWMSEIADGSAYEMRADLGNVYPGDGKKFKGAGVLMLTGR
Other Proteins in cluster: phalp2_9770
Total (incl. this protein): 2 Avg length: 159,0 Avg pI: 6,79

Protein ID Length (AA) pI
80wC7 66 6,70839
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19257
826kG
1 54,3% 46 1.005E-11

Domains

Domains [InterPro]
Unannotated
Representative sequence (used for alignment): 80wC7 (66 AA)
Member sequence: D6PH71 (252 AA)
1 66 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage uncultured phage MedDCM-OCT-S04-C348
[NCBI]
743545 No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
GU943056 [NCBI]
CDS location
range 1364 -> 2122
strand +
CDS
ATGCAGTATGGCCTCAAAATCCTGTTAGAGTTTGGTTATAGTAATAAAGTTTTTACCGACCTATATTATAAAAAAATGACAATCAATCTCAGTAAAGCTGCAAAGTGGTACAAGGAAGAATCACACCAGCTAGCTGCCTGGAACTGGCTAGAATCGCAACTTACAGACGATCAAATTGATGAATTTGCTTGTATGTATCGTGCAGGTCCAGCAAATCCAACAGGTCGCGTTATTACACCGGATATTATGCAGCAATTAACGGGTTATGCTGCTAACAAATTTGATAATACATTTTGTGGCGACTTTAATAAGTTGTTGATGATAACAAAGTTTGATCAACATAAAGGAGCCATGTGCATGTTAATTGCAAACCTTATGCATGAAACAGGTAATTTCCGTTGGATGAAAGAAATTGCAGATGGGACTGCATATAATAATCGTGCTGATTTAGGTAACGGTCCCTACGACGGACCTAAATATAAAGGTACAGGAGTACTAATGCTGACTGGTAAATACAACTACACACGCCTAGCTGCTGAATTGCAGGATCCTCTCATCGTAGAACGTGGTTGTGAATATGTAGCCGATCACTACCCTTTTCGTTCTGCATTAACTTGGATTAAAGATAATGATCTTCTATACCTTTGTCTTACCAAGGGTTTTGACGACTGTTGTTACCGCATTAATGGAGGATGGAACGGATATGAAGATCGCCTTGAAAAATACAAAATCTGCAAAAAGGTATTTAATGTCCTCTGA

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0001d1c846_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (80wC7) rather than this protein.
PDB ID
80wC7
Method AlphaFoldv2
Resolution 95.77
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50