Protein
- Protein accession
- C8CHK7 [UniProt]
- Representative
- 4DyQf
- Source
- UniProt (cluster: phalp2_23122)
- Protein name
- Endolysin
- Lysin probability
- 100%
- PhaLP type
-
endolysin
Probability: 99% (predicted by ML model) - Protein sequence
-
MRFRRFALPALILLLLVVGLSRPKPPKKPGTPVVNSNKEEELHPELRKRWRLAAAEYARRFPSLPRPYLAYAYRSMEEQARLYAVGRARPGQLYVAHGVDGKSYPVIAPSLKEFPDWRIITNARPGQSLHNYRPALAFDVAFQDGKGGFSCLECFQKFGQIAKSYGLEWGGDWRVRDYPHFQPPNYTWQMAQAGVPPRFTKEV
- Physico‐chemical
properties -
protein length: 203 AA molecular weight: 23289,6 Da isoelectric point: 10,00 hydropathy: -0,49
Representative Protein Details
- Accession
- 4DyQf
- Protein name
- 4DyQf
- Sequence length
- 121 AA
- Molecular weight
- 13656,53100 Da
- Isoelectric point
- 9,41575
- Sequence
-
VLWRQSRSAEEVQNTIQRLQKGTVFQQKQAEYLIKAGPSTGPWATNALPLESAHQWGLAIDLCPLVDGKAAWDRIDLFKRMGSIGKSLGLVWGGDWRKKDWGHFEIPNWTQIAKKMIGDLA
Other Proteins in cluster: phalp2_23122
| Total (incl. this protein): 2 | Avg length: 162,0 | Avg pI: 9,71 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 4DyQf | 121 | 9,41575 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_26149
d8Lo
|
162 | 44,0% | 125 | 2.295E-22 |
| 2 |
phalp2_3003
1DUEy
|
3 | 31,3% | 102 | 4.303E-12 |
| 3 |
phalp2_13223
2QtNq
|
30 | 28,7% | 101 | 6.500E-07 |
| 4 |
phalp2_24111
8q05w
|
31 | 29,7% | 131 | 4.505E-04 |
Domains
Domains [InterPro]
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Thermus virus P23-77 [NCBI] |
1714272 | Halopanivirales > Sphaerolipoviridae > Gammasphaerolipovirus > |
| Host |
Thermus thermophilus [NCBI] |
274 | Deinococcus-Thermus > Deinococci > Thermales > Thermaceae > Thermus > |
Coding sequence (CDS)
Coding sequence (CDS)
CDS Source ID
CDS Source
GQ403789
[NCBI]
CDS location
range 4829 -> 5440
strand +
strand +
CDS
GTGCGGTTCAGGAGGTTCGCGCTACCCGCGCTGATACTCCTTCTCCTGGTGGTGGGGCTGTCCCGGCCTAAGCCGCCTAAGAAGCCCGGGACCCCTGTGGTCAACAGCAACAAGGAGGAGGAGCTTCACCCAGAGCTGCGCAAGCGCTGGAGGCTGGCCGCGGCGGAATACGCCCGGCGGTTTCCGTCCCTTCCCCGGCCGTACCTGGCCTACGCCTACCGCTCCATGGAAGAGCAGGCCCGCCTCTACGCTGTGGGCAGGGCGAGGCCGGGGCAGCTGTACGTGGCGCATGGAGTGGACGGCAAAAGCTACCCCGTCATCGCCCCCAGCTTGAAGGAGTTTCCCGACTGGCGCATCATCACCAACGCCCGGCCCGGTCAGTCCCTCCACAACTACAGGCCGGCCCTGGCCTTTGACGTGGCGTTCCAGGACGGCAAAGGAGGGTTCTCCTGCCTGGAGTGCTTTCAGAAGTTCGGCCAGATAGCCAAGTCCTATGGCTTGGAGTGGGGCGGCGACTGGCGGGTCAGAGATTACCCCCACTTCCAGCCCCCCAACTACACCTGGCAGATGGCCCAGGCCGGCGTTCCGCCAAGGTTTACGAAGGAGGTTTGA
Gene Ontology
| Description | Category | Evidence (source) | |
|---|---|---|---|
| GO:0008233 | peptidase activity | molecular function | None (UniProt) |
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
PDB ID
upi0001b2f25c_model
Method
AlphaFold3 (non-commercial)
Resolution
-
Chain position
-
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50
The structures below correspond to the cluster representative
(4DyQf)
rather than this protein.
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50