Protein

Protein accession
C8CHK7 [UniProt]
Representative
4DyQf
Source
UniProt (cluster: phalp2_23122)
Protein name
Endolysin
Lysin probability
100%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MRFRRFALPALILLLLVVGLSRPKPPKKPGTPVVNSNKEEELHPELRKRWRLAAAEYARRFPSLPRPYLAYAYRSMEEQARLYAVGRARPGQLYVAHGVDGKSYPVIAPSLKEFPDWRIITNARPGQSLHNYRPALAFDVAFQDGKGGFSCLECFQKFGQIAKSYGLEWGGDWRVRDYPHFQPPNYTWQMAQAGVPPRFTKEV
Physico‐chemical
properties
protein length:203 AA
molecular weight:23289,6 Da
isoelectric point:10,00
hydropathy:-0,49
Representative Protein Details
Accession
4DyQf
Protein name
4DyQf
Sequence length
121 AA
Molecular weight
13656,53100 Da
Isoelectric point
9,41575
Sequence
VLWRQSRSAEEVQNTIQRLQKGTVFQQKQAEYLIKAGPSTGPWATNALPLESAHQWGLAIDLCPLVDGKAAWDRIDLFKRMGSIGKSLGLVWGGDWRKKDWGHFEIPNWTQIAKKMIGDLA
Other Proteins in cluster: phalp2_23122
Total (incl. this protein): 2 Avg length: 162,0 Avg pI: 9,71

Protein ID Length (AA) pI
4DyQf 121 9,41575
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_26149
d8Lo
162 44,0% 125 2.295E-22
2 phalp2_3003
1DUEy
3 31,3% 102 4.303E-12
3 phalp2_13223
2QtNq
30 28,7% 101 6.500E-07
4 phalp2_24111
8q05w
31 29,7% 131 4.505E-04

Domains

Domains [InterPro]
Representative sequence (used for alignment): 4DyQf (121 AA)
Member sequence: C8CHK7 (203 AA)
1 121 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13539

Taxonomy

  Name Taxonomy ID Lineage
Phage Thermus virus P23-77
[NCBI]
1714272 Halopanivirales > Sphaerolipoviridae > Gammasphaerolipovirus >
Host Thermus thermophilus
[NCBI]
274 Deinococcus-Thermus > Deinococci > Thermales > Thermaceae > Thermus >

Coding sequence (CDS)

Coding sequence (CDS)
CDS Source ID
CDS Source
GQ403789 [NCBI]
CDS location
range 4829 -> 5440
strand +
CDS
GTGCGGTTCAGGAGGTTCGCGCTACCCGCGCTGATACTCCTTCTCCTGGTGGTGGGGCTGTCCCGGCCTAAGCCGCCTAAGAAGCCCGGGACCCCTGTGGTCAACAGCAACAAGGAGGAGGAGCTTCACCCAGAGCTGCGCAAGCGCTGGAGGCTGGCCGCGGCGGAATACGCCCGGCGGTTTCCGTCCCTTCCCCGGCCGTACCTGGCCTACGCCTACCGCTCCATGGAAGAGCAGGCCCGCCTCTACGCTGTGGGCAGGGCGAGGCCGGGGCAGCTGTACGTGGCGCATGGAGTGGACGGCAAAAGCTACCCCGTCATCGCCCCCAGCTTGAAGGAGTTTCCCGACTGGCGCATCATCACCAACGCCCGGCCCGGTCAGTCCCTCCACAACTACAGGCCGGCCCTGGCCTTTGACGTGGCGTTCCAGGACGGCAAAGGAGGGTTCTCCTGCCTGGAGTGCTTTCAGAAGTTCGGCCAGATAGCCAAGTCCTATGGCTTGGAGTGGGGCGGCGACTGGCGGGTCAGAGATTACCCCCACTTCCAGCCCCCCAACTACACCTGGCAGATGGCCCAGGCCGGCGTTCCGCCAAGGTTTACGAAGGAGGTTTGA

Gene Ontology

Description Category Evidence (source)
GO:0008233 peptidase activity molecular function None (UniProt)

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi0001b2f25c_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4DyQf) rather than this protein.
PDB ID
4DyQf
Method AlphaFoldv2
Resolution 93.96
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50