Protein

Protein accession
CYgK [EnVhog]
Representative
CDyq
Source
EnVhog (cluster: phalp2_16698)
Protein name
CYgK
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKKNDLFIDVSSHNGYDITGILEEMGTQNTIIKISESTSYLNPCRHAQVEQSNPIGFYHFAWFGGDIEEAEREARYFLD
Physico‐chemical
properties
protein length:79 AA
molecular weight:9089,9 Da
isoelectric point:4,71
hydropathy:-0,51
Representative Protein Details
Accession
CDyq
Protein name
CDyq
Sequence length
118 AA
Molecular weight
13347,72770 Da
Isoelectric point
4,94536
Sequence
MKLGKENKQMKKNDLFIDVSSHNGHNIEGIMEDIGTTNTIIKISEGTTYINPCLSAQVEQSNPIGFYHFAWFGGDVDEAEREARYFLDNVPQKVKYLCLDYEDHASGDKQANTDACIR
Other Proteins in cluster: phalp2_16698
Total (incl. this protein): 11 Avg length: 106,9 Avg pI: 4,77

Protein ID Length (AA) pI
CDyq 118 4,94536
5LZ4a 142 4,54453
6Zfr1 104 4,67816
6dx3c 131 4,88437
70tMi 102 4,71004
715OG 117 4,85021
715Pa 98 4,97207
7NafA 104 4,67816
7Opda 102 4,71004
NDPA 79 4,77757
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_29408
1gJlJ
3 52,7% 74 5.960E-20
2 phalp2_3177
8tR7d
1 30,5% 118 2.082E-09
3 phalp2_36207
7gJNV
1 37,0% 81 3.903E-09
4 phalp2_39219
3xEyz
21 33,6% 107 6.582E-08
5 phalp2_10061
7upZQ
3 30,0% 120 2.513E-05
6 phalp2_24735
6DhTX
2 31,9% 94 1.194E-04

Domains

Domains
Representative sequence (used for alignment): CDyq (118 AA)
Member sequence: CYgK (79 AA)
1 118 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (CDyq) rather than this protein.
PDB ID
CDyq
Method AlphaFoldv2
Resolution 93.94
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50