Protein

Protein accession
4066o [EnVhog]
Representative
1kgCZ
Source
EnVhog (cluster: phalp2_19074)
Protein name
4066o
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MDNYNRYIDEWRKQNRDDMIEKVIDAIIGIMIVAAITFIFWPREVGATDWVGGHFGYSSLRDPKPVYVEEHFWDEEKALDIWKNNPKIEYTIKSLTLQNTIWRYLTDEMGLSNEAAAGIFGNMMVECGSRSFNLHPYVYSPGGYYYGLCQWNTFGHHSSIAGGTLEDQLEYLADTIQSEMGQRAYQSFCSSDTPEEAANIFGQWYERCQEPYGRQSEARRAYERFGTS
Physico‐chemical
properties
protein length:228 AA
molecular weight:26456,1 Da
isoelectric point:4,79
hydropathy:-0,56
Representative Protein Details
Accession
1kgCZ
Protein name
1kgCZ
Sequence length
229 AA
Molecular weight
26071,25160 Da
Isoelectric point
6,83798
Sequence
MKNIVLSLLVAILSIIMLFIGTAAKVDNDFRTTATISEVYIPSEQQPKEKEKIEVTDSILYKEPFEINYKTITWEEYQEIQRKEKKKKAVVRKQIPGNAAQIIWNTFKSWDWNDAVSAGVLGNIMAEAGGHSLSINPYLYGEGGAYYGICQWSLYYFPSVHNADLYEQLNHLKSTISAQLSQYGYYNFLTIGNAAEAARVFAKYYERCASWSYSSRERNAQIAYASFVG
Other Proteins in cluster: phalp2_19074
Total (incl. this protein): 5 Avg length: 217,8 Avg pI: 5,03

Protein ID Length (AA) pI
1kgCZ 229 6,83798
3e0sp 212 4,42943
3h67o 207 4,75034
4FRDa 213 4,35753
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_31630
4DBqn
83 48,1% 164 8.807E-40
2 phalp2_3410
3PMQ6
4 37,2% 161 2.416E-18
3 phalp2_33233
5o1H8
8 19,7% 147 1.105E-04

Domains

Domains
Disordered region
Phage_lys2
Representative sequence (used for alignment): 1kgCZ (229 AA)
Member sequence: 4066o (228 AA)
1 229 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4066o
Method AlphaFoldv2
Resolution 74.54
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (1kgCZ) rather than this protein.
PDB ID
1kgCZ
Method AlphaFoldv2
Resolution 75.76
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50