Protein

Protein accession
35zqy [EnVhog]
Representative
35z4C
Source
EnVhog (cluster: phalp2_34225)
Protein name
35zqy
Lysin probability
99%
PhaLP type
endolysin
Probability: 83% (predicted by ML model)
Protein sequence
MITHKQWVNKYLHIIIDVDKAYGGQCVDAARSHMSEVDGYSDIEGVRGAVDFFTKYDDMPKLKAAYFKIAYQPGMIPPDGAKVVWGTTPNNSFGHIAVSDESGQIRLGIMEQDGFDNPARDVNKGTSKGLVKRETGYGNVLGWLVPREKPVEIKVSGNVPPAGSTIAESDRTAQIGFYWLNKTSDPKNPRWMLLTAAGWKFI
Physico‐chemical
properties
protein length:202 AA
molecular weight:22352,2 Da
isoelectric point:7,79
hydropathy:-0,39
Representative Protein Details
Accession
35z4C
Protein name
35z4C
Sequence length
202 AA
Molecular weight
22433,27280 Da
Isoelectric point
7,79843
Sequence
MITHKQWVNKYLHIIIDVDKAYGGQCVDAARSHMSEVDGYSDIEGVRGAVDFFTKYDDMPKLKAAYFRIAYQPGMIPPDGAKIVWGTTPNNSFGHIAMSDESGQIRMGVMEQDGFDNPARDVSKGTSKGLVKRETGYGNVLGWLVPREKPVEIKVSGNVPPAGSTIAESDRTAQIGFYWLNKTSDPKNPRWMLLTATGWKFI
Other Proteins in cluster: phalp2_34225
Total (incl. this protein): 4 Avg length: 200,0 Avg pI: 8,64

Protein ID Length (AA) pI
35z4C 202 7,79843
1LD93 202 10,06121
7IC9H 194 8,92534
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3543
4zM0o
163 34,4% 151 1.255E-26
2 phalp2_17703
71dZW
23 29,4% 146 9.681E-20
3 phalp2_39580
6iLgH
177 29,9% 167 1.152E-18
4 phalp2_7271
3X4sS
15 28,2% 145 1.569E-18
5 phalp2_9654
1P0dk
1 30,7% 153 7.357E-18
6 phalp2_1580
81s3j
1 29,2% 164 3.444E-17
7 phalp2_24452
4BMZo
66 29,3% 150 2.979E-16
8 phalp2_37199
1X3cR
12 25,3% 142 1.888E-15
9 phalp2_18809
1hN6j
3 29,3% 160 3.491E-15
10 phalp2_11577
18WTO
29 29,0% 172 7.506E-14

Domains

Domains
Unannotated
Disordered region
Representative sequence (used for alignment): 35z4C (202 AA)
Member sequence: 35zqy (202 AA)
1 202 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
35zqy
Method AlphaFoldv2
Resolution 74.83
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (35z4C) rather than this protein.
PDB ID
35z4C
Method AlphaFoldv2
Resolution 75.13
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50