Protein

Protein accession
6z3mw [EnVhog]
Representative
2E5KU
Source
EnVhog (cluster: phalp2_4413)
Protein name
6z3mw
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
AVQAVDPVDLIERFSQAKEDFYRSLTTFATFGKGWLNRVADVKVKASAMLA
Physico-chemical properties
protein length: 51 AA
molecular weight: 5662.4 Da
isoelectric point: 8.42
hydropathy: 0.05
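
The page does not say how these values were derived. As a rough cross-check, the sketch below recomputes length, molecular weight, and mean hydropathy from the sequence listed above. It assumes standard average residue masses and the Kyte-Doolittle hydropathy scale; both are assumptions, not something the source confirms, and the function names are illustrative only. The isoelectric point is left out because it depends on the chosen pKa set.

```python
# Assumption: average (not monoisotopic) residue masses in Da.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added for the free N- and C-termini

# Assumption: Kyte-Doolittle hydropathy scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 6z3mw as listed on this page.
SEQ = "AVQAVDPVDLIERFSQAKEDFYRSLTTFATFGKGWLNRVADVKVKASAMLA"


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water molecule, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


def mean_hydropathy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue (GRAVY score)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"length: {len(SEQ)} AA")
    print(f"molecular weight: {molecular_weight(SEQ):.1f} Da")
    print(f"hydropathy: {mean_hydropathy(SEQ):.2f}")
```

Under these assumptions the recomputed values come out close to the ones listed for this protein, which suggests (but does not prove) that the page uses a similar calculation.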
Representative Protein Details
Accession
2E5KU
Protein name
2E5KU
Sequence length
55 AA
Molecular weight
6142.97100 Da
Isoelectric point
8.18931
Sequence
MTLAAVSNFDAAELIERFSQAKEDFYRSLPTFATFGRGWLNRVADVKLKATSMIG
Other Proteins in cluster: phalp2_4413
Total (incl. this protein): 28
Avg length: 69.2 AA
Avg pI: 7.58

Protein ID  Length (AA)  pI
2E5KU       55           8.18931
1JxKG       68           6.38634
1cKY3       77           7.18146
1nBYC       102          5.72462
1ybkv       92           6.45432
1zqer       76           9.39667
2Rnr0       66           9.39712
2Y2P0       59           9.51884
33KbS       58           8.38548
46aYi       100          5.69484
48gnk       73           5.04931
4GDEr       53           9.30081
4aK43       58           8.37794
4bSrI       88           9.30222
4lpEd       73           5.04926
4oJFM       83           10.34971
4rbJl       43           4.87112
4rmFU       53           9.30081
51qpO       69           9.69142
56SlF       80           6.95535
5CeQW       63           5.62123
5h2hD       64           4.97832
6Awok       88           5.65897
6KFnN       58           6.08901
7EOa        69           9.25465
JIV4        65           8.31605
e92H        53           9.30081
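
The summary line for this cluster (28 proteins, average length 69.2, average pI 7.58) can be checked against the rows above. The sketch below is one way to do that, assuming the averages include this protein (6z3mw, 51 AA, pI 8.42) alongside the 27 listed members. The helper names are hypothetical, and the parser accepts either a decimal comma or a decimal point.

```python
# Cluster members as listed on this page: protein ID, length (AA), pI.
CLUSTER_ROWS = """\
2E5KU 55 8,18931
1JxKG 68 6,38634
1cKY3 77 7,18146
1nBYC 102 5,72462
1ybkv 92 6,45432
1zqer 76 9,39667
2Rnr0 66 9,39712
2Y2P0 59 9,51884
33KbS 58 8,38548
46aYi 100 5,69484
48gnk 73 5,04931
4GDEr 53 9,30081
4aK43 58 8,37794
4bSrI 88 9,30222
4lpEd 73 5,04926
4oJFM 83 10,34971
4rbJl 43 4,87112
4rmFU 53 9,30081
51qpO 69 9,69142
56SlF 80 6,95535
5CeQW 63 5,62123
5h2hD 64 4,97832
6Awok 88 5,65897
6KFnN 58 6,08901
7EOa 69 9,25465
JIV4 65 8,31605
e92H 53 9,30081
"""

# Assumption: the page's totals include the protein this record describes.
THIS_PROTEIN = ("6z3mw", 51, 8.42)


def parse_decimal(text: str) -> float:
    """Parse a number written with a decimal comma or a decimal point."""
    return float(text.replace(",", "."))


def parse_rows(block: str) -> list[tuple[str, int, float]]:
    """Split whitespace-separated rows into (protein_id, length, pI)."""
    rows = []
    for line in block.strip().splitlines():
        protein_id, length, pi = line.split()
        rows.append((protein_id, int(length), parse_decimal(pi)))
    return rows


def summarize(rows: list[tuple[str, int, float]]) -> tuple[int, float, float]:
    """Return (count, mean length, mean pI) for the given rows."""
    count = len(rows)
    mean_length = sum(row[1] for row in rows) / count
    mean_pi = sum(row[2] for row in rows) / count
    return count, mean_length, mean_pi


if __name__ == "__main__":
    count, mean_length, mean_pi = summarize(parse_rows(CLUSTER_ROWS) + [THIS_PROTEIN])
    print(f"Total: {count}  Avg length: {mean_length:.1f}  Avg pI: {mean_pi:.2f}")
```

With this protein included, the recomputed count and averages agree with the summary line after rounding.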
Similar Clusters (pHMM search)
#  Cluster                # Members  Identity (%)  Alignment Length  E-value
1  phalp2_33666 (Z8aS)    37         48.1%         54                7.127E-19
2  phalp2_1793 (31LbW)    7          45.2%         42                8.621E-13
3  phalp2_22020 (5h7FG)   31         69.4%         36                1.631E-12
4  phalp2_6437 (8C5d6)    1          36.0%         50                5.088E-10
5  phalp2_39877 (1ayoe)   23         42.8%         56                9.636E-10
6  phalp2_21563 (2cRdU)   12         37.0%         54                1.709E-08
7  phalp2_13859 (7szFa)   1          35.1%         37                5.770E-07
8  phalp2_4278 (84OYZ)    16         25.9%         54                3.721E-05

Domains

Unannotated
Representative sequence (used for alignment): 2E5KU (55 AA)
Member sequence: 6z3mw (51 AA)
Axis: residues 1 to 55 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD (enzymatically active domain), CBD (cell wall binding domain), Linker, Disordered, Unannotated

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2E5KU) rather than this protein.
PDB ID 2E5KU
Method AlphaFold v2
Resolution 94.84
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
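
The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend uses strict inequalities and does not say where exactly 90, 70, or 50 belong, so assigning those boundary values to the lower band is an assumption, and the function name is hypothetical. Reading the 94.84 listed for the representative model as a mean pLDDT is also an assumption, since the page labels it "Resolution".

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower
    band, because the legend only gives strict inequalities.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90.0:
        return "Very high"
    if plddt > 70.0:
        return "High"
    if plddt > 50.0:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # Assumption: 94.84 is treated here as a mean pLDDT for model 2E5KU.
    print(plddt_band(94.84))
```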