Protein

Protein accession
4HEFc [EnVhog]
Representative
4HEFc (this protein)
Source
EnVhog (cluster: phalp2_30371)
Protein name
4HEFc
Lysin probability
80%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKRMMLAMSLAIILGATTMSAAQGYTGTLLKYPKGPCRGLRDTPELLIRCAFKTVGIPGEIPTALYVAQRESGLNERAYNGICAGLFQMHLAYWQGRVTEFLYRSQFPNTWPAVPAFNGRANALVAAKMVKRGGWGPWSL
Physico‐chemical
properties
protein length:140 AA
molecular weight:15400,0 Da
isoelectric point:10,02
hydropathy:0,04
Other Proteins in cluster: phalp2_30371
Total (incl. this protein): 36 Avg length: 154,1 Avg pI: 9,34

Protein ID Length (AA) pI
11jGy 155 11,00670
11nfb 168 10,05844
11nly 111 9,29346
18Sil 164 8,43042
1DPxW 155 8,79847
1HDnW 137 10,03620
1dmIo 155 7,73695
1p6tJ 135 10,14212
3QA7r 135 10,04290
3Qy0H 196 10,02730
3Qy6h 170 9,42085
3QyGu 143 9,83074
3Qz0k 136 9,65190
42ebg 140 9,88644
4DiHv 184 8,93082
4DjWY 159 7,78599
4Djcd 196 8,59604
4DzwE 164 8,74167
4F9k0 145 6,04673
4FhEO 179 9,82680
4Fi4A 160 9,82629
4Fj3H 162 10,43023
4Fl7l 142 10,03807
4Fmzh 165 9,26251
4HIuY 155 6,71669
4ItI3 165 9,92950
5Bmla 151 9,89843
5GXWW 162 9,53947
5IIpm 169 9,63063
5JJCL 144 7,72404
5zCWs 148 9,94833
6Kfd7 143 9,91216
80a0Y 136 9,50633
bZKb 133 9,19160
wB3X 147 10,43274
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19119
1DVP3
12 33,3% 105 5.891E-27
2 phalp2_15922
4HJbN
3 32,9% 94 1.296E-15
3 phalp2_5704
4HFPa
1 34,3% 102 1.278E-12
4 phalp2_40503
4DsyH
1 29,2% 99 2.589E-10
5 phalp2_1680
4F4r8
4 33,6% 101 3.297E-07
6 phalp2_26092
7vGVt
16 29,1% 103 2.108E-06
7 phalp2_34493
4JV9D
12 25,7% 128 9.858E-06
8 phalp2_34891
4fZxi
5 29,6% 108 1.341E-05
9 phalp2_10602
5BOvL
251 28,3% 127 3.926E-04

Domains

Domains
Disordered region
Unannotated
Protein sequence: 4HEFc
1 140
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4HEFc
Method AlphaFoldv2
Resolution 81.71
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50