Protein

Protein accession: 3QGEw [EnVhog]
Representative: 3QGEw (this protein)
Source: EnVhog (cluster: phalp2_17246)
Protein name: 3QGEw
Lysin probability: 99%
PhaLP type: endolysin (probability: 98%, predicted by ML model)
Protein sequence:
MAVFTFGERSLAKLSTADTELAEIPKYVLAQGVFDLTIVWGWRSDQQQMDAFLAGNSKKKTGSFHQVTKDGQPWAQAIDFAPWCKLPDGSMGIPWKDTHAFAVLGGMMIAAGEYLGVPIVYGGDWDMDGTTTDQTLMDWGHIQKRNPDAST
Physico-chemical properties
protein length: 151 AA
molecular weight: 16603,6 Da
isoelectric point: 4,83
hydropathy: -0,24
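The length, molecular weight, and hydropathy above can be approximately reproduced from the sequence alone. The following is a minimal sketch, assuming that hydropathy means the GRAVY score on the Kyte-Doolittle scale and that the molecular weight uses average residue masses plus one water molecule. The record does not state which tables the database used, so small deviations are expected. The names `KYTE_DOOLITTLE`, `RESIDUE_MASS`, `gravy`, and `molecular_weight` are illustrative, not part of any database API.

```python
# Kyte-Doolittle hydropathy index per residue (assumed scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (amino acid minus water; assumed table).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.0153  # one water per chain for the free termini

# Sequence of 3QGEw as listed in this record.
SEQUENCE = (
    "MAVFTFGERSLAKLSTADTELAEIPKYVLAQGVFDLTIVWGWRSDQQQMDAFLAGNSKKK"
    "TGSFHQVTKDGQPWAQAIDFAPWCKLPDGSMGIPWKDTHAFAVLGGMMIAAGEYLGVPIV"
    "YGGDWDMDGTTTDQTLMDWGHIQKRNPDAST"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Sum of average residue masses plus one water molecule."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


if __name__ == "__main__":
    print(f"length: {len(SEQUENCE)} AA")
    print(f"molecular weight: {molecular_weight(SEQUENCE):.1f} Da")
    print(f"hydropathy (GRAVY): {gravy(SEQUENCE):.2f}")
```

The isoelectric point is left out because it depends on the chosen pKa set, which the record does not specify.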
Other Proteins in cluster: phalp2_17246
Total (incl. this protein): 39; Avg length: 146,7; Avg pI: 5,36

Protein ID Length (AA) pI
10WJ9 133 4,98492
110Bf 147 5,83477
17dW6 147 5,31902
1Eaf8 143 5,78845
1Rj4V 144 4,91262
1RjQ1 143 4,68753
2SOXZ 148 5,76946
2cxur 168 6,17797
2jXKl 162 6,96053
2k2i8 163 5,35369
2kZUt 142 4,65599
2nPbk 147 4,95007
2qPMc 144 4,56130
2rGqF 147 4,76307
2rKDw 146 4,59415
2rSam 145 4,90023
2uixj 148 4,70612
35F17 142 5,62714
36uDR 146 6,21252
3QbgC 152 4,79036
3eDBe 148 5,28895
4B6j7 143 4,99327
4BJEh 138 5,88024
4GkjH 144 8,12638
4JPpa 143 4,71533
4JQ2t 146 6,27385
4MoAC 146 4,57846
4MoMQ 142 5,23336
4bc6T 144 6,90624
4i3Xz 146 4,66275
5jW3f 146 4,56306
6O6Rf 146 6,13750
8fkYj 146 5,34169
faiy 147 4,50724
fdfA 155 5,84989
jAsu 145 4,82906
k48w 147 5,20932
rwna 141 5,48311
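The pI values in this table use a decimal comma, which Python's `float` does not accept directly. The sketch below shows one way to parse such rows and average a column. The helper `parse_row` is hypothetical, and only the first three rows are included for brevity, so the averages it prints are not the cluster-wide figures reported above.

```python
from statistics import mean


def parse_row(line: str) -> tuple[str, int, float]:
    """Split 'ID length pI' and convert a decimal-comma pI to float."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# First three rows of the member table, copied from this record.
sample_lines = [
    "10WJ9 133 4,98492",
    "110Bf 147 5,83477",
    "17dW6 147 5,31902",
]

rows = [parse_row(line) for line in sample_lines]
avg_length = mean(length for _, length, _ in rows)
avg_pi = mean(pi for _, _, pi in rows)

if __name__ == "__main__":
    print(f"rows parsed: {len(rows)}")
    print(f"average length (sample): {avg_length:.1f}")
    print(f"average pI (sample): {avg_pi:.2f}")
```

To compare against the reported cluster averages, all 38 rows plus the entry for 3QGEw itself would need to be included.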
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_23690 (sMBl) 3369 31,4% 143 8.890E-43
2 phalp2_25774 (7CLmL) 1 30,2% 142 1.208E-32
3 phalp2_22785 (8jxEz) 234 25,4% 153 4.802E-30
4 phalp2_27432 (4Rvdx) 79 29,5% 149 1.693E-29
5 phalp2_39303 (4ipTG) 1 28,7% 146 6.069E-26
6 phalp2_13949 (8Fkan) 8452 25,8% 147 1.745E-23
7 phalp2_26018 (6REqE) 256 26,7% 142 3.272E-23
8 phalp2_18798 (1dhYs) 557 24,4% 147 1.575E-22
9 phalp2_15106 (eKg9) 268 29,4% 146 5.536E-22
10 phalp2_14517 (7CiFr) 1 25,5% 149 5.536E-22

Domains

Protein sequence 3QGEw, residues 1 to 151: Unannotated
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage: Unknown from Metagenome (taxonomy ID: UNKNOWN_ENVHOG; no lineage information)
Host: No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID 3QGEw
Method AlphaFoldv2
Resolution 96.50
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
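The confidence bands in the legend can be expressed as a small lookup. This is a minimal sketch with a hypothetical function name, `plddt_band`. The legend uses strict inequalities and so leaves the exact values 90, 70, and 50 unassigned; the sketch places them in the lower band, which is an assumption. The value 96.50 listed under Resolution may be a mean pLDDT for this AlphaFold model, but the record does not say so, and it is used here only as an example input.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Boundary values (90, 70, 50) fall into the lower band; the legend
    itself does not say where they belong.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (96.5, 80.0, 60.0, 30.0):
        print(value, plddt_band(value))
```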