Protein

Protein accession
4L7Wk [EnVhog]
Representative
1gGC5
Source
EnVhog (cluster: phalp2_39901)
Protein name
4L7Wk
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAEIVYTSKQFCDILKHIATQLNTKYDNHFPRNLGYNYGSGYSWDCWNLPKSLIWGWKEGGAVGSYQRANLSTGLGDWNGWTILNCCAGISTDFSNINVGEFLLTEDKGHAGIYVGEFKDRYGQVCNVVECTTSWGTGRVIGSWVESNGVRRSSKGGSISKSWHWHGMLPWIDYAEVAPVVKKVAEDGYWGMDTTRWTQRLLGTYVDGIVSNQPHSNKKYLSNATTGWEFKFFGYRAGSDMIRALQRLIGTTADGYFGHQSVIALQTFLNNRGFDAGAIDGYMGARTVKAWQRYLNSQL
Physico‐chemical
properties
protein length:299 AA
molecular weight:33537,2 Da
isoelectric point:8,63
hydropathy:-0,37
Representative Protein Details
Accession
1gGC5
Protein name
1gGC5
Sequence length
292 AA
Molecular weight
33592,52450 Da
Isoelectric point
9,05151
Sequence
MATVMTNKQYVDTLKHIASIDTVYINRFPYNCGYFNGNTGKFSFDCWNLVKAVINGWQDIRVHGYYVKGFKVTGDIDGATILKKCTTKSKDFTKLNIAGSYLYMKGHAGSYIGETLINGRYYNVIECTGSWTKNVLYSWVDADGTRRRYKGGAKNCKWTDWGLMCWVDYKTIPDPSPAPEGFILGKYIHKGVDYGFVFNPTYYANKYKDLKDAFGNDDKKLFDHFINHGMYENTTYGDNKHCGRQAISNFNPIKYREKNSDVVSAYGTKVEDNPKYYEHYCRFGYNEGRKAT
Other Proteins in cluster: phalp2_39901
Total (incl. this protein): 22 Avg length: 285,9 Avg pI: 8,61

Protein ID Length (AA) pI
1gGC5 292 9,05151
19a8P 297 8,56541
1Jfht 312 9,05492
23uVG 249 9,53012
24Ftv 230 8,64207
37T8v 240 9,67253
38Hat 290 9,12055
3TD0j 306 9,26367
3TJIQ 361 8,85829
3TJhZ 307 8,61409
3ZDj4 291 8,78493
3ZNxp 305 5,16532
3ZRL0 228 8,77004
3gUPE 312 5,39552
3gVgH 291 9,08052
3hRpn 257 9,21713
3nAOv 300 8,95081
40JTF 290 9,11243
4L8cq 232 8,90445
n5vg 273 8,63723
omkR 327 8,35924
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14326
3ZXZ8
9 48,6% 189 1.389E-64
2 phalp2_21686
3dOCR
8 45,5% 193 1.389E-64
3 phalp2_29540
3v8t1
106 28,9% 207 1.509E-41
4 phalp2_13292
3o9d4
478 29,1% 209 2.058E-41
5 phalp2_35339
7rQza
10 25,7% 221 2.741E-34
6 phalp2_34755
3gjml
29 26,8% 238 2.052E-32
7 phalp2_8930
3gSwx
1 29,2% 202 5.166E-32
8 phalp2_13101
8tQAx
63 26,8% 186 9.529E-28
9 phalp2_24045
8beMs
3 27,9% 197 6.618E-24

Domains

Domains
Unannotated
Unannotated
Representative sequence (used for alignment): 1gGC5 (292 AA)
Member sequence: 4L7Wk (299 AA)
1 292 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1gGC5) rather than this protein.
PDB ID
1gGC5
Method AlphaFoldv2
Resolution 90.56
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50