Protein

Protein accession
49V5X [EnVhog]
Representative
4fGuZ
Source
EnVhog (cluster: phalp2_37725)
Protein name
49V5X
Lysin probability
92%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSLNVPTRTAKPICGTRRSYLSMNHIQLLASSAIVRCKKSTQSLVSNLREAASTRQTIRHAVLTSTYVNEIDTPGTLQARAHKGLTPRRLRLARGVAFVIGISLSIPMASADSGSIDAINPKDYVRLALPKKEAICLSRLIGKESAWNHKAIGNLNSPNKDYVYGLLQLKNPIAKDKSPIEQIHLGLKYIDHRYQGNACNAWNHWLRKGWH
Physico-chemical properties
protein length: 211 AA
molecular weight: 23470,9 Da
isoelectric point: 10,24
hydropathy: -0,32
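The length and hydropathy values above can be re-derived from the sequence itself. The sketch below assumes that "hydropathy" means the GRAVY score (the mean Kyte-Doolittle hydropathy over all residues); the record does not state which scale was used, so treat this as an illustration rather than the database's own method. The names `KYTE_DOOLITTLE`, `gravy` and `SEQ_49V5X` are introduced here for illustration only.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence holds a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Sequence of 49V5X as listed in this record, split only for readability.
SEQ_49V5X = (
    "MSLNVPTRTAKPICGTRRSYLSMNHIQLLASSAIVRCKKSTQSLVSNLREAASTRQTIRH"
    "AVLTSTYVNEIDTPGTLQARAHKGLTPRRLRLARGVAFVIGISLSIPMASADSGSIDAIN"
    "PKDYVRLALPKKEAICLSRLIGKESAWNHKAIGNLNSPNKDYVYGLLQLKNPIAKDKSPI"
    "EQIHLGLKYIDHRYQGNACNAWNHWLRKGWH"
)

if __name__ == "__main__":
    print("length:", len(SEQ_49V5X))
    print("GRAVY:", round(gravy(SEQ_49V5X), 2))
```

Under the Kyte-Doolittle assumption the result should agree with the listed length and, after rounding to two decimals, with the listed hydropathy. Molecular weight and isoelectric point depend on the mass table and pKa set used, so they are not reproduced here.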
Representative Protein Details
Accession
4fGuZ
Protein name
4fGuZ
Sequence length
264 AA
Molecular weight
29401,66120 Da
Isoelectric point
10,68526
Sequence
MNSNVPTRTARLTCGTRRSSQSMNHMIRNASSATARCKRFIQFQVSNSRVLGFTVRTIKWLNAHSAILRLFQSLREFMKKLELFIIASLALLNSNQTNSTRPTQGLQNIGSDLRFYLSAIKSKLTGLVLSGLEPIRGSERAASRIARSVAIAIGISLSIGITPADGGSIDAIQPKEYVRLALDKREATCLSKLIGKESAWNTKAVGNLNSPSKSFTYGLLQLKNPVVKDKSAIEQIHYGLRYINHRYQGDTCKAWAHWLRKGWH
Other Proteins in cluster: phalp2_37725
Total (incl. this protein): 30; Avg length: 193,1 AA; Avg pI: 10,01

Protein ID Length (AA) pI
4fGuZ 264 10,68526
1Jty8 191 9,97334
1KnBp 191 9,98069
1Ltqm 204 9,88502
1MvfD 191 10,01002
27Nuv 176 10,11562
2HP9p 191 10,06605
2rdJG 202 10,17487
30LWH 196 10,20182
31ac1 210 9,96077
31pAu 196 9,86065
375jn 197 9,95696
37hDJ 143 9,57789
3b3mm 218 10,53131
3bgl2 188 9,65609
48Aiu 193 9,96599
49csn 222 9,98546
49ySd 197 9,95522
4bYmb 176 10,17100
4ghXC 200 9,87064
6HhAb 149 9,63636
7XxHn 140 9,76330
7YzNr 143 9,44889
87wjB 242 10,29130
88fLw 200 9,95658
8lzRS 191 10,01176
8mtzu 177 10,15765
SDUf 204 10,16217
dYWx 191 10,12155
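The table lists 29 other members, while the stated total of 30 includes 49V5X itself. The sketch below checks the cluster averages under that reading: it parses the rows as printed, adds 49V5X with the length and pI given in its own record, and takes plain arithmetic means. That the database computes its averages this way is an assumption, and the names `ROWS` and `parse_rows` are illustrative.

```python
from statistics import mean

# Rows copied from the cluster table: protein ID, length (AA), pI.
ROWS = """
4fGuZ 264 10,68526
1Jty8 191 9,97334
1KnBp 191 9,98069
1Ltqm 204 9,88502
1MvfD 191 10,01002
27Nuv 176 10,11562
2HP9p 191 10,06605
2rdJG 202 10,17487
30LWH 196 10,20182
31ac1 210 9,96077
31pAu 196 9,86065
375jn 197 9,95696
37hDJ 143 9,57789
3b3mm 218 10,53131
3bgl2 188 9,65609
48Aiu 193 9,96599
49csn 222 9,98546
49ySd 197 9,95522
4bYmb 176 10,17100
4ghXC 200 9,87064
6HhAb 149 9,63636
7XxHn 140 9,76330
7YzNr 143 9,44889
87wjB 242 10,29130
88fLw 200 9,95658
8lzRS 191 10,01176
8mtzu 177 10,15765
SDUf 204 10,16217
dYWx 191 10,12155
"""


def parse_rows(text: str) -> list[tuple[str, int, float]]:
    """Parse 'ID length pI' rows; accepts a decimal comma or a decimal point."""
    records = []
    for line in text.strip().splitlines():
        protein_id, length, pi = line.split()
        records.append((protein_id, int(length), float(pi.replace(",", "."))))
    return records


records = parse_rows(ROWS)
# Add the protein described by this record (values from its own entry).
records.append(("49V5X", 211, 10.24))

mean_length = mean(r[1] for r in records)
mean_pi = mean(r[2] for r in records)

if __name__ == "__main__":
    print(len(records), round(mean_length, 1), round(mean_pi, 2))
```

With 49V5X included, both means round to the averages given in the cluster summary, which supports reading the total as the 29 listed members plus this protein.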
Similar Clusters (pHMM search)
# Cluster (protein) Members Identity (%) Alignment length E-value
1 phalp2_31544 (4bSnc) 14 43,2 208 2.751E-75
2 phalp2_33936 (24xWk) 49 35,9 242 2.265E-32

Domains

[Figure: domain architecture. Tracks shown: Disordered region, Unannotated. Axis: positions 1 to 264 AA (representative).]
Representative sequence (used for alignment): 4fGuZ (264 AA)
Member sequence: 49V5X (211 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG [NCBI]
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4fGuZ) rather than this protein.
PDB ID
4fGuZ
Method AlphaFoldv2
Resolution 64.15
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
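The confidence legend can be written as a small lookup. The sketch below is illustrative only: the function name `plddt_band` is hypothetical, and because the legend does not say which band the exact boundary values 90, 70 and 50 belong to, they are assigned to the lower band here as an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: exactly 50 (and anything below it) counts as "Very low".
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 80.0, 64.15, 30.0):
        print(value, plddt_band(value))
```

If the 64.15 listed under "Resolution" is in fact a mean pLDDT for the AlphaFold model (the record does not say so, and predicted models have no experimental resolution), it would fall in the "Low" band.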