Protein

Protein accession
5l0V3 [EnVhog]
Representative
4wcDC
Source
EnVhog (cluster: phalp2_39337)
Protein name
5l0V3
Lysin probability
82%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSPDLVAHVKQAEGLRLRAYRCPAGYWTVGYGHRVPSSKTTVTAGQAEVWLVEDLRKAERQALGLVPGLAGRRLDALTDLVFNVGLGALDGDDPDSPLDDAGVVRALRAGDWLAAATRFRQWCHARVGGQLVVLPGLAARRAVGARWIEEG
Physico‐chemical
properties
protein length:151 AA
molecular weight:16235,3 Da
isoelectric point:8,74
hydropathy:-0,08
Representative Protein Details
Accession
4wcDC
Protein name
4wcDC
Sequence length
195 AA
Molecular weight
21132,17880 Da
Isoelectric point
8,98104
Sequence
VGETKGAKPGMKFTREQCLQILLDSGLARHEAGMRKCINEPDIVPTKTYVAMISLTYNIGVGGFCKSSIVKRWNAGDRYGACNAFSAYVKAKGRVLPGLVARRRGERALCVDGLDELVTVNFEDSIVPEHRPILKLGASGFWVEQAQRDLGIAVTGRFDSKMEDAVKAFQKATGALKVTGVIDAATWAVLIGQED
Other Proteins in cluster: phalp2_39337
Total (incl. this protein): 15 Avg length: 220,9 Avg pI: 9,36

Protein ID Length (AA) pI
4wcDC 195 8,98104
1IGyP 193 8,58417
1Xv4j 198 9,63211
35E3E 262 9,62830
4eIC4 198 9,68194
7UzDc 262 10,04432
84xMx 262 9,88083
8dorA 198 9,09161
8ejfp 198 9,12964
8iMhv 194 9,45198
8j3kT 220 7,73496
8qCGi 260 9,93079
8sJsO 260 9,91996
8sxJy 262 9,99178
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_24666
5IWY5
127 30,4% 197 4.699E-26
2 phalp2_34307
3A2Hz
178 30,9% 194 1.987E-24
3 phalp2_16385
5LDDR
25 30,3% 191 1.009E-21
4 phalp2_33113
4KEiU
2 29,5% 122 8.443E-12
5 phalp2_6611
1poy8
1 27,9% 193 1.313E-10

Domains

Domains
Representative sequence (used for alignment): 4wcDC (195 AA)
Member sequence: 5l0V3 (151 AA)
1 195 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959, PF01471

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4wcDC) rather than this protein.
PDB ID
4wcDC
Method AlphaFoldv2
Resolution 90.33
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50