Protein

Protein accession
6WGva [EnVhog]
Representative
4I9nl
Source
EnVhog (cluster: phalp2_5710)
Protein name
6WGva
Lysin probability
95%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MTNYIPSPTLIAFIKSKEGCAEWNDELQMFIPYEDSGGISTIGYGHKVTHFDIQAGVFKYGLSQGGCDRLFEQDLAPRVKFINSLDIPSLTQGQVDALIDIAYNVGYGAVKAIQSFGLTNAPATMMHYIHDAHGRELKGLITRRQQDVAWWTGVAEGKIA
Physico-chemical properties
protein length: 160 AA
molecular weight: 17639,8 Da
isoelectric point: 5,66
hydropathy: -0,12
Representative Protein Details
Accession
4I9nl
Protein name
4I9nl
Sequence length
153 AA
Molecular weight
17146,31450 Da
Isoelectric point
5,65755
Sequence
MTNYIPSPTLIAFIKSKEGCAEWNDELQMFMPYEDSGGISTIGYGHKVTHFDIQVGVFKYGLSQGGCDRLFEQDLAPRVKFINSLDIPSLTQGQVDALIDIAYNVGYGAVKAIQSFGLTNAPVTMMHYIHDAHGRELKGLITRREQDVKWWNS
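The member and representative sequences differ in length (160 AA and 153 AA) and at a few residues. A minimal sketch of a position-by-position comparison follows. It assumes a plain ungapped comparison over the shared 153 residues, which is not necessarily how the clustering pipeline aligned them, and the function name `ungapped_mismatches` is hypothetical.

```python
# Sequences copied from this record (member 6WGva, representative 4I9nl).
MEMBER = (
    "MTNYIPSPTLIAFIKSKEGCAEWNDELQMFIPYEDSGGISTIGYGHKVTHFDIQAGVFKY"
    "GLSQGGCDRLFEQDLAPRVKFINSLDIPSLTQGQVDALIDIAYNVGYGAVKAIQSFGLTN"
    "APATMMHYIHDAHGRELKGLITRRQQDVAWWTGVAEGKIA"
)
REPRESENTATIVE = (
    "MTNYIPSPTLIAFIKSKEGCAEWNDELQMFMPYEDSGGISTIGYGHKVTHFDIQVGVFKY"
    "GLSQGGCDRLFEQDLAPRVKFINSLDIPSLTQGQVDALIDIAYNVGYGAVKAIQSFGLTN"
    "APVTMMHYIHDAHGRELKGLITRREQDVKWWNS"
)


def ungapped_mismatches(a: str, b: str) -> list[int]:
    """Return 1-based positions where a and b differ.

    Assumption: no gaps. zip() stops at the shorter sequence, so only the
    shared prefix length is compared.
    """
    return [i for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


mismatches = ungapped_mismatches(MEMBER, REPRESENTATIVE)
shared = min(len(MEMBER), len(REPRESENTATIVE))
identity = 100.0 * (shared - len(mismatches)) / shared

print(f"lengths: {len(MEMBER)} vs {len(REPRESENTATIVE)}")
print(f"mismatch positions: {mismatches}")
print(f"ungapped identity over {shared} residues: {identity:.1f}%")
```

The identity printed here is only an ungapped approximation and should not be compared directly with the Identity (%) values reported by the pHMM search.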
Other Proteins in cluster: phalp2_5710
Total (incl. this protein): 2
Avg length: 156,5
Avg pI: 5,66

Protein ID  Length (AA)  pI
4I9nl       153          5,65755
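The cluster averages can be reproduced from the two members listed in this record. The sketch below assumes the averages are plain arithmetic means and that values use a decimal comma; the helper name `parse_decimal_comma` is hypothetical.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma string such as '5,66' to a float."""
    return float(text.replace(",", "."))


# Values taken from this record: member 6WGva and representative 4I9nl.
lengths = [160, 153]
isoelectric_points = [parse_decimal_comma("5,66"), parse_decimal_comma("5,65755")]

avg_length = sum(lengths) / len(lengths)
avg_pi = round(sum(isoelectric_points) / len(isoelectric_points), 2)

print(avg_length)  # 156.5
print(avg_pi)      # 5.66
```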
Similar Clusters (pHMM search)
#   Cluster                # Members  Identity (%)  Alignment Length  E-value
1   phalp2_14757 (6QRn1)   45         34,2%         152               9.719E-15
2   phalp2_33221 (5jbqV)   447        35,7%         137               3.635E-12
3   phalp2_20515 (4e8Pa)   1          32,9%         158               6.772E-12
4   phalp2_31105 (1YRHZ)   456        34,0%         144               2.814E-10
5   phalp2_33124 (4MhZ3)   118        34,2%         143               3.837E-10
6   phalp2_9968 (35Zox)    8          31,2%         125               3.352E-09
7   phalp2_2632 (6RhYr)    14867      35,8%         120               1.571E-08
8   phalp2_1869 (3V0OR)    7          29,1%         151               1.571E-08
9   phalp2_4451 (31DIk)    4919       32,7%         119               2.140E-08
10  phalp2_126 (5jCCA)     637        34,1%         129               2.913E-08
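For downstream filtering, the similar-cluster hits can be held as tuples and screened by E-value. This is a minimal sketch: the rows are transcribed from the table above, while the cutoff of 1e-10 and the function name `hits_below` are illustrative assumptions, not thresholds used by the database.

```python
# (cluster, members, identity_percent, alignment_length, e_value)
SIMILAR_CLUSTERS = [
    ("phalp2_14757", 45, 34.2, 152, 9.719e-15),
    ("phalp2_33221", 447, 35.7, 137, 3.635e-12),
    ("phalp2_20515", 1, 32.9, 158, 6.772e-12),
    ("phalp2_31105", 456, 34.0, 144, 2.814e-10),
    ("phalp2_33124", 118, 34.2, 143, 3.837e-10),
    ("phalp2_9968", 8, 31.2, 125, 3.352e-09),
    ("phalp2_2632", 14867, 35.8, 120, 1.571e-08),
    ("phalp2_1869", 7, 29.1, 151, 1.571e-08),
    ("phalp2_4451", 4919, 32.7, 119, 2.140e-08),
    ("phalp2_126", 637, 34.1, 129, 2.913e-08),
]


def hits_below(rows, max_e_value):
    """Return cluster names whose E-value is strictly below max_e_value."""
    return [row[0] for row in rows if row[4] < max_e_value]


# Hypothetical cutoff chosen only for illustration.
strong_hits = hits_below(SIMILAR_CLUSTERS, 1e-10)
print(strong_hits)
```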

Domains

Representative sequence (used for alignment): 4I9nl (153 AA)
Member sequence: 6WGva (160 AA)
Axis: residues 1 to 153 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF00959

Taxonomy

       Name                     Taxonomy ID     Lineage
Phage  Unknown from Metagenome  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID 6WGva
Method AlphaFoldv2
Resolution 93.78
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
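
The confidence bands above can be applied to a score programmatically. The sketch below makes two assumptions: the value listed under "Resolution" for these AlphaFold models is a pLDDT-style score, and scores falling exactly on a boundary (90, 70, 50) go to the lower band, since the legend does not say. The function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    Assumption: boundary values (exactly 90, 70 or 50) are assigned to the
    lower band, because the legend uses strict inequalities only.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Values listed for 6WGva and 4I9nl in this record.
for accession, score in [("6WGva", 93.78), ("4I9nl", 95.66)]:
    print(accession, plddt_band(score))
```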

The structures below correspond to the cluster representative (4I9nl) rather than this protein.
PDB ID 4I9nl
Method AlphaFoldv2
Resolution 95.66
Chain position -