Protein

Protein accession
8qU76 [EnVhog]
Representative
pNVl
Source
EnVhog (cluster: phalp2_12447)
Protein name
8qU76
Lysin probability
99%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MARLKAPKQSGNKFTAFADKLGPLVYEGLRKRGFTSRAAYDNVMSQLAWESTYGTSDVARNNHNYGGYGYDGNGNYTVFKNDRDFIDAYLNTMSSRYRKALQSNNVYDYARVLKTKGYYGDTYENYSKGLAGMSSLRKAAAKHYHIMGEPYKKPLAPVVPNILDKPIQQ
Physico‐chemical
properties
protein length:169 AA
molecular weight:19100,3 Da
isoelectric point:9,65
hydropathy:-0,77
Representative Protein Details
Accession
pNVl
Protein name
pNVl
Sequence length
169 AA
Molecular weight
19095,22150 Da
Isoelectric point
9,65493
Sequence
MARLKAPKQSGNKFTAFADKLGPLVYEGLRKRGFTSRAAYDNVMSQLAWESTYGTSDVARNNHNYGGYGYDGNGNYTVFKNDRDFIDAYLNTMSSRYRKALQSNNVYDYARVLKTKGYYGDTYENYSKGLAGMNSLRKAAAKHYHIIGEPYKKPLAPVVPNILDKPVQQ
Other Proteins in cluster: phalp2_12447
Total (incl. this protein): 16 Avg length: 165,6 Avg pI: 9,56

Protein ID Length (AA) pI
pNVl 169 9,65493
11voK 171 9,26657
1cwif 177 9,67962
5SMiZ 169 9,67737
5Vfnz 148 9,25877
61vEx 169 9,65493
6sDDm 167 9,63604
6txYM 169 9,65493
6vzbX 144 9,43516
70uHk 169 9,65493
7OCAg 171 9,28263
7Yxz5 169 9,65493
84xUe 169 9,65493
JYeR 150 9,41356
N9NL 169 9,65493
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_28351
1dQRh
5 45,2% 115 2.735E-21
2 phalp2_32468
17iN5
5 26,7% 146 2.308E-12
3 phalp2_33670
ZMY8
6 29,0% 131 1.897E-05
4 phalp2_22760
8m92P
7 26,7% 112 2.565E-05
5 phalp2_3102
419EZ
42 21,4% 140 2.844E-04
6 phalp2_17322
4k4va
55 21,1% 156 2.844E-04

Domains

Domains
GLUCO
Disordered region
Representative sequence (used for alignment): pNVl (169 AA)
Member sequence: 8qU76 (169 AA)
1 169 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (pNVl) rather than this protein.
PDB ID
pNVl
Method AlphaFoldv2
Resolution 85.01
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50