Protein

Protein accession
8GzQ4 [EnVhog]
Representative
6NaQ7
Source
EnVhog (cluster: phalp2_24770)
Protein name
8GzQ4
Lysin probability
99%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
mspsdraslrgdltrdegvrlrlyqdtvgkwtigcgrnlsdngisqqeallllehdienaerdarkacpkyetlsgtrqralcnmsfnlgltrllrfrkmltaiadadfetaatealksrwatqvgqravriaalirag
Physico‐chemical
properties
protein length:139 AA
molecular weight:15495,5 Da
isoelectric point:9,50
hydropathy:-0,42
Representative Protein Details
Accession
6NaQ7
Protein name
6NaQ7
Sequence length
159 AA
Molecular weight
17462,52100 Da
Isoelectric point
5,30918
Sequence
MTPEGRLQLCADEGSRSRAYVDVTGNVSIGVGRNLTGKGLADSEIAMLLSNDIRDAETALAGYTWFAVIEPVRLDVCAMMVFNLGATGFASYRHMQAALAAGDWQGAADQLWDSGAARLLETRYRRFWWAMVTASWSPADWSLTDGLGRKRDASNWPMH
Other Proteins in cluster: phalp2_24770
Total (incl. this protein): 5 Avg length: 147,2 Avg pI: 5,71

Protein ID Length (AA) pI
6NaQ7 159 5,30918
4XlX5 141 4,64877
8D9x8 156 4,74437
8Dou2 141 4,35378
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_17008
8jNgs
3526 40,4% 131 5.521E-41
2 phalp2_32797
2yPO8
5731 40,9% 127 9.418E-40
3 phalp2_9780
80F6A
154 38,3% 120 1.457E-37
4 phalp2_32153
6Jjiu
15 42,7% 124 9.651E-37
5 phalp2_16834
1uDRT
24 37,1% 132 9.651E-37
6 phalp2_24213
2KXlJ
83 30,7% 140 1.883E-30
7 phalp2_27968
RI1z
5083 37,0% 124 4.841E-30
8 phalp2_7697
6I6lz
232 36,7% 147 4.841E-30
9 phalp2_6928
2KMCh
358 34,5% 159 4.841E-30
10 phalp2_8442
12q1i
28 31,5% 130 3.197E-29

Domains

Domains
GH24
Disordered region
Representative sequence (used for alignment): 6NaQ7 (159 AA)
Member sequence: 8GzQ4 (139 AA)
1 159 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00959

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6NaQ7) rather than this protein.
PDB ID
6NaQ7
Method AlphaFoldv2
Resolution 87.56
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50