Protein

Protein accession: 4GYzb [EnVhog]
Representative: 4Hvkb
Source: EnVhog (cluster: phalp2_26973)
Protein name: 4GYzb
Lysin probability: 85%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MNGIWQKTAFDATIKKAAEKYGLELWFLRALIWQESRFHPEVVSKRGAIGLCQLLPDTAAELKINPYDPWQNIEGGAKYLAKLLKRYKGDKPLALAGYNAGMGNVRKYKGIPPFKETKNYIKEIMAWENSQPPEAEGEVIA
Physico-chemical properties
Protein length: 141 AA
Molecular weight: 15955.3 Da
Isoelectric point: 9.23
Hydropathy: -0.46
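The page does not say how the hydropathy value is derived. A minimal sketch, assuming it is a grand average of hydropathy (GRAVY) on the Kyte-Doolittle scale, recomputes both the length and the hydropathy from the protein sequence listed above. The helper name `gravy` and the choice of scale are assumptions for illustration, not something the record states.

```python
# Kyte-Doolittle hydropathy value per residue.
# Assumed scale: the record does not name the one it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Protein sequence of 4GYzb, copied from the record.
SEQUENCE = "MNGIWQKTAFDATIKKAAEKYGLELWFLRALIWQESRFHPEVVSKRGAIGLCQLLPDTAAELKINPYDPWQNIEGGAKYLAKLLKRYKGDKPLALAGYNAGMGNVRKYKGIPPFKETKNYIKEIMAWENSQPPEAEGEVIA"


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (hypothetical helper)."""
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


print(len(SEQUENCE))               # sequence length in residues
print(round(gravy(SEQUENCE), 2))   # average hydropathy, two decimals
```

Under that assumption the sketch reproduces the listed length (141 AA) and hydropathy (-0.46), which suggests, but does not prove, that the record uses the same scale.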
Representative Protein Details
Accession: 4Hvkb
Protein name: 4Hvkb
Sequence length: 143 AA
Molecular weight: 15882.38450 Da
Isoelectric point: 7.71353
Sequence
MMDNNPYYDIRKAASVIYRVPMWLIDAQITAESNWNPRAKSHCGAMGLMQLMPVVCEEMGVADPYDPQDNIMGGVGYLAKLLKSRFVRGDIALALAAYNGGIGNLKKYRGIPPFKETREYVAKIMAMKPEGEPETITPLPSNL
Other Proteins in cluster: phalp2_26973
Total (incl. this protein): 3; Avg length: 151.7 AA; Avg pI: 8.78

Protein ID   Length (AA)   pI
4Hvkb        143           7.71353
81eBC        171           9.39789
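The cluster averages count this protein (4GYzb) even though the table lists only the other two members. A short sketch of that arithmetic, using the lengths and pI values given on this page; the variable names are illustrative:

```python
from statistics import mean

# (protein ID, length in AA, pI). The first entry is this protein, which the
# table omits but the "Total (incl. this protein)" line counts.
members = [
    ("4GYzb", 141, 9.23),
    ("4Hvkb", 143, 7.71353),
    ("81eBC", 171, 9.39789),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"{avg_length:.1f}")  # average length, one decimal
print(f"{avg_pi:.2f}")      # average pI, two decimals
```

With all three members included, the rounded results agree with the listed averages of 151.7 AA and 8.78.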
Similar Clusters (pHMM search)
#    Cluster (protein ID)    Members   Identity (%)   Alignment length   E-value
1    phalp2_8756 (8sxGC)     25        50.8           120                1.555E-45
2    phalp2_546 (D4nQ)       40        50.0           110                8.366E-36
3    phalp2_40101 (7ZJzm)    4         43.5           131                2.224E-32
4    phalp2_31948 (7IfgM)    1         44.2           131                4.179E-32
5    phalp2_21911 (4JxwN)    4         47.5           124                1.670E-29
6    phalp2_19158 (1QRTe)    2         42.0           119                2.290E-29
7    phalp2_17644 (6McxP)    2         49.5           101                8.078E-29
8    phalp2_32795 (2xoLt)    8         45.9           135                7.336E-28
9    phalp2_37089 (16u4b)    9         41.9           131                4.860E-27
10   phalp2_17520 (5z96M)    10        42.6           129                4.410E-26

Domains

Representative sequence (used for alignment): 4Hvkb (143 AA)
Member sequence: 4GYzb (141 AA)
Axis: residues 1 to 143 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD (enzymatically active domain), CBD (cell wall binding domain), Linker, Disordered, Unannotated
Pfam accessions: PF01464

Taxonomy

          Name                             Taxonomy ID      Lineage
Phage     Unknown from Metagenome [NCBI]   UNKNOWN_ENVHOG   No lineage information
Host      No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 4GYzb
Method: AlphaFold v2
Resolution: 94.93
Chain position: -
Model confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
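The confidence legend can be read as a simple threshold lookup. The sketch below is one way to express it. The function name `plddt_band` is hypothetical, and the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so assigning them to the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence bands of the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band; the
    legend leaves this open, so that choice is an assumption.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (94.93, 80.0, 60.0, 10.0):
    print(score, plddt_band(score))
```

If the figure reported under Resolution for these AlphaFold models is in fact a mean pLDDT (the page does not say so), a value of 94.93 would fall in the Very high band.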

The structures below correspond to the cluster representative (4Hvkb) rather than this protein.
PDB ID: 4Hvkb
Method: AlphaFold v2
Resolution: 95.02
Chain position: -