Protein

Protein accession
HLZA [EnVhog]
Representative
5H7aG
Source
EnVhog (cluster: phalp2_22068)
Protein name
HLZA
Lysin probability
70%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAIRQITTPNVGIGASRGWCLKYVDDTTNAPARARTAEQAYQIEARNGNIRGGEPPVGVWVPLFFSLTRGQFAGLGHVALAFNHGNGIIEIHDSEVQSGARRPYNSINELLAWFAFHGIIYRGWSIWCDGTRYVEEYTPAPAQPQATGLVPASGTATVLVNALNVRAEPSTASEIVATYANGQQFNYDGYLIANGYVWLSYIGGSGHRRYVAEGVFDNDPNNVFVRGGVSR
Physico‐chemical
properties
protein length:231 AA
molecular weight:25124,7 Da
isoelectric point:6,51
hydropathy:-0,17
Representative Protein Details
Accession
5H7aG
Protein name
5H7aG
Sequence length
226 AA
Molecular weight
24530,79580 Da
Isoelectric point
6,41010
Sequence
MATEVRQSTYPNINVPATRGYCLKYVDDGVNAPNRKPTAQSSWDSNPDKRSGDLPVGVWVPIFFSLTKGPYAGLGHIAWAFNHGGGWVEIHDSETQTKARPVYRSINEVLQWFGNYAPVYLGWSLSVDGARIAQEFTVQESAPSGLHNAKGTATVLVDALNVRNAPDKNSASVAVYSKGQAFNYDGYVTANGYVWLSYVSNSGVRRYVAEGPDDGNNNTVYVSGGV
Other Proteins in cluster: phalp2_22068
Total (incl. this protein): 6 Avg length: 223,3 Avg pI: 7,01

Protein ID Length (AA) pI
5H7aG 226 6,41010
7jP4d 223 8,98446
8tXBd 222 8,85410
8tqGN 211 5,93595
bohC 227 5,37466
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13160
2cTZ9
1 46,7% 152 1.248E-55
2 phalp2_26479
23VRG
24 33,5% 149 1.625E-16
3 phalp2_15311
1jnhU
14 28,1% 174 2.993E-16
4 phalp2_4954
6F7cq
8 27,3% 179 1.866E-15
5 phalp2_23362
5H2Et
1 28,3% 159 1.478E-12
6 phalp2_34123
fsVs
5 28,2% 145 1.647E-11
7 phalp2_4493
3lAfw
1 26,3% 182 1.347E-10
8 phalp2_1465
1Mi2x
11 27,7% 144 1.985E-09

Domains

Domains
Unannotated
SH3_5
Representative sequence (used for alignment): 5H7aG (226 AA)
Member sequence: HLZA (231 AA)
1 226 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08460

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
HLZA
Method AlphaFoldv2
Resolution 93.69
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (5H7aG) rather than this protein.
PDB ID
5H7aG
Method AlphaFoldv2
Resolution 91.58
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50