Protein

Protein accession
DZyH [EnVhog]
Representative
3awG3
Source
EnVhog (cluster: phalp2_8903)
Protein name
DZyH
Lysin probability
77%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MFKNAYDYLKNTPTSNFSPSEWPVPEDLKYVNAEVLYNLDITRYAFGRSVFPSGVPDGLVRRTGSKNSRHYIGDGSLGKAVDVFPSVAVMDFWICAVENTVWQGIGVYLDTNVNRRQPQPMVHLDIDRRKHGNRVFWVRNRGEYIYKHEEPEKFWDLIAEASTR
Physico‐chemical
properties
protein length:164 AA
molecular weight:19005,2 Da
isoelectric point:7,84
hydropathy:-0,54
Representative Protein Details
Accession
3awG3
Protein name
3awG3
Sequence length
167 AA
Molecular weight
19450,85930 Da
Isoelectric point
8,05450
Sequence
VIYRNAYEYLIARNHQSFFTPEEWPVPEDLKFVKAEVIYELNNTRRNFGKAVFPSRVPEGLVRRSGSRTSRHYIGDGRLGMAVDVFPSVNVMDFWLCAVENLHWRGIGVYLDTNVNSMQPQPMVHLDIDNRSHGQRALWVRDEKGVYIHKSTDPELFWELLAKASTR
Other Proteins in cluster: phalp2_8903
Total (incl. this protein): 6 Avg length: 164,3 Avg pI: 7,24

Protein ID Length (AA) pI
3awG3 167 8,05450
2jVNR 165 5,94373
2kYkn 158 5,76770
8aVXq 166 6,91108
8qowO 166 8,94455
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_962
2beLw
61 42,1% 133 1.364E-33
2 phalp2_4766
4Uo8o
43 30,2% 152 5.515E-12
3 phalp2_10510
82c7B
113 30,0% 133 4.153E-10
4 phalp2_15915
4FDix
111 27,2% 143 1.613E-06
5 phalp2_13381
4il4F
53 25,8% 116 2.033E-04

Domains

Domains
Unannotated
Representative sequence (used for alignment): 3awG3 (167 AA)
Member sequence: DZyH (164 AA)
1 167 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3awG3) rather than this protein.
PDB ID
3awG3
Method AlphaFoldv2
Resolution 92.39
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50