Protein

Protein accession
6dgY1 [EnVhog]
Representative
89f4d
Source
EnVhog (cluster: phalp2_14039)
Protein name
6dgY1
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAREIFVDLSSFQKDLTVDDYEKLGVKKAIVKISESTTYANPYIHDLVDKAAAGGVNAFAFYHFGRFTNDAQAANEAKYFIDNAKNQVNVKPKTLMILDAEINGMPTSSVIVFLKALRDAGFRTGFYTGKNLLASFDLEEIKPYMDFFWLASYPTVAPADKNPDFNYFPSAKYVDAWQYTDNLLGMKVDGSITVTDNGNDVFDANNSSQSS
Physico‐chemical
properties
protein length:211 AA
molecular weight:23492,1 Da
isoelectric point:4,81
hydropathy:-0,24
Representative Protein Details
Accession
89f4d
Protein name
89f4d
Sequence length
178 AA
Molecular weight
20045,24890 Da
Isoelectric point
6,05463
Sequence
MANREVVIDLASFQSELTVQDYKNIGATKAIVKISENTNYVNPYIRSLIDRSAGGGVNGFAFYHFGRFYNDSAAVAEANYFIENAKRYANVQPGTLLILDAEIKGMPTSSVIAFLQTLRNAGYHTGFYTYKFLLPQFNLEEIHKYADFFWLAAYPLGDQPGPRVPDFNYFPSANNQHS
Other Proteins in cluster: phalp2_14039
Total (incl. this protein): 3 Avg length: 233,3 Avg pI: 5,68

Protein ID Length (AA) pI
89f4d 178 6,05463
A0A8S5NG23 311 6,16728
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12156
6iQWP
264 35,7% 176 6.187E-20
2 phalp2_37341
8v6XY
562 28,8% 170 2.178E-12
3 phalp2_22092
5WvnN
1 29,1% 185 7.468E-12
4 phalp2_36241
7z7Vw
21 28,3% 141 7.239E-08
5 phalp2_20117
23DDy
70 27,8% 151 1.329E-07
6 phalp2_20361
2Sd5N
42 30,1% 159 6.044E-07

Domains

Domains
GH25
Disordered region
Representative sequence (used for alignment): 89f4d (178 AA)
Member sequence: 6dgY1 (211 AA)
1 178 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (89f4d) rather than this protein.
PDB ID
89f4d
Method AlphaFoldv2
Resolution 87.29
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50