Protein

Protein accession
16n5q [EnVhog]
Representative
4aDch
Source
EnVhog (cluster: phalp2_31541)
Protein name
16n5q
Lysin probability
99%
PhaLP type
endolysin
Probability: 89% (predicted by ML model)
Protein sequence
MKLDISKIKQVRLKDNQFFKEDSSKSQIYLHHTAGNGNAEGVSRYWNGNETRIGTAFIIGEDGTIVQCFSSKHWAWHLGIDNQ
Physico‐chemical
properties
protein length:83 AA
molecular weight:9467,5 Da
isoelectric point:7,93
hydropathy:-0,65
Representative Protein Details
Accession
4aDch
Protein name
4aDch
Sequence length
128 AA
Molecular weight
14381,90990 Da
Isoelectric point
8,41591
Sequence
MKLDISKIKQARLKDNQFFAEESPKTQIYLHHTAGNGNAEGVSRYWNGNETRIGTAFVIGEDGTIVQCFSSKHWGWHLGIDNQDFARNGANYVNLNKCSVGIEVCNWGYLTKKGDKFYNYAGGVIKSE
Other Proteins in cluster: phalp2_31541
Total (incl. this protein): 23 Avg length: 119,0 Avg pI: 8,72

Protein ID Length (AA) pI
4aDch 128 8,41591
1KWw7 115 8,70621
1Mlav 107 8,82226
1WYGc 103 8,79015
1eOOo 138 9,18554
22UQZ 156 8,54137
2Li2s 137 7,86174
2kU6T 132 8,95816
2rdHN 131 8,68906
2scln 127 9,27109
3SNAt 142 9,56770
44cj6 85 7,93801
556Jn 84 7,99848
5ExHU 120 8,60203
5bHtz 104 9,13125
5fZ0M 141 9,35464
5iXcX 105 8,89040
5v2nP 155 8,44460
6xmLR 101 7,90184
71CWN 124 9,25026
FFs5 102 9,20591
FxZK 117 9,09850
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6160
5DUGr
13 51,2% 80 2.745E-31
2 phalp2_16476
6Lmyy
2 34,4% 122 3.160E-19
3 phalp2_13409
4uFYn
31 32,9% 88 2.610E-08
4 phalp2_24542
7H8ln
5 24,1% 116 1.503E-06

Domains

Domains
Representative sequence (used for alignment): 4aDch (128 AA)
Member sequence: 16n5q (83 AA)
1 128 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4aDch) rather than this protein.
PDB ID
4aDch
Method AlphaFoldv2
Resolution 94.92
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50