Protein

Protein accession
47dmD [EnVhog]
Representative
46M5C
Source
EnVhog (cluster: phalp2_34856)
Protein name
47dmD
Lysin probability
89%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MKKLIFLLAAMLVSVSFVTVAEARPKDKQYHYTKVQKVKKYKVVKVVTEKDSKGQILQQTVYHDDSVAGYWSFEYNRHKPPITGPVEKTVGDLANKATHYMGATASQLGLPRRLWCADFMNMLVGGTDRRAISYANRGTKAQHGCVNCIAVTTRSGGAHVGVVSGYDEKGNPIIVSGNHNRRVGVATYDKRRVIAYRYI
Physico‐chemical
properties
protein length:199 AA
molecular weight:22166,3 Da
isoelectric point:9,90
hydropathy:-0,33
Representative Protein Details
Accession
46M5C
Protein name
46M5C
Sequence length
215 AA
Molecular weight
23941,52420 Da
Isoelectric point
9,75086
Sequence
MKKLIFLLVAILVGINCLTAANAEPKDKPNHYATKVHKKVKKQKVEKPAKIDYRQIKLNEMMAIYDENSSGGFFALEKAREEYRQNVKVVKVKKPEVKQVLTQQETKDLVERASKYLGFGPNQLGLPRNLWCADFINMLVGGHSRAAASYLSRGTYAKHGCVNCVAVLTRRGGNHVGVVSGYDDDGNPIIISGNHNGVVGIGVYRRERVIGYRTI
Other Proteins in cluster: phalp2_34856
Total (incl. this protein): 18 Avg length: 210,4 Avg pI: 10,08

Protein ID Length (AA) pI
46M5C 215 9,75086
20gkn 203 9,77097
29ih 204 9,65609
2ZDt6 211 10,28833
2Zm76 215 9,64861
30eIf 201 9,92705
46s0i 199 9,89746
48LMS 215 9,72069
497Hg 200 10,00983
4aTAZ 226 10,79035
4beiX 226 10,89891
4rkGX 215 9,97618
51O1t 214 10,32740
6LGYD 218 10,60816
6TEh 209 10,08055
6U6W 218 10,09016
ZMEp 200 10,07333
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_13958
gdy7
74 31,7% 214 1.964E-34
2 phalp2_22172
6KFgq
3 24,3% 238 1.071E-07
3 phalp2_36789
6KVwf
51 27,0% 174 1.213E-05

Domains

Domains
Disordered region
Unannotated
Representative sequence (used for alignment): 46M5C (215 AA)
Member sequence: 47dmD (199 AA)
1 215 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (46M5C) rather than this protein.
PDB ID
46M5C
Method AlphaFoldv2
Resolution 68.67
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50