Protein

Protein accession
4DnCH [EnVhog]
Representative
4Ncgb
Source
EnVhog (cluster: phalp2_2332)
Protein name
4DnCH
Lysin probability
91%
PhaLP type
endolysin
Probability: 96% (predicted by ML model)
Protein sequence
MTTSFYSQRDPLHAAKHYGLPQASRESTLGAYGCGITAIAQKLTLCGWPTTPLAVQQTLAEGRGFIPSGSYNYISWPRLPNLYPQMNYNGRQDFGAGIAPLPARFLTQIDARVDRGEPVIVYVDANRYAGGLQQHFVLIHGRDPHGAFHIVNPWNGLLQDLRPYGETDALAIRGFILLDLAIKPSLTT
Physico-chemical properties
protein length: 188 AA
molecular weight: 20707.3 Da
isoelectric point: 8.58
hydropathy: -0.17
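The length and hydropathy values above can be recomputed from the listed sequence. The following is a minimal sketch, assuming the hydropathy figure is a GRAVY score (mean per-residue hydropathy) on the Kyte-Doolittle scale. The record does not state which scale or tool was used, so the names `KYTE_DOOLITTLE`, `SEQUENCE`, and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumption: the record's "hydropathy" is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4DnCH as listed in this record.
SEQUENCE = (
    "MTTSFYSQRDPLHAAKHYGLPQASRESTLGAYGCGITAIAQKLTLCGWPTTPLAVQQTLAEGRGFIPSGS"
    "YNYISWPRLPNLYPQMNYNGRQDFGAGIAPLPARFLTQIDARVDRGEPVIVYVDANRYAGGLQQHFVLIH"
    "GRDPHGAFHIVNPWNGLLQDLRPYGETDALAIRGFILLDLAIKPSLTT"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


length = len(SEQUENCE)
hydropathy = gravy(SEQUENCE)
print(f"length: {length} AA, GRAVY: {hydropathy:.2f}")
```

The printed GRAVY value can be compared with the -0.17 listed above. A mismatch would suggest that the record uses a different scale or averaging method.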
Representative Protein Details
Accession
4Ncgb
Protein name
4Ncgb
Sequence length
150 AA
Molecular weight
16414.70170 Da
Isoelectric point
7.84176
Sequence
GAFGCAVTALAQYLTIVGLPITPDRVQDVLILGGGFKARDSYNFVNWPKLPELFPRFVYRGVADCPNTLAPIGVMTGIEDRLSRGQPVILYVDASAYERGLQQHFVLAIGCLESGAIAVANPWNGQRQDLRPYGKNDRSSICGVKWLDFV
Other Proteins in cluster: phalp2_2332
Total (incl. this protein): 6   Avg length: 175.3 AA   Avg pI: 7.08

Protein ID   Length (AA)   pI
4Ncgb        150           7.84176
1pkPh        187           5.45634
2Kruj        195           6.17666
7MHG         199           6.08077
itp5         133           8.31805
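The summary line for this cluster can be checked against the member table. The sketch below assumes that the averages cover the five listed members plus this protein (4DnCH, 188 AA, pI 8.58), as "incl. this protein" suggests. The function name `cluster_summary` is hypothetical.

```python
# (protein_id, length_aa, pI): the five listed members plus this protein.
# Assumption: 4DnCH is counted in the cluster totals.
members = [
    ("4Ncgb", 150, 7.84176),
    ("1pkPh", 187, 5.45634),
    ("2Kruj", 195, 6.17666),
    ("7MHG", 199, 6.08077),
    ("itp5", 133, 8.31805),
    ("4DnCH", 188, 8.58),
]


def cluster_summary(rows):
    """Return (member count, mean length, mean pI), rounded as on the page."""
    n = len(rows)
    avg_len = sum(length for _, length, _ in rows) / n
    avg_pi = sum(pi for _, _, pi in rows) / n
    return n, round(avg_len, 1), round(avg_pi, 2)


total, avg_length, avg_pi = cluster_summary(members)
print(total, avg_length, avg_pi)
```

Under that assumption the computed values agree with the reported total of 6, average length of 175.3 AA, and average pI of 7.08.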
Similar Clusters (pHMM search)
#   Cluster                # Members   Identity (%)   Alignment Length   E-value
1   phalp2_38584 (1ZtIJ)   25          31.5%          146                3.511E-21
2   phalp2_28654 (3nomK)   21          33.5%          152                1.520E-19
3   phalp2_19617 (4fSoV)   12          25.4%          161                2.080E-19
4   phalp2_14350 (4eunE)   39          31.7%          126                1.504E-16
5   phalp2_18175 (4yL66)   1           26.7%          101                4.703E-15
6   phalp2_5478 (3bJ6R)    1           25.9%          154                6.962E-13
7   phalp2_26623 (1nNB9)   2           20.8%          149                3.168E-07
8   phalp2_30247 (4fknK)   4           16.2%          154                7.974E-07

Domains

Unannotated
Representative sequence (used for alignment): 4Ncgb (150 AA)
Member sequence: 4DnCH (188 AA)
Axis: 1 to 150 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Ncgb) rather than this protein.
PDB ID
4Ncgb
Method AlphaFoldv2
Resolution 93.21
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
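The confidence bands in the legend map a per-residue pLDDT score to a label. A minimal sketch of that mapping follows. The legend uses strict inequalities and so does not say where scores of exactly 90, 70, or 50 belong; assigning them to the lower band is an assumption made here, and the function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence label in the legend.

    Assumption: boundary scores (exactly 90, 70, or 50) go to the lower band,
    because the legend leaves them unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Illustrative scores only, not values taken from this record.
labels = [plddt_band(s) for s in (95.0, 80.0, 60.0, 30.0)]
print(labels)
```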