Protein

Protein accession: 47p68 [EnVhog]
Representative: 3nCxR
Source: EnVhog (cluster: phalp2_4499)
Protein name: 47p68
Lysin probability: 99%
PhaLP type: endolysin (probability: 96%, predicted by ML model)
Protein sequence
MSASADKVVAIALKEVGYKEKPVNITKYGAWYGMDGQSWCAMFVSWCFNQAGLTSLIAAGPKGYAGCQTFEAWAKAHKLIVPTTTVQAGDILLFDFYKSGVAEHTGIATGGFDPHTHLVPTVEGNTAAENAGSQANGDGTYIKHRAISTIRAVVRPKYPN
Physico-chemical properties
protein length: 160 AA
molecular weight: 17070,2 Da
isoelectric point: 8,41
hydropathy: -0,08
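The values above use a decimal comma, and the stated length can be checked directly against the listed sequence. The following is a minimal sketch, not the database's own method: `parse_decimal_comma` and `gravy` are hypothetical helper names, and the sketch assumes that the reported hydropathy is a GRAVY score (mean Kyte-Doolittle hydropathy per residue), which the page does not state.

```python
# Sequence of 47p68 as listed on this page.
SEQ = (
    "MSASADKVVAIALKEVGYKEKPVNITKYGAWYGMDGQSWCAMFVSWCFNQAGLTSLIAAGPKGYAGCQTF"
    "EAWAKAHKLIVPTTTVQAGDILLFDFYKSGVAEHTGIATGGFDPHTHLVPTVEGNTAAENAGSQANGDGT"
    "YIKHRAISTIRAVVRPKYPN"
)

# Kyte-Doolittle hydropathy scale (assumed to be the scale behind "hydropathy").
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '17070,2' to a float."""
    return float(text.replace(",", "."))


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues of the sequence."""
    return sum(KYTE_DOOLITTLE[residue] for residue in sequence) / len(sequence)


if __name__ == "__main__":
    print("length:", len(SEQ))
    print("molecular weight (Da):", parse_decimal_comma("17070,2"))
    print("GRAVY (assumed definition):", round(gravy(SEQ), 2))
```

If the computed GRAVY differs from the listed hydropathy, the page is probably using a different scale or definition.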
Representative Protein Details
Accession: 3nCxR
Protein name: 3nCxR
Sequence length: 227 AA
Molecular weight: 25246,41610 Da
Isoelectric point: 8,91915
Sequence
VKIVDIANTEIGYLEKKSNSQLESKTANAGSKNFTKYGEWYGMNGQPWCAMFVSWCADRAGIGTDIIPKHASCAVGIKWFKERGLWADCKGYKPKEGDIIYFQSGANNRHVGIVTKVTDERVYTVEGNTSGGSTMISNGGSVAAKNYALSYSKILGYGKPKYAEDDDMTGKEIFEKLQEYLRTQKVPDWAKDELREAINLGITDGKNPCELIPRYQAAIMAKRTVRK
Other Proteins in cluster: phalp2_4499
Total (incl. this protein): 65; Avg length: 228,6; Avg pI: 7,01

Protein ID Length (AA) pI
3nCxR 227 8,91915
12FMl 158 7,05033
12Ild 160 9,15865
12XaF 161 8,69345
13Ifp 246 8,44847
13bdM 188 6,12067
18Ldr 161 9,11546
19g6A 254 4,63882
19nM5 211 8,61654
1DKIh 151 9,21216
1jFNs 220 8,65722
2cVfG 151 9,21216
3WEpH 246 7,71546
3gVRW 244 8,79769
3iakf 237 4,97122
3icyS 214 8,51448
3vSwP 251 7,02140
406nv 270 4,91068
407Wg 233 5,82693
40aqy 240 5,99182
40iDg 268 8,83231
40n5J 268 4,84305
46YXl 160 7,73467
48ZBz 156 8,97569
4JZFq 244 9,05105
4JjiT 262 6,93540
4L6p0 305 4,61910
4L7t4 188 9,09715
4Ofd5 268 4,89500
4kdvK 270 4,85743
4koYw 269 5,17840
4kqGM 257 4,74989
4yb04 222 5,05125
5VFn1 275 8,15385
5cY0E 151 9,06214
61glN 184 5,47663
6GnyC 151 9,30100
6PtUj 252 6,31370
6TaUJ 273 9,58505
6Xpqc 268 6,52355
6okC 252 9,36656
6omnr 254 7,80611
71HiM 252 4,80246
79KjX 214 6,41356
7DIVK 264 4,59955
7DSVm 267 5,73849
7G6w7 233 5,97391
7WFh9 215 8,94417
7WiRc 275 8,15385
7Xs8k 270 6,32336
7ZDES 220 5,32356
7q0JS 228 6,20667
7q0y0 227 8,58643
8522V 246 6,23185
86eJn 229 4,49474
86rAg 246 6,12630
8oITr 246 6,23185
8qPqv 258 4,30887
8qlt0 249 6,23185
8rHw8 206 8,29316
8rQLz 208 6,62364
8rzWC 246 6,12630
IA2r 151 9,21197
A0A8S5PY01 326 4,13188
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_25538 (3nJBF) 256 40,0% 205 3.295E-61
2 phalp2_20025 (1n2Uq) 6 41,1% 197 5.616E-57
3 phalp2_1959 (4y5vj) 188 43,8% 171 2.702E-56
4 phalp2_34753 (3g9I4) 5 43,6% 172 3.699E-56
5 phalp2_774 (21gG8) 241 43,3% 187 1.300E-55
6 phalp2_16018 (3yVVm) 206 33,4% 215 3.007E-51
7 phalp2_11549 (ZJ7o) 17 49,0% 165 5.633E-51
8 phalp2_27350 (4yaBH) 91 40,8% 186 1.977E-50
9 phalp2_14678 (6cDBc) 13 46,7% 156 6.936E-50
10 phalp2_1568 (7XJtL) 188 43,5% 170 6.936E-50
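
The numeric fields of each hit mix a decimal-comma percentage with an E-value in scientific notation, so they need slightly different handling when read programmatically. The sketch below illustrates one way to parse them; `Hit` and `parse_hit_fields` are hypothetical names introduced for illustration, and the input string is the numeric part of the first hit.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    members: int            # "# Members" column
    identity_pct: float     # "Identity (%)" column
    alignment_length: int   # "Alignment Length" column
    e_value: float          # "E-value" column


def parse_hit_fields(text: str) -> Hit:
    """Parse the four whitespace-separated numeric fields of one hit."""
    members, identity, aln_len, e_value = text.split()
    return Hit(
        members=int(members),
        # Identity uses a decimal comma and a trailing percent sign.
        identity_pct=float(identity.rstrip("%").replace(",", ".")),
        alignment_length=int(aln_len),
        # E-values already use a decimal point and exponent notation.
        e_value=float(e_value),
    )


hit = parse_hit_fields("256 40,0% 205 3.295E-61")
```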

Domains

Domain tracks: CHAP, Unannotated
Representative sequence (used for alignment): 3nCxR (227 AA)
Member sequence: 47p68 (160 AA)
Axis: 1 to 227 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF05257

Taxonomy

Phage: Unknown from Metagenome [NCBI]; Taxonomy ID: UNKNOWN_ENVHOG; Lineage: No lineage information
Host: No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3nCxR) rather than this protein.
PDB ID: 3nCxR
Method: AlphaFoldv2
Resolution: 93.76
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
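
The confidence bands above can be expressed as a small lookup. This is a sketch under stated assumptions: `plddt_band` is a hypothetical helper, the legend's strict inequalities leave the exact boundary values (90, 70, 50) unassigned so the sketch places them in the lower band, and it assumes the 93.76 listed under Resolution is a mean pLDDT for the AlphaFold model, which the page does not state.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {score}")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: a score of exactly 50 falls in the lowest band.
    return "Very low"


if __name__ == "__main__":
    # 93.76 is the value listed for the representative 3nCxR (assumed to be pLDDT).
    print(plddt_band(93.76))
```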