Protein

Protein accession
47UBP [EnVhog]
Representative
NQqI
Source
EnVhog (cluster: phalp2_2920)
Protein name
47UBP
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSVKFIKPRRKVTKVYLHCSASDHPNHDNAATMDAWHKARGWSGIGYNFFIRKSGLIEIGRDINKIPAAQAGHNRGSIAICSHGLAIEKFTEEQFNSLRDISGQIDDAYNSRITFHGHCEVSAKACPVFDYKAVLQLDKHGHLGASTVSHQEPVKVVDGQWPELKSGAKGPAVERLQQLLNIKVDGAFGPKTARAVRAFKKSHGLFDSDVVAGHVWKLLENKGASS
Physico‐chemical
properties
protein length:226 AA
molecular weight:24817,9 Da
isoelectric point:9,43
hydropathy:-0,39
Representative Protein Details
Accession
NQqI
Protein name
NQqI
Sequence length
235 AA
Molecular weight
26823,12420 Da
Isoelectric point
6,06179
Sequence
MSLFTKPKREITKVFLHCSASNRPTHDNVEVIRSWHKQRGWSDIGYHFFIRADGTIEKGRDLESKPAAQKGHNTGSIAICLHGLYPQDFTVNQYDSVYGLCQAIDLAYDDRRISFHGHCEVSVKPCPVFDYRTVLNLDNLGLMFMDNKDDAFRPPLQLFNRGLEVVRLQRQLNLWLDEAKGEADPERLVVDGIFGQNTQIAVMQFQGFNELESDGIVGPRTRLKLPIVEPQEAAY
Other Proteins in cluster: phalp2_2920
Total (incl. this protein): 114 Avg length: 230,7 Avg pI: 7,10

Protein ID Length (AA) pI
NQqI 235 6,06179
17ovd 148 6,47205
1BZh 192 4,96792
1BdIp 192 4,96792
1C5Lz 241 6,55714
1C7Hc 192 4,96792
1CF6h 192 4,96792
1HN0U 152 5,54910
1HN4I 152 5,70990
1HOs2 137 5,86649
1ahia 199 4,78876
1s4O 199 4,50537
2Kbhd 192 4,96792
2QQDk 192 5,21904
2QQX1 241 6,85822
2UcrV 345 8,33204
2V6RN 156 6,69617
2aTkP 157 9,44792
2jAR0 342 8,89401
2jifz 274 7,73854
2jrSp 224 8,24913
2mKHJ 190 6,10027
33aDG 221 6,29466
33tFa 162 9,09567
3ENWV 165 6,64615
3T1PZ 238 5,97391
3Vp9c 204 4,59045
3XZe 194 8,55142
3aUGW 242 5,45094
3bKd9 243 6,54617
3iRNE 226 10,53505
4BlDR 143 9,84763
4DkI 188 8,68223
4FwAv 264 6,04093
4Nc4P 219 8,54762
4OJxn 308 4,63229
4OPjf 240 8,77803
4QCwb 347 9,31363
4WkaP 262 5,97983
4Wu9Y 258 5,03471
4X32L 238 6,91898
4dZmx 346 8,88292
4hXaF 234 8,97492
4uG4R 238 8,44189
4yJuN 223 6,10169
5dIeY 195 5,03357
5dS8b 236 5,76901
5mbT3 164 9,92273
5qYRn 248 9,18476
6AQQv 346 8,80298
6CVh2 177 9,25233
6MOx5 162 9,09599
6P31j 168 6,15858
6TuqV 269 5,68153
6Tx1E 267 5,77475
6VVFH 221 6,78802
6VVwz 220 6,49285
71PDZ 340 9,12797
7Bxk2 164 9,79670
7CRx4 196 5,49510
7KZGE 393 8,51565
7VgkU 238 5,96010
7e5rl 343 7,69790
7e6L6 353 6,26448
7hv6a 352 9,44347
7jTdX 162 9,33368
7jX5z 247 5,90895
7m6NF 348 9,22963
7nHWh 346 9,56474
7okYW 164 9,44870
7pood 345 8,27221
7w723 162 9,49872
7xQxX 349 9,14582
7y0Hk 337 8,33404
80YmM 141 5,32049
82I5U 203 4,67031
84Wup 143 9,83712
8AiGk 238 5,96010
8BZvE 199 4,68054
8CGgJ 238 6,85554
8Dzql 238 6,85554
8E5iP 248 6,25436
8FcrQ 248 5,68654
8FnCJ 248 6,13062
8Fvu8 165 6,64615
8Ges1 239 6,79188
8dL5V 232 8,79170
8kFuE 210 9,25104
8ktgF 241 6,85822
8tCc 248 5,68790
8wXwF 203 4,67031
8yTFS 231 6,24356
8zCkf 192 4,79235
8zNmx 199 4,80309
CzK6 238 6,78592
P56Y 227 8,70415
PaKR 223 6,74658
Ri6L 172 5,88405
TtgA 221 6,58681
VA5G 267 8,86938
ViVN 248 5,97647
XZOz 221 6,40242
dgcr 162 8,96879
dn8D 199 4,77183
dvQq 162 9,09567
dwCa 351 8,93637
gQiK 348 9,35941
hh40 266 6,04030
inyU 247 4,89107
ist4 131 8,72078
rxOn 232 5,51863
tX58 241 6,55765
A0A6J5MBI6 150 7,71836
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_30941
13o5z
7 60,0% 150 1.293E-64
2 phalp2_38947
3NmsY
2 49,0% 151 1.613E-54
3 phalp2_25618
48uUY
31 31,5% 244 8.011E-40
4 phalp2_36521
1qwup
277 33,1% 226 8.011E-40
5 phalp2_28171
8yYQh
287 34,7% 161 1.376E-33
6 phalp2_19675
4Bope
30 30,0% 223 4.230E-32
7 phalp2_35819
4CbZx
312 30,0% 193 2.545E-28
8 phalp2_22939
3bu4t
1 27,4% 226 6.735E-26
9 phalp2_9794
8qYns
5 36,6% 153 2.753E-24
10 phalp2_13210
2JwXt
44 25,9% 239 1.755E-23

Domains

Domains
Representative sequence (used for alignment): NQqI (235 AA)
Member sequence: 47UBP (226 AA)
1 235 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
47UBP
Method AlphaFoldv2
Resolution 87.93
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (NQqI) rather than this protein.
PDB ID
NQqI
Method AlphaFoldv2
Resolution 89.67
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50