Protein

Protein accession
7xzN5 [EnVhog]
Representative
EGF4
Source
EnVhog (cluster: phalp2_17898)
Protein name
7xzN5
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MIPVTKKLIKYNHGGVNNPKYIVIHETGNTDIGADAERHYRYFNGGDRGASSHYFVDDKQIIQVVEHKIQSWHNGKKYVSNPAVPQCNNSNSIGIEICVNQDGDYSKAVAHAVELTKKLMKEFNIPVDRVIRHYDSCGKQCPAKMLREPKIWSDFKNAIQGVQMDEEYEKAIQVLVEANIIGSPAAWQLDKINMNNIPSLVKKMAKYVEGKS
Physico‐chemical
properties
protein length:212 AA
molecular weight:23959,1 Da
isoelectric point:8,73
hydropathy:-0,54
Representative Protein Details
Accession
EGF4
Protein name
EGF4
Sequence length
304 AA
Molecular weight
32937,58690 Da
Isoelectric point
8,20600
Sequence
MLDIQQKIIKYNFTKGRTGVDYIVIHDTGNTNAGADAEAHYNYFGGANRNASAHYFVDDHQIIQVIEDSNAAWHCGDGKGKYGIKNSNSIGVEICVNQDGDYDKAVSNAVELAATLLKKYNLSLDRLKRHYDASRKTCPASMSANNWAAWELFVSAVKAALDGTTAAPSQKPVVFGHVANVQRWLNQEYGFALEIDNSAGPLTWTALRKALQMEINRQGGAGLEIDGSIGPKTKAAIIYTKRGTKGNITKLVQAALYCKGYDPNGLDGGFGPGTEAAVRKYQADHGLTVDGMAGRETQVSLFRV
Other Proteins in cluster: phalp2_17898
Total (incl. this protein): 130 Avg length: 288,5 Avg pI: 8,49

Protein ID Length (AA) pI
EGF4 304 8,20600
10DhW 236 8,67121
11wHP 271 9,08561
14xtl 235 9,39667
1a2Cf 299 9,20578
1dk8v 298 9,60832
1k3fX 272 5,48311
1mq4s 327 8,44776
1n3pN 357 5,92816
23hH6 283 9,27611
2Pqr7 282 8,93669
2c4jl 224 8,05321
2mx3K 359 8,23366
3ITVB 283 8,88376
3PPLV 257 5,10621
3ix0K 285 9,47655
3jMyE 253 8,95203
3kLOW 254 9,45063
3kYbu 345 6,93489
3kj8y 240 9,35528
3lE2C 264 8,55749
3lRAS 312 9,57357
3nI6v 255 9,27682
3qqcN 357 5,92816
3qxw0 254 9,34387
3tfdX 282 8,86023
3uGgX 327 8,58585
3ugwn 254 9,38216
3uwTZ 254 9,39351
3x8P4 312 9,53728
3xWgT 319 8,92753
3yOPr 283 8,76572
41jMv 236 9,65306
4j9MP 344 10,08068
4lMLO 221 6,29909
4xpjB 301 6,63870
5LAxi 361 8,71227
5MU3C 357 5,61384
5Pu1T 319 9,00083
5X20B 345 6,33575
5Y5w5 361 5,19340
5hTQ8 310 8,40979
5imOE 261 6,12226
5iquw 305 9,24349
603kN 328 8,44286
60oqv 283 8,75798
64tnX 271 9,08548
691TB 258 8,73471
6M0xE 227 6,20053
6bVFK 331 9,62205
6dWoE 304 8,85082
6eUsg 359 5,39006
6fyz0 283 8,60584
6icfF 332 8,45182
6kGQ4 312 9,40344
6kIJI 327 9,11075
6nVif 345 6,97877
6ntHT 272 9,91396
6nu13 253 8,90910
6pm3 236 9,12990
6q7n 280 8,57289
6qIzQ 351 8,97550
6rY17 353 9,17748
6u5wK 357 5,92810
72zEl 332 8,84991
76Nna 356 6,17603
7ATcr 301 9,85994
7BKUS 300 9,17142
7BdIt 234 8,70989
7Bmmg 224 9,36379
7BmoO 304 9,01624
7M4u9 271 8,86500
7MlR6 271 9,08555
7OgGM 282 8,74399
7Ukzv 282 8,40334
7YgdX 253 7,03697
7Z6c5 292 9,60252
7ZA4w 271 8,72240
7bg6E 240 9,04590
7cfRz 306 8,77481
7fnNv 234 8,94307
7gepF 304 8,94127
7htOn 272 9,21919
7lBwq 322 9,71888
7pFbd 259 7,65135
7po7q 258 8,56761
7qSZ3 235 9,15917
7qTuQ 288 9,19450
7rPqA 295 9,57499
7su15 314 8,54627
7su2N 317 8,30703
7uyhy 299 9,05802
7vDbl 254 9,26818
7vdyG 299 8,68262
7vzgc 249 7,73109
7wMj4 272 9,41685
7wfCi 368 9,53960
7yZoD 329 9,29926
7yZqC 330 8,69706
82oSS 295 9,34909
82sP9 212 8,87525
82t4W 264 8,35609
83ZnT 354 9,35509
844E 284 9,38964
84dJh 328 8,71788
85X8I 247 8,10879
88ezV 206 7,03959
8K7va 309 9,28869
8aVaA 253 9,04609
8cFiQ 252 8,45575
8ezJw 252 8,83025
8hjzt 266 9,09644
8idaU 361 8,77133
8ku0I 332 8,24507
8lqGy 351 9,00348
8pmoz 253 8,94301
8s7Yt 315 8,77010
9vgd 262 8,61460
K0FM 327 8,26145
d1Cx 237 8,13773
m15K 268 8,79621
n2ba 240 6,29721
ocji 247 8,35931
ogV8 245 6,73794
op7C 251 5,53796
p4JG 271 8,97975
wTD6 271 8,36395
xnkF 271 8,71582
A0A8S5SWU1 332 8,85507
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36746
6AI9B
16 35,7% 322 2.376E-88
2 phalp2_24835
7sl6i
39 34,4% 337 3.821E-69
3 phalp2_24299
3nIle
36 44,2% 190 3.061E-67
4 phalp2_5925
64hKp
120 30,0% 313 1.897E-60
5 phalp2_6450
8Jtqs
137 37,7% 302 6.622E-60
6 phalp2_29544
3KzTO
225 39,2% 242 1.629E-56
7 phalp2_26224
op4C
2 25,0% 360 1.977E-55
8 phalp2_32877
3ejEW
46 27,2% 334 2.128E-53
9 phalp2_36521
1qwup
277 29,4% 285 4.254E-51
10 phalp2_38692
MSLD
8 29,6% 314 2.756E-50

Domains

Domains
Representative sequence (used for alignment): EGF4 (304 AA)
Member sequence: 7xzN5 (212 AA)
1 304 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (EGF4) rather than this protein.
PDB ID
EGF4
Method AlphaFoldv2
Resolution 94.10
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50