Protein

Protein accession: HkzJ [EnVhog]
Representative: 4f5QX
Source: EnVhog (cluster: phalp2_31553)
Protein name: HkzJ
Lysin probability: 98%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MSYQELTQFNSPNYTPEIQVSAVYGMARAVEGVTYHWWGSNSDFMSIVNYLCRANGNTSAHTVGEAGRVAWIIDAVNAAWHAGNARGNATTVGYECNTRLSDGDYETMGEFHYDMEKAYGRRLNIYVHKEWFNTSCSPIDKNRIRAIADRYHAGGGSRPTVNETQIREVFRSILGREVDPEGLRHYLGQAAKGWSIDQIRADVNNSQEAHQRRAELARQAEELKRSEWVRNLNDIEDIKLVVAPVAGLRVVNMVTMEAFGNVIPKGTVIDIAKETVVQGKKYYLSQYAVKNNKPFGIAATELVAPADPNKDKPAWQKNLKDIADQDFWTRSECEVTDLTTGKLAKKLPMGTKVRVTHVTKLVDDDLMVLEGGTLAIDKLYLSDKPIDSLLEKRVSALEAIVNKIIEFLTNLFKNFNK
Physico-chemical properties
protein length: 417 AA
molecular weight: 46837,5 Da
isoelectric point: 6,78
hydropathy: -0,42
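The hydropathy figure above is not defined on the page. A common convention for a single per-protein hydropathy value is the GRAVY score, the mean Kyte-Doolittle hydropathy index over all residues. The sketch below assumes that convention (the record does not confirm it); the function name `gravy` is illustrative and not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy (GRAVY) of a protein sequence.

    Assumption: the sequence contains only the 20 standard one-letter codes;
    any other character raises KeyError.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First residues of the HkzJ sequence, used only as a short demo input.
    print(round(gravy("MSYQELTQFNSPNYTPEIQV"), 2))
```

Passing the full protein sequence from this record to `gravy` gives a value that can be compared with the reported hydropathy, keeping in mind that the page writes decimals with a comma.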
Representative Protein Details
Accession: 4f5QX
Protein name: 4f5QX
Sequence length: 384 AA
Molecular weight: 42315,17970 Da
Isoelectric point: 5,44150
Sequence
MASYKYITNYDSPNYTPGSSVKGVFGYPREIKGFTYHWWGDPNNHPTFEGVVSWLCRNGGNTSAHYVVEAGRVACIVSPYDAAWHSGNALGNATTLGIECNPRALDGDYTTIAQFSAQLIDAFGDRLKYKHSDWQATQCPGVYDIGRIDRESYDWISNAEWGDVSPKTSTPIPTPPPVVPAPVPSPIKAEWVVNLKDYKGPQLQVVKADGARRVNLITGEEFSEVIPRGTNIDIVKETKVSGVLYYISHYSSNANAAIGIRADAMGIPATPPVVEKPEWLNNLEDITDQVFWTRSETSVLNLADGTTVKTLPINTPVRITHATRIVGNDILVLEGGVTGIDKLYLSDKPITNPDSDLAIETNRIVKFIFDIVTKILDKLNNIFR
Other Proteins in cluster: phalp2_31553
Total (incl. this protein): 101; Avg length: 402,2 AA; Avg pI: 6,65

Protein ID Length (AA) pI
4f5QX 384 5,44150
14kes 463 5,74002
18Xtk 458 7,06039
1ISbq 476 7,64811
1L5yV 474 7,63947
1Lloa 469 6,28562
1Lqdu 470 5,12929
1cJwb 242 5,38080
1cvwz 424 7,15514
1nsNI 242 5,41564
2bnfp 421 5,76401
2f35g 425 5,30873
3P6Jv 320 5,33084
3bxi9 448 8,97988
3erDP 473 6,18519
3rl83 477 8,17177
3xOnt 449 5,57104
4FP4O 433 5,54018
4FUuc 288 7,64811
4Jj53 416 8,64555
4YIIf 469 6,28681
4gLAX 474 6,36178
4kSWw 318 5,90406
4kwHS 474 6,97303
4uHKq 467 9,45547
5MZTr 314 5,52727
5NIBR 416 7,66891
5NvjF 416 7,66880
5TFTc 312 5,27849
5V0DX 316 5,52011
5WPAK 421 6,24009
5ZvWK 416 7,66880
61dGs 420 5,98164
62mrj 314 5,69347
63bha 421 6,10481
68UMO 419 6,53122
6O5U 468 5,48322
6Szl7 474 9,07407
6UbZg 458 6,05480
6YZD6 417 7,12525
6YkdX 417 6,77392
6Z31w 443 6,81348
6ZDGy 416 6,53270
6ZlLW 420 5,76117
6Zyqq 416 7,12468
6aFt8 416 7,12525
6arYf 416 6,34348
6ax8e 312 5,60094
6dnWw 416 7,12417
6lXc8 314 5,64607
6nLnt 416 6,39162
6pXAq 416 7,66880
6tcTZ 421 5,76600
6thHz 314 5,68926
6wRFo 416 7,66880
70CEE 314 5,27849
7MbM5 417 7,12559
7NEFI 421 6,23782
7ULmo 307 8,48857
7XKvi 475 6,11613
7YrQS 314 5,38609
7ZB6I 416 6,53276
7ZC1z 375 8,15069
809Sb 569 6,50155
82oZi 375 5,66852
89G6Z 282 8,50178
8IAfy 462 9,51787
8aEo1 304 8,16784
8b0n9 421 5,66863
8e2US 477 6,11152
8eAKQ 420 5,86706
8ew8b 314 5,28667
8famz 553 6,30091
8n209 334 5,40740
8qVUG 277 8,52680
8qWg1 416 6,77540
8uHH0 464 9,07343
BGfb 314 5,64203
BuUA 416 7,66880
DE64 416 6,34388
HJvB 416 6,34479
Iknf 468 5,62595
MmEU 416 7,66880
Nfah 416 6,77557
O6rY 421 5,87109
OINL 416 7,66880
OajF 308 6,01910
Oh3d 308 6,25214
btjg 478 6,56731
dEp4 523 6,17245
e6uF 349 6,60926
jlRb 469 5,70711
p7rD 416 6,53270
pNM2 405 6,38912
t0KU 291 5,11746
tC3e 287 8,24275
wZpf 348 8,49740
xAcT 314 5,40155
ygIc 417 6,77392
A0A2H4PI22 462 9,51787
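The cluster summary (member count, average length, average pI) can be recomputed from the member table. The sketch below assumes rows of the form `ID length pI` with decimal commas, as listed above; `parse_cluster_rows` and `summarize` are hypothetical helper names, and the sample holds only the first three rows of the table. Whether the page's own averages include the current protein (HkzJ), which is not listed in the table, is not stated.

```python
from statistics import mean


def parse_cluster_rows(text: str) -> list[tuple[str, int, float]]:
    """Parse 'ID length pI' rows; header and malformed lines are skipped."""
    rows = []
    for line in text.strip().splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # e.g. the 'Protein ID Length (AA) pI' header
        protein_id, length, pi = parts
        try:
            # The page uses a decimal comma, so convert before parsing.
            rows.append((protein_id, int(length), float(pi.replace(",", "."))))
        except ValueError:
            continue
    return rows


def summarize(rows: list[tuple[str, int, float]]) -> dict:
    """Return member count, mean length, and mean pI for the parsed rows."""
    return {
        "count": len(rows),
        "avg_length": mean(r[1] for r in rows),
        "avg_pi": mean(r[2] for r in rows),
    }


SAMPLE = """\
Protein ID Length (AA) pI
4f5QX 384 5,44150
14kes 463 5,74002
18Xtk 458 7,06039
"""

if __name__ == "__main__":
    print(summarize(parse_cluster_rows(SAMPLE)))
```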
Similar Clusters (pHMM search)
# Cluster (protein ID) # Members Identity (%) Alignment Length E-value
1 phalp2_10949 (4Jkr6) 29 27,5% 316 6.565E-58
2 phalp2_16214 (4K5QO) 38 23,7% 299 1.204E-22
3 phalp2_1799 (36BX3) 4 23,5% 301 9.842E-21
4 phalp2_34085 (8plKk) 11 18,9% 401 1.319E-20
5 phalp2_872 (8dsHX) 35 23,8% 294 2.443E-19
6 phalp2_36626 (25FVY) 29 19,8% 362 2.542E-17
7 phalp2_6714 (1Z5Qj) 1 19,1% 266 3.705E-08

Domains

Regions shown: Ami2; Unannotated (2); Disordered region
Representative sequence (used for alignment): 4f5QX (384 AA)
Member sequence: HkzJ (417 AA)
Axis: 1 to 384 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01510

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4f5QX) rather than this protein.
PDB ID: 4f5QX
Method: AlphaFoldv2
Resolution: 84.52
Chain position: -
Model confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
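
The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows; the function name `plddt_band` is illustrative, and because the legend uses strict inequalities on both sides, the handling of the exact boundary values 90, 70, and 50 (assigned here to the lower band) is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend.

    Assumption: scores exactly equal to 90, 70, or 50 fall into the lower band,
    since the legend does not say which side the boundaries belong to.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (95.0, 75.0, 60.0, 30.0):
        print(value, plddt_band(value))
```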