Protein

Protein accession
63477 [EnVhog]
Representative
1rkWk
Source
EnVhog (cluster: phalp2_29740)
Protein name
63477
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKKIIDLSKHNTVTDWDAVKKSCDGVILRCGYRGYTEGKIKTDAKLEEFAKECAKRKIPFGLYFMSQAITVAEGREEAEYTLKCAEKYGATMPLFIDSEDGDGTEKKVRADGLDKQTRTAVCKAFCETMKKAGRKAGVYASESWFNDNLNFLTLQQYLAWVAKYGANTGAPCTKVKLSKCDMWQYTSKGSVPGVKGKCDVSELYEESNMDEWLEKEKPEEVKEEEKKKGDGYMFDAPVLGGGMKGNAVLLWQKLLAGCGYDLGNDGRAGNLSGEFNEGTYIATLDYKKHVGISIGADGTDGTVTKETWAAMLGI
Physico‐chemical
properties
protein length:314 AA
molecular weight:34636,1 Da
isoelectric point:5,96
hydropathy:-0,52
Representative Protein Details
Accession
1rkWk
Protein name
1rkWk
Sequence length
277 AA
Molecular weight
30692,80970 Da
Isoelectric point
8,86261
Sequence
LKKVIDLSKHNVITSWAGVKAACDMVVLRIGYRGYTNGKIAMDQKFKENLAAAQKIGIPVGFYFFPCSITDQEAIEEADFCAEAVKGLPIVLPLFADSEVSDAVHHAGRSDHLSVQQRTHLLKVFCDRLQSKGIPAGIYASTSWLNTKLDMSQLPYTVWVAQYAPTCKYTGKYLLWQFTSRANVAGVLKPCDMSYVIDAAEKNPYREPVTTVRKGDSGAIVKWVQWYVRAKEDGVFGGNTESAVIAYQKKAFPDDKSEWDGEVGPHTKAAFKKNGRA
Other Proteins in cluster: phalp2_29740
Total (incl. this protein): 100 Avg length: 310,9 Avg pI: 8,20

Protein ID Length (AA) pI
1rkWk 277 8,86261
1GNHN 308 8,08886
1GTmM 306 9,57325
1gAuw 293 9,12964
1jHkB 315 8,64864
1qxoK 309 5,67483
21sIY 307 9,44631
23BOu 279 5,08802
23CKt 280 5,32862
24kMV 350 6,82224
3AsAc 381 5,39654
3VEIT 304 9,56042
3VF3S 306 9,35464
3WDH4 303 9,34954
3WIrm 305 9,40112
3WNdx 312 9,11198
3ZUL0 310 7,54114
3dYdh 210 7,61344
3fSyQ 279 5,15521
3fTEY 262 5,66590
3h4W6 279 5,02260
3h5um 295 9,50846
3iBfO 312 9,20758
3ivCl 343 9,57125
3l58M 371 8,41656
3mraG 355 6,76437
3nW0b 308 9,33239
3sVfo 314 9,20372
3sYiQ 383 5,44730
3ts2c 312 8,99194
3yeRN 351 8,15855
3ykQ3 233 8,48109
40lyq 299 6,27260
41bLJ 260 9,79960
4Qmlj 295 8,90813
4ynCz 286 8,12439
5N6vp 351 8,45311
5ONZV 356 8,75772
5Qqix 341 5,67056
5QsFT 312 8,92147
5RX9b 312 7,64004
5V5iS 312 8,92786
5VvHB 296 8,86616
5XejV 279 9,38642
5YenH 312 8,92122
5Yeor 317 8,93205
5Z2y6 313 8,91097
5Zfvs 285 9,72565
5jBaB 286 8,75444
5jDid 298 9,36476
62v8g 313 8,87074
634bZ 356 8,44002
64Lep 312 8,84108
650IF 356 8,44002
65Atw 314 9,00696
65W7c 312 8,84108
65dkF 250 6,89243
65eEB 306 9,59072
65frD 278 7,49589
66EOA 351 8,15855
66OVR 312 8,92779
676zJ 351 8,45311
68oDe 286 9,53837
68v7H 310 9,38848
68y08 310 9,37875
6950x 312 8,75946
69kZ0 260 8,52132
69sRi 313 8,87074
6YmRO 361 8,89981
6cHp8 361 8,65735
6caW5 382 5,53290
6gDyk 355 6,24464
6k93h 312 8,99871
6kTGR 312 8,84108
6p1sN 310 9,38842
6q8m2 355 6,76437
6ra00 312 8,64284
6sL8S 314 9,25781
6sYyV 310 8,89730
6vb3X 287 9,76930
6vxYU 312 8,47284
7DQfp 283 6,09669
7Kwpm 341 5,24257
7q2Eb 284 9,00438
7rVwp 285 7,50391
7vFFn 329 5,45708
7xYCP 381 5,67415
80pop 312 8,64265
83L37 251 8,36614
8brwW 306 9,16787
8bs8i 288 6,83662
8dfjr 312 8,91322
8f6O4 251 8,55445
8fD95 305 9,28179
8gkCU 306 9,40905
8jG4Y 313 8,56322
8oc6I 312 8,72265
EFTL 297 8,62782
qkMq 371 7,54995
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6199
5X5xO
102 42,7% 199 1.644E-77
2 phalp2_5668
4yhqW
7 44,2% 183 3.799E-73
3 phalp2_2537
6a7No
192 30,0% 286 4.661E-69
4 phalp2_26548
858CP
728 33,3% 219 2.751E-67
5 phalp2_3425
3TK4D
923 39,3% 201 6.331E-66
6 phalp2_10110
c1k0
16 32,6% 236 4.508E-60
7 phalp2_33310
6kV0o
68 32,5% 209 7.550E-59
8 phalp2_19565
3TIJJ
398 30,7% 244 1.931E-58
9 phalp2_14288
3xQM7
130 27,3% 307 1.931E-58
10 phalp2_24035
8aWkT
56 29,3% 215 3.612E-58

Domains

Domains
GH25
Unannotated
Representative sequence (used for alignment): 1rkWk (277 AA)
Member sequence: 63477 (314 AA)
1 277 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1rkWk) rather than this protein.
PDB ID
1rkWk
Method AlphaFoldv2
Resolution 95.32
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50