Protein

Protein accession: 4K3Sl [EnVhog]
Representative: 4EipG
Source: EnVhog (cluster: phalp2_20589)
Protein name: 4K3Sl
Lysin probability: 77%
PhaLP type: VAL (probability: 98%, predicted by ML model)
Protein sequence
MTRALALNITAVDRATKVIQDLNKRVSQMTRPYQNMQRSLQQFGRAAGFGVVKEKLEGLTRGAKRVAESFAKIGAPLIALVGGGTIAGLFALTAGWARFGLQVSQTARILGVNTQELYNFQNAARLVGISGQTATQSFQSFADTLQDARWGRNQQAMGLLVGLGIHLKNTKAGSIDAMDALGQIADKIHSFQQSGRSGAARTLANQLGLTSLLPVLMQGRKALEAYEAQAQKLSGAMNWQQAAAAAMQWNRLHIAMEGVKNTIGAALLPAVTPLVQQFGAWIQANRTLIATDLGDFVKGLGSAFRGLTLETVLNDILAIVKGMLSLVTDVASATASLGGLKDVLIAVGVLWGTPKIVSFGLAIWTMTGYVNASRKALLAWSVARAAATGSNTAGISAKLLAAPTAAGLLGATGIGLVGGAAVYGGAKWWEHHQLSKLNNSRIAASPMAGAVMRYFQSQGWTRTQAAGIAANIGAESGFNPGAVGDHGAARGIGQWHANRQAMFNSWALRNRLPGLRQADMLEQLQFYNYELRHSGAGKRLAGTTNAYDAGAAVSLYGERPADAAGQAAMRGDLAQRLAGSAAPPVNLSVQTTVHRDGTATTRVTTPSGVKIVNTSPTAGVT
Physico-chemical properties
protein length: 621 AA
molecular weight: 65017,7 Da
isoelectric point: 10,54
hydropathy: 0,06
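The length and hydropathy values above can be recomputed from the protein sequence. The sketch below assumes that "hydropathy" means the GRAVY score, i.e. the mean Kyte-Doolittle hydropathy index over all residues. The page does not state which scale it uses, so treat this as an illustration rather than the database's own method. The helper names `gravy` and `parse_decimal_comma` are hypothetical.

```python
# Kyte-Doolittle hydropathy index per residue (Kyte & Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY).

    Assumption: this is what the record calls "hydropathy".
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = set(seq) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"non-standard residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma value such as '10,54' to a float."""
    return float(text.replace(",", "."))
```

To check a record, pass the full sequence string to `len()` and `gravy()` and compare the results with `parse_decimal_comma("0,06")` and the listed length, allowing for rounding.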
Representative Protein Details
Accession: 4EipG
Protein name: 4EipG
Sequence length: 648 AA
Molecular weight: 67101,58900 Da
Isoelectric point: 9,62489
Sequence
MPANKFNITIAATDKATVVANKVASSLKRMIFPAQVAGKTVGMIGKDADFRRVSTGMRNIAREAVGVVDNVSLANTALKGAVGLAGAGAFITSAKDWAAGTAGAARFAQTIGMSSQKLQTLVGVGQGFGIEAQAMTGAVKTLGNTFEDALYGRNQDAMVMMNRLGISLHRTKEGAVDANRALDDVSDAIARNKGNPQVQQRIASLFGVSELLPMLRNGHTAFRQYEAAVARTGAVRSPAMEENAAKLQLAFVMAGKTLDGIGNRIENWIAPSLTKVLDGFTSLAEKHPAAATETVGAGSVVGTALGGFFGYKMVSSVARGVKSLLGMNTATKATEVAMERVANRAGPMLLTRIATLAEAVGLDGLAGAVMSLGIALEGLPLLAIGGAAAAAGGFAYMTYRDWNKLPKSSEGAGSPTKATGSQKGAIQRDVSYFTKQGWSPAQAAGIVANLFRESGLNSKASGDHGLAYGLAQWHPERAANFAAWAGHGLKDSTEQEQLAFVQYELTQGTERSAGAALRRAKTPEEAGATISHRYERPLDDGGSAASRAADARAIFDRLPSIASQPVHLEQLAPPAPTSPAQIEQLGAPTAPTSAVQIEQLAAPPAPASTSKIEVEFKNAPPGIRVTTVKPGTNPPAVKVAHAMPGTQQ
Other Proteins in cluster: phalp2_20589
Total (incl. this protein): 89; Avg length: 638,5; Avg pI: 9,27

Protein ID Length (AA) pI
4EipG 648 9,62489
13kfm 565 9,34825
16lL4 566 9,25200
1H0I4 662 9,81152
1fTzJ 693 10,52390
1hReH 635 9,98675
1hSUE 600 9,05808
1lPIh 676 9,27695
1qnHz 669 9,21912
20Cwa 669 9,21912
219hC 661 8,92031
2AVUP 695 9,71759
2AZEt 679 9,69303
2TVpB 679 8,95036
3Ac4m 562 9,26980
3GJOP 660 9,29765
3NMKL 678 9,38333
3NRrX 600 9,43838
3Oqzt 648 9,85111
3T9ro 650 10,14244
3V46O 646 9,70760
3fliU 675 7,14798
3go9y 601 9,61264
3iMeF 679 9,68942
3zM4Q 539 9,97108
4C3mn 514 8,07017
4DYMg 577 9,11011
4EJcv 637 9,97070
4ES5O 699 9,51639
4EbSX 626 9,65345
4Ekzx 468 8,00325
4ICQt 675 8,78319
4K1X6 617 10,20040
4T4Ep 638 10,42539
4T5fF 618 10,28027
4Y0VT 626 8,44209
4Y3gm 543 9,24872
4qRe7 667 9,53031
4qRsq 701 8,32359
4ui6r 699 9,35367
5CluM 519 9,82390
5EY4H 590 8,59913
6CZKW 676 9,27695
6CZP4 589 8,52435
6DHEQ 597 7,95122
6DP8q 676 9,32788
6DZRA 708 6,34996
6DmbD 662 9,27682
6FqIj 492 9,11108
6I0Qt 618 9,81101
6I2no 774 9,01979
6JbC7 688 6,66019
6KHel 710 8,92263
6Rwlh 671 9,18083
6SkC7 575 7,00634
6SxyX 712 9,45237
6SyAK 679 10,07269
6wTcA 667 9,80430
71TuD 676 9,38378
71YAx 703 9,69200
76SPy 635 9,84286
7ATvj 574 7,85762
7IkVt 576 9,21558
7XRmN 707 9,49260
7YdvG 698 9,50781
7behc 636 9,43503
7gTCQ 734 9,47319
7k3PM 672 9,70921
7lDve 543 9,54572
7lIkx 639 9,70038
7nX1N 676 9,32782
7owVA 697 9,49531
7prAN 564 7,87296
7pxln 505 5,90343
7tpJJ 674 10,27505
7tpRZ 685 10,00519
7ugGO 662 9,09380
7vKVF 626 9,18425
7wHLP 617 9,96045
7xHnp 618 9,69161
7y7gY 607 9,82925
OVi4 660 9,29765
eXLN 635 9,78412
fiS5 666 9,92441
frrr 676 9,44638
gOY0 617 9,98024
gPrR 695 9,71759
hiFl 588 9,76556
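
The summary line for this cluster (total, average length, average pI) can be recomputed from the member table. The sketch below parses rows of the form `ID length pI`, where pI uses a decimal comma, and averages the two numeric columns. The function names are hypothetical. The table above appears to omit this protein (4K3Sl, 621 AA, pI 10,54), so that row would presumably have to be added to reproduce the stated total of 89.

```python
from statistics import mean


def parse_member_rows(text: str) -> list[tuple[str, int, float]]:
    """Parse 'ID length pI' rows; header and malformed lines are skipped."""
    rows = []
    for line in text.strip().splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # e.g. the 'Protein ID Length (AA) pI' header
        protein_id, length, pi = parts
        try:
            rows.append((protein_id, int(length), float(pi.replace(",", "."))))
        except ValueError:
            continue  # non-numeric fields
    return rows


def summarize(rows: list[tuple[str, int, float]]) -> dict:
    """Member count, mean length (AA) and mean pI for the parsed rows."""
    return {
        "total": len(rows),
        "avg_length": mean(r[1] for r in rows),
        "avg_pi": mean(r[2] for r in rows),
    }


# Two rows copied from the table above, for illustration only.
sample = """Protein ID Length (AA) pI
4EipG 648 9,62489
13kfm 565 9,34825
"""
print(summarize(parse_member_rows(sample)))
```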
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_32035 (5Bcj7) 44 29,9% 657 4.566E-136
2 phalp2_19716 (4LhjK) 50 26,4% 658 1.310E-112
3 phalp2_17054 (3RVGN) 12 24,5% 561 1.167E-75
4 phalp2_35835 (4FXsJ) 127 25,7% 568 5.354E-70
5 phalp2_30566 (5H6Eo) 2 22,7% 588 1.016E-66
6 phalp2_2735 (7tm77) 47 22,6% 663 6.192E-66
7 phalp2_32654 (25kP9) 2 25,9% 670 8.368E-66
8 phalp2_32382 (wgVF) 186 25,1% 529 1.696E-64
9 phalp2_9214 (6N8ZA) 134 24,5% 542 2.535E-63
10 phalp2_36112 (6IjI9) 1 20,9% 683 5.093E-62

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome [NCBI] UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4EipG) rather than this protein.
PDB ID 4EipG
Method AlphaFoldv2
Resolution 55.38
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
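
The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. The legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so assigning them to the lower band is an assumption made here, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Return the confidence band for a pLDDT score in the range 0-100.

    Assumption: boundary values (exactly 90, 70, 50) go to the lower band,
    because the legend leaves them unspecified.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Example scores chosen for illustration, not taken from this record.
for value in (95.0, 82.5, 61.0, 34.2):
    print(value, plddt_band(value))
```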