Protein

Protein accession
2JOhx [EnVhog]
Representative
4S1ms
Source
EnVhog (cluster: phalp2_18234)
Protein name
2JOhx
Lysin probability
68%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MARDPARLRKAYEVKLGKNLVDKLSDAQIAQLSKYYNSLNEREQSKIDSELIQGRSNDFTDIARGLAGEFDEKGNIRNITKQNVKTAKKSNKKQQEIPDGLDDLLKEIRDDVDTLKKEKGDQKKGKTTAIVKSVTQKKTKPSKSEDIDPRILSLLGLEDVSDLDYDTYKTLLKERMMAARMSGSKIPTEESELIDDEYKKVKSRSGRFKPKSRKKTVKTSNFVNKKPKSKPTGPTKVQTDKLLPSSTSASTPESVKVDLQEDVQEKLVPISKSLSNIESNLQKLLKINQDKLELEKEAARKLAAKEETEGFREKEAKLEEGDKKKGKKIEKALEPVKGIFDMIGDFFKNILLGGAVNLIMDIVTNPGKYLKPIIDFGNFIVDFINDKIIKFINDIVFAPINAYIGLWNKAFNELEWALKQIAKVIPGTPTPKLPRIPIVAIPNIPQIPYPQWMQQQEGGGQVIDIKNLSLFDGGAIDKMTGMTIKGMGKDTQLIAAQPGEIMMSKKAVDMYGADNLLAANALAGGSNKPKFGKIQGFQGGGQVGKVIIGAGHAPTPSNAARGIGLGSDNRSVQGTADDGSSGGNTNPTGVKEWEATRHVVETLKTLVQQRGLSDKIGFRDIYSWGGLSRVPQEVESVRGQQYVDVHFDARGFGKAGVLPSANESATDRSLMNEFGRYSSTFDPSSKGVTRGGGTLLELARIDDPAIRGLLEEVKRGQQGPASMRMAEKILRGILPSVGASPVQADDLSGAPPAPVL
Physico‐chemical
properties
protein length:756 AA
molecular weight:82671,6 Da
isoelectric point:9,34
hydropathy:-0,55
Representative Protein Details
Accession
4S1ms
Protein name
4S1ms
Sequence length
706 AA
Molecular weight
77700,54200 Da
Isoelectric point
9,41872
Sequence
MVRKNRRKSSKKDLAEEILRELQGAKKSDDAPAGLDDLIKSIQNEAKREEKRKKEITAIVKSVGKAKKRPKPKVEEIDPRILELLGIEEYEAELDYDDYVVLLKEVMAQRVVSGGSEEREGDTERLKKELKRARGSTGRFKVKPKKTVKASTFTGRKSRTSQPRQQTSNRITNLGRNEEQVKTEIRTERQEELIPLSSSLTKIDNNLRQVLELDRDKNKKEKSAANKLRREKATARRRGREARMEGGGSQSDPKKLEKVTKPFTGAFDAIMNFFKNIALGGLVTFLLEVVKDPGIIIRPFYDFGNFIIDFINKYIIGFANFLLTPINLIVDVLNFANRKIIELFNSLSKLNPFDDSDPVEYKEIGDIKIPEIPNLEYPQFAQKQEGGGEVVDANSISMVKGGAIDNSTGLKVKGMGKDTQLIAAQPGEIMMSKKAVDAYGASNLLKANAAAGGSNTPNFGNILGFEGGGMVGGNYDSFAKEMIKVHEGLRLDKYLDSRGFPTIGYGHLIEKGESMPDRITKQKADELFDIDYRHHKKAAEKIPGFDQAGGMQKAALIDLTFNMGPAWASGFPAFKKAFKEGNYEQAGNELVDSAWYGQVGRRAPTIVNLIKGKSANAAYLKDTPKPSPGSGGSSQTMIASSPQSSNIQPYSNTGGGSSQVAIPLPQKQQGTNSASSAGQKTVPGFSAEDMNNFDLIVVKSIYNIVG
Other Proteins in cluster: phalp2_18234
Total (incl. this protein): 59 Avg length: 726,1 Avg pI: 8,32

Protein ID Length (AA) pI
4S1ms 706 9,41872
12jJo 779 8,77971
15PFU 760 8,93540
1RqE1 747 9,44528
1Rsnu 789 5,20704
1S1Py 795 5,32305
1Sw26 743 9,23814
1T0SW 795 5,41740
1TcvQ 592 9,41221
1Vhwo 789 5,52665
1VzlE 788 5,57303
1ticm 710 9,34580
1vog3 710 9,40898
1wAZa 710 9,42143
1xIfK 710 9,38416
1xrD4 710 9,39686
227cm 728 8,84392
22863 583 9,37759
24fz1 728 8,51042
268ut 829 9,35954
26bRa 782 9,30461
2JXPs 791 9,28314
2uXzu 810 5,43730
2vMbl 704 9,64545
2vOYD 694 9,39016
2vgOj 787 8,50114
2vgVo 657 9,11030
2zFNS 660 6,40208
2zHyR 694 8,11098
36bZd 790 8,34848
3IIvo 712 9,60651
3IXoO 704 9,57634
3KnqG 666 8,44421
4R5RE 479 9,17129
4ROqS 730 8,58327
4S2Qg 592 9,41221
4iQD5 859 5,23558
4jx7v 728 8,66018
4stv7 718 5,38682
4tGzH 719 5,29105
4wB1Y 647 8,63678
6BkUB 710 9,37204
6Co8t 622 9,10875
81w7e 831 9,33433
8A0RG 706 9,39532
8CXIL 592 9,41221
8FFyy 795 5,36614
8FNKG 795 5,32305
8GQi9 743 9,23814
8qI3E 592 9,37752
8qhre 706 9,38268
8zHpy 743 9,23814
Aeii 760 8,86893
Dett 802 9,46494
eSl3 705 9,12358
r6Qh 813 5,90878
thDk 760 7,57075
u6XN 782 9,34477
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12654
1Stlx
3 35,2% 468 6.247E-165
2 phalp2_20122
24cuN
93 32,8% 777 1.311E-135
3 phalp2_29814
22sRD
17 29,5% 616 8.964E-68
4 phalp2_30844
iB2r
13 29,4% 485 3.056E-59
5 phalp2_15795
478xy
67 27,7% 554 1.508E-56
6 phalp2_7951
ANiV
5 24,4% 698 2.115E-46
7 phalp2_38252
6RqN
3 25,3% 726 2.858E-43
8 phalp2_24435
4uyrv
2 26,9% 482 1.156E-39
9 phalp2_1272
Ddcp
53 20,5% 588 7.599E-18

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4S1ms) rather than this protein.
PDB ID
4S1ms
Method AlphaFoldv2
Resolution 65.96
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50