Protein

Protein accession
mwiW [EnVhog]
Representative
4bEaJ
Source
EnVhog (cluster: phalp2_34374)
Protein name
mwiW
Lysin probability
94%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MARTSEHDLSSLRIDPGPHIGKIVSNVDPHRQGAVQVELLGNIGNQRGADQQVFTVRYASPSFGSTDVEHDATNADDHHSSQQSHGFWSPPPNTGSMVLCVFVGGDPGQGYYFSSIQDQHMNQSVPGIASTKNTKKQYKSYNPDTGDWSKTTDTSSLTDQGSQLPVSEINRPAAKGVQPDTNKMEKPVNSARVSQLAEQGLIDDPYRGTHTSSARRESPSNVHGWSTPGPLDKTPGAPTRSVGPRDTQVTKHTSRRPGHSFIMDDGDESNLRKSKAGEGPAEYVNLQNGETGGDVSVPKDQQIRITAANGAQFIMHCAEDFIHIHNSKGTAWIEMTSNGKIDIYTADCVSVHTEADYNITSDRDVNIHAGRAINLYADHDINAHSKSEMHIKSDSNIAMSASTHIAVNAEGGDITLMSDGTANMTSGDIRLDSRNIDVMAHGHIHLTSSGNTEIRAESIAASASIDIDILAAGTINQTSATQQLKTYSAFILKSDGTIDQRSDGPFSTSGSDMAMTSDGPLAVTGGGTINIQGGPNVLLNSGPGRKKGSTTVATDPYNASLDAAATSPISAGAARQANEAMGRKPHPAKITADPITHAIHTNYGRAITSRVPLDHGSFKGFENFNPPGHTQDLTDRHNSDQPYAKGEERRQISNNDDLENIPNGTRKPSPNVSGARRNPYPHHPSPKGDPSSNVVSQESQISDRNTPSDWVQDQEFMGDVAKLSGKLGITVAELLSIFAAETGTASLDPSKSNGDGCVGLVQICKQTNPKTGKTNFDELAARSPKEAAEDLISPEGMRKLTRHRQMFWIEKYFDLILPGGAGIFPKEDRAVYIWLALAAGTDSSKLITKKTIYPFADTRCKQNKGWQDPTQNNDCTVKKAGTWLRWYEKQFIAPKLGAKSYSPDLSSGPGSTIQPGPAASAPPPGNSSSNWPASTQNAQMPSDPPGVPNSTRWAWYNGKYIAVDATGTPVDSSGNQVSPDAAYSKPQVLDKPSAPPESTGGLPATQMAGGNPDGSEILTA
Physico‐chemical
properties
protein length:1020 AA
molecular weight:108900,6 Da
isoelectric point:5,95
hydropathy:-0,67
Representative Protein Details
Accession
4bEaJ
Protein name
4bEaJ
Sequence length
838 AA
Molecular weight
91030,14700 Da
Isoelectric point
5,89894
Sequence
MADSVRVTNDNPITNLDSPGPHLARVINNIDPMRQGGLEVELLLPVGAQDAANRQLYVVKYLSPFYGVTDVGVNGSDPKDYNQTQKSYGFWFVPPDTGSLVMVIFINGNPGQGYWMGCVQDVYMNYMIPGMAASKAASDQAREQDDKQWKESTKPTKELYGTDFVPTGEINRKSIGEGESTINPNIDAMTKPVHPITQVLTDQGTITDTVRGTHTSSSRRETPSNVFGISTPGPIDKRTNAQKGRIGRIDNQINKFVSRLGGHTFIMDDGNDRLLRKYKPSEGPPEYADVEKGETDGLVEFPHDESFRIRTRTGHQILMHNSEDLIYITNATGSAWIELTSQGKIDIYAADSVSIRTESDFNFVADRDINLSAGRSINLHAQSRTNINSIDIVNIQSDASVYLNASSNLNIKATGKLQMSGDSNIDMKTQSFKLGSSTTDILTAGLTHITSLGNLEIKSGNTLISSAGNTEMLSGSNTKITANDFQVKSVTLNVLSSGAIVLKGSKIDLNPESPPNISAAVQANTALVANGAEVAPKLVLFPVPGVGPLIVKRAPTQEPWDHHENQNPAGFTFDLTDRESSQMPYTKDGPKVEIRSTTDIEKVPSEQGGNAGYSGQSSEGVAGGGAGNNRRQTKIPPDSSNSTTEKINEAQLANMPQEWTKDQEFLSAVQKLAGKMGAKSEELLALMMFESANTMSPSITNSLGYTGLIQFGNSACETLSKYYKTSITTAMLRQMSRAQQMEWVDKYFSYWMKTKGVSPPMTLAQMYILVALPGYVNSPPDATLAGPNGPNEKIWRANPGWRVGAPSSNVITRESIGNAPRKLIPRVQGLLERNGIKL
Other Proteins in cluster: phalp2_34374
Total (incl. this protein): 55 Avg length: 801,5 Avg pI: 6,44

Protein ID Length (AA) pI
4bEaJ 838 5,89894
15pXW 688 6,74556
17FCo 837 6,13994
17L0s 1053 6,17780
18Dtg 1080 5,95789
18Ndo 596 6,38065
18azV 760 7,22153
18lYZ 863 6,04388
18uw2 716 7,33436
1ALph 687 6,74653
1hCF9 763 5,70143
1hCWv 743 8,13309
1hstv 671 5,33749
1htDC 746 6,48188
1hvuu 882 5,75491
1hyhm 718 5,63663
1jd9t 765 5,70859
1jgH6 722 6,39270
1o8Um 957 6,38202
25gqV 847 6,28812
25hJm 669 5,27684
2DTft 928 6,44022
2DTqr 746 6,12602
2F4bj 899 5,73127
2Flyu 791 5,98233
2lrUc 734 5,57883
464K2 829 5,75781
47FGq 562 6,92438
47Go9 903 5,91008
47IA6 719 8,90349
47ZPy 740 7,12093
47bOR 478 5,24751
47gW2 809 6,23327
47k1e 760 5,63072
47mS7 761 7,68670
47nXd 866 5,86063
47of5 789 5,46367
480qX 845 6,13767
490x8 751 8,82973
49N3G 963 8,51584
4aghf 638 6,31364
4bRk8 729 5,83443
4lcly 951 7,26865
4rmfy 925 6,73567
59RSW 770 6,45551
5mwbv 1005 6,10442
7FPf6 750 5,98682
7qAJ 827 5,77816
8LJG 1074 5,96357
SE4c 632 8,73819
SdKx 840 7,28343
TbkH 863 6,08305
eiv6 794 7,69216
mci6 788 6,34138
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14185
2DyW6
215 40,6% 602 6.060E-200
2 phalp2_3302
2TmLB
139 40,6% 595 4.784E-161
3 phalp2_9553
1erZr
3 34,8% 557 4.645E-132
4 phalp2_36268
5OGZ
505 29,9% 585 5.187E-94
5 phalp2_1713
2jRjO
51 29,7% 554 1.084E-89
6 phalp2_12818
8iGkO
3 27,9% 573 1.854E-72
7 phalp2_20632
4MnjX
5 29,0% 557 4.498E-72
8 phalp2_39930
1ozIo
34 26,7% 572 3.952E-69
9 phalp2_3738
5sWSi
40 27,9% 522 3.509E-65
10 phalp2_19209
27xL6
15 26,3% 573 2.948E-61

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4bEaJ) rather than this protein.
PDB ID
4bEaJ
Method AlphaFoldv2
Resolution 83.70
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50