Protein

Protein accession
4nMyg [EnVhog]
Representative
22KDh
Source
EnVhog (cluster: phalp2_23947)
Protein name
4nMyg
Lysin probability
71%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAGDTNSNIFINIDTSQAMTQLRLLEKELTTLNRSLIVGTKTAAAAQSKYAQSLLHNVNATGQWTASMTRMSTASEQFAKNLDKQKLSLKEYFRYGAASTKTFGKMFGSEFDTIGKLVDKRVKTLQQQYVQLGRDAQGAMNAMKFNPKALNYGNVTTQLMAATQRQQIFNKLVDDGSTKLLNFGKNTQWAGRQLMVGFTIPLMLFGSQAIKTFKEIETQVIRFKKVYGDIFTDQGATAAALKNIRDLGDEYTKYGLKVSDTIKMAADAAAAGFSGKGLEALVEQTNKLAVLGGVTQEKALETTIALKNAFQIDTGEMAGTIDFLNAVENQTVVALDDLTEAIPKVAPVIQQLGGDVKDLAYFMAAMQEGGISAAQGANALKSGLASLINPSNAASKAAAAVGINIKGIVEANAGNLRNTVTGFAQALQPLTDLERSRVIEKVFGKYQFARISALLNNLGREGTQAARVLQLTNASVEELAILSQRELKVQADSPMNKFVGSVERLKAAIAPIGELFAKVLTPAIEFISRIADKFNSLPDGIKKAIGIITVVVGGLGPLFLMTFGLLANAVANSVKGIQVLRKGYQQLSAGSSDAALKTQYLSQEELENISISNALYSKHQQLSAAYQLEAAALTSLTSVYRGASAAMGGFAGQNPGLFMPGKGGMPRRFAEGTTSVPGPRGAGDVVPSMLSPGEAVIPAKQSQKYSGFIGQIIKDKVPGFAGGLFPSFGAAATVGKGIPLSAGPAAFREAQQARYAARDAARRGLSGNVAPVVPISSRLSGIRYSAEGSKVRVSVGDESFLIPAGKLDNFKKKLKENEDWMVANKRTDTTEQELLRTIKRKGYGGKDVTPSQIYSRLPKFSNARNSQQNQDIADKRFRALLKSRNPHLVKLQNYLVNEEKLYLEKVLGQDVVKSLKGWDVNKLTPSHIREVRSQNRTPEDWAPSKIARDWGWFNSGLRGTKFGNVKGGHPLNAAQAREVLTNLQKTPFDKLPNEKKALQAALEYRLSRKPSYYDDFIFTDNAMMKVKPTMNLAEGIVSVPGPKGAGDIQPAMLSPGEAVIPAKQSAKYMPLIQSMVADKVPGYENSNVNPFSGTKAPPGMVYTPSGLLVPAGGGQASAASRSPDRVEKAIDKFFDKPRVKKLGDRIDKFAAQMGKTTPKVAQLENTVTKTTQAFGTDKTRGFRGFIGGYGNVSQTVTGEDGTTRAASAAERTNMRQMNRMNFSQKMMPAQMAGMMIPMAAGMYAQKNPDSGIAKNMDMIMMLSMLTMLLPMLNSPLKLLAATAVGLVAVFKMQASTIKKNIIEGQKQAESMTMTTRRLEELGKITGKVSITQTAAAKRAGRNTDISPVSMEFGNSIISNSEFGKNLKSSFDAAMTTFGSGAAVDSLVNQLGTAVSQGVLDRGQAESIAVALTRNLRDAKLELDVRGRLIQLLGPNGESLVNNPLQVQVDLISTGQNLQKAALKNLNLVAGQQKGINTKAEIGQLGAGAIGGGLLGARAAVQVSDLVAGAGVERAAGAGRVGSVLRAAKAARVATTVGSAGVAATGAGAPIGAAGIAIGTVIFGGIELGIRQWQKGKEKAAIGQAAGMLQGIISQNVAASQASIDVLTSQYDSAIANLELKKKTLKTEQERKAIDDQIAELESKKQSGLKTLRSQQAEILGSASSNYDKVSKRSLGETLSPFGSGRGQVRDKYMEAFSVGMQDKFKDNAPLKAQAAVLQSQLDQIKNDKVTLEISTLVTSDVLTPNEASTLVSTLTKTGGDIQKRLKTLVDVQGTEGVQRLSTILTMLPDENNQKQLVFAMKHMNKTEADATMSAIEELGKVPDYVGIELNIETQKADLDRLKRVGSEIAALNKAFPNGQVTKTALIKMQEEAGGEGKNLTLDSAIRDWTAISKLPKDLQFQAIITMGSIEFSDSFDAILDRELKTAFQKQAKGAKTGRLAAAALQSFVKNPKNIESATKAAMEKIRTQLFGAAVPDTSKKGGPTDTTDEGPKRDESFLSD
LAQRLKLVKEGGFNALTPLISLRKFLNDGGKNSINPGLDAQAGAIKQIETAAKTAGISIDKDFMEIIRGLDAEQFMLWSKTLFEIGKNGRITALKDDFVTINEGFRKATIAGYIQDVKDASKEIENQVAAHKLLTQEGYNSLEIQKILQDATLTAKIAAQGGLKATKEEQSELNKEIQKTINLNYELSSIKLSDNISETKMQVEAFKRLTAAGVKHEVILEILKDKNNAFAIASADATVNTKDKFGDLIAKTKQYSDLLELIANQTKTFEQTTQEAIDANVSALDLQARTLQNQFDIANIGLKAKIKTAEIDVKSVNDSIQKEQDKIDAINLTLKYDPGIGQNFLDDLQEKINDTQRSMDINFDRPLQVLSDRSAVLSNDLTLIDKATEAINEKYDAQEKALQTISQLNSDIAAQEKSRISFADALSQGDISAAAQLANDMRTTAAEAANRKSGEFIAASRKAETDNVVSAGGMTKAQIEAEQFRISQQSYALEQQRKTVQVQILSLEDQVYNITELREAKLLSIRGIETVIDGLKSTQLANAQATLDRLQAELDKNQEILDAKLLAIENEKLAWDSVQIKLDAYKLALTNSKLELESMLALIGKIAAAMATIPTTTATKSSAFVPTAADTGGETPEEKAARLKREADEAAQKAADEAAQKAADAAAAKAAVLSGYATAKAAGDMNAAALFAAKVNPSALAAQESGAIGAASIAAQLKAAERALVASNAVMKQASTLASFKAKEAAELAASNIKGRVGRSSGGIIPKYMSSGGMAPKYFAVGGKARGTDIIPAMLTPGEFVMSKYAVDSYGVDKMKAINSGSYEGEKVYNYNLNVNVKSDANPEDIARVVMTQIRQVDSQRIRTQRG
Physico‐chemical properties
protein length: 2867 AA
molecular weight: 307140,3 Da
isoelectric point: 9,36
hydropathy: -0,25
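A hydropathy figure like the one above is commonly a GRAVY score, the mean per-residue hydropathy over the whole sequence. The page does not say which scale it uses, so the sketch below assumes the Kyte-Doolittle scale. The function name `gravy` is illustrative, and the result is not guaranteed to reproduce the value reported here.

```python
# Kyte-Doolittle hydropathy values per residue.
# Assumption: the page does not name the scale behind its "hydropathy" field.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # Raises KeyError on non-standard residue codes (e.g. X, B, Z).
    return sum(KD[aa] for aa in seq) / len(seq)


# First 20 residues of the 4nMyg sequence listed above.
fragment = "MAGDTNSNIFINIDTSQAMT"
print(len(fragment), round(gravy(fragment), 2))
```

Passing the full sequence, with any line breaks removed, gives the protein length through `len` and a whole-protein score through `gravy`.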
Representative Protein Details
Accession
22KDh
Protein name
22KDh
Sequence length
3401 AA
Molecular weight
371112,40340 Da
Isoelectric point
9,09870
Sequence
MAGDVNSNIFINVDTSSAMAQLRALEKELTALNRALVIGTKTAAQAQSKYAQGLLHNVNATGQWTASMTRMRTATEQFSTALDRSKLSLKEYFRYGMASTRTFGRAFGNEYSTVSKLVEKRVKVLQQQYVQLGRDAQGAMNALKFTPKALNYRDVTTQLMMATQRQQIFNKLLDDGSTKLLNFGKNTQWAGRQLMVGFTVPLLLFGSQAIRTFKEIETQMIRFKKVYGDIYTDPGQTEQALKNIRALADEYTKYGVKVADTLKMAADAAAAGNSGKDLEQIVEQTNRLAVLGGVTQEKALETTIALKNAFQVGANDLDQTINFLNAVENQTVVALEDLTEAIPRVAPVVKQLGGDVKDLAFFMAAMQEGGISAAQGANALKSGLGSLINPSKKASEAAAEVGVNIRGIVEANQGNLRNIVVGFAQALQPLTDLQRTRVIEDVFGKYQFARISALLNNVTKEGTQAARVLQLANASVEELAILSERELGVQADSAMNKFAGAVERLKAAVAPIGEIFAKTLTPAIEFITRMFERFNKLPEGIKKGIAVITAVVGGLGPIFLMTFGLLANALANTMKGFNLLRKGYQQLAHGSSDAALKTQYLTNEELENISVTNSLYSTHERLSAAYRIESTALGALIAQYGTATSAMRNFSMANPGLFVPGRVVPPIRRAGGGTVSGPGTSTSDSIPAYLSDGEYVVNAKAVKQYGVDTFDAMNARKYSSGGPVIGKDGIPRLFGGGFYRRILDAMLPKTRLGVVQAGASGYDISGFGTGGRPRVGVVVPPSSARVGGSRPLVFTEDPSGKIKVALRDDPNSFFYIEARKRLSFENSIDSHVVERMSRGDSAEKIYNNFREQMLRNRRSFRGQPILGRTETPSKFLFGLTGYKKSASNAKGAPRHVKDAFEKKAIIFDSTGAKIGYTRTKAGELYQELDNEQEAISRWVRTNVGNLDEKQIAAIERYNRVAVSHLDPADSHLWTPRTGSRDSALINQALNYEKGRFGSGNHITNQQEAQLFLNYLESRYAQLGDTMPGTQLGALQLLRMRLKPDTNGKTFYKRHGIKDTQLNLQYTSVRQNPATKKYYNVDKFGNATTEYAMGGAVQKYAGGTTFVGMPKSFTKVLQTRALAEKLNEAVNASRFKNLPITDTGVKLKDLGGFSVGEISRAVNGVYKHPDGRTVVYKAVESEEAALAEMRMAALMRGGSELKTPMNQSIKVIADPTDLTRKRKILAIESDYDPRFENPTGEFTKKQFIKQTLAAGIRGDKDVKRSNISGDDVIDQGNSGVFGTASSRFKYADSMKTIEEQLLINFGAVKGGASKDFVKAVRKMAKTMGYSAFKNAMLKEIEESIPRYKATINSFKLNPQERKIYDDLVVRLENSKNADWRKVYNAAAGIPGYEDGVFSVPGPKGAGDVVPAMLSPGEAVVPADQSQKYRPLIKSIIADNVPGYAGSNIDDWGDDDTPKRGGSYDRTRVDRFQTRMEAKIDRAADRFVGTRVGGWIDRKVRQKQEREDAARNAARASARAVPMPAFSDLKESIDKNTKAANDSTDASKKNTRGVLQRLGFDRGNLSEAERLERTRGGRGFLRGFAPVADFDRTVTKNGKEKFTLASSAQKTNARQLDRMNRSQRLAMPSMGVAMATSMAGMYAMSNPDKQFMGMNLGQLSGPLMGVSILAGLLPMLNSPIKMLIAGVVGLAAIFKMQSAQIKQSILDGREQGKTLLTTKESLEEFGNITNTVSKTQIAENVRASRTTEIVPVSMDFGKNFILNSDFGKKFQADLQKNIETFGKPIAAQVLGNQLATAVSQKVLTEEQAQSIAIALTRDLKDATFEMQVRGKLIELLGPDGKNVITDPIELQLKLISNKKQIQQETFKNLQEVISRERNGLAGLGGKEALAMGLSTIPAAGAGALATGAMYRGKVVNYNLRRAEELAFKGKRAQQFMAATEMARGSGKLNMAANAVRGLRVGTQIAGAGATATGVGAVPGLTSIILGTVIFGGIEAGLRAWQK
GNEKKAIGKAAGIYSGVTTELLKSTQQGLDAMNAQIDDSVKILEAKKRIAKTSQETADLEGQIAVLEQQRAQGINKIYAQQVEILNNLETSFNQIGKSTFFETISPFGTGRGQLRTKFLESFQEGTDLKFKKDPISKMYVERMRKDLQKTESIQGSGGSSSAAAMAYNERQKVASDVITLKIEALINSDVLTAEQAAEVVSNLSGDRVLAQKELETIITVHGTEGLQRMAILQQYIPKEKNKKNLQLMIQSSNRQDANEMMSALEELVKLPSYLGFDINIETQKGDMERLEGVGEEISDLKKMIPNGQISLKILQDVQQKLGGPGKNLTLDAAIAQWETLSKLPKNLQFNAMITLGSIQQSDSFDKILDRELEAAFYKANPTLQMSFVDPEKEKSKKAALAAFKLEQKNIDAATKAYFAKVMPELYGTAVKDTVKTGKGTGDGKGKGPDTSWLSELLQRLKLLKEGSIDATGSMKQLLGQVTKFFGPGLMSSVNPSLDKTRGALLQIEQAAKAAGITLSTEFVDFIEGLDAEKFEEFRDRFLSMSNGKIVGFKGTSSQFFGSDAAYDRRQKAIKAGTFDGSQQSLFGQVNEAFRTKTIAEFIKKQQDSIKESNLQVEAFRKLTDATGEFKFDAMAAMEVLKDPALAKEIALGKKIFSPEEREAIITSINKTYAAASALSKIKMIQDTEGLQNQVKAFDKLSAAGYDYATILKVIENEAYAYEIAKDGVDGLSDSTKALVKDTKNYIDALTTLENTKFFEERTNALKLKQDFAAIAPLLVQAGASLSDIQEILSNPNLAKAFIQELQDGSLDAGRIKEYLDQIPDFKQVDVELRISTREGQEEEFDKLFSKAMEYYDLLEDKIEDDFEPLLKNAQDAIDATQEKIEGINDEIQKYQDEIDVKQRKIEIEITRPIEILQKDSAELANDLELMNVSADEITKKYDEQAEALTKVFEINSRIADQQKQQLNLAGALSQGDISAAAAAAQELKASQSAAMQEDQLGILNTAKQTQIAGLRNKGGLTRLQIEKLQFNTSQKIYELEKKRDLELIEVRKLEDAIYNIKTGRLKLAQNELDLNNKNLKSIQDQKNAAIEAIDKQREIWTDAKLAIGFARIEAGHYNDVIEFSNTLVTLMKDGWLGVGNAILAAVSALALYNAGLKKKPLTFEQAQAQAQNTLGGYLNTFGSKLGSADQQIVGLELKIAEAMEKGEDTSALEAQLAALRASTELLADNMYDVAKTLDKVDNATNIAAINSAVSTATSILKDPFGDIDVEWDGPVWIPEEGSGAAGADFVQVASKGGIIKPSYLKKGGMAKYFLGGGFAKGTDTVPAMLTPGEFVMSRYAVNAHGVEKMRAINGGASIGDSVYNYSISVNVKSDANPDEIARVVMTQIKQVDSKRLRGASL
Other Proteins in cluster: phalp2_23947
Total (incl. this protein): 54 Avg length: 3080,1 Avg pI: 9,11

Protein ID Length (AA) pI
22KDh 3401 9,09870
1O09D 2875 9,29694
1O7GF 3101 8,98839
1zDXD 3352 9,07446
2Y2wq 3162 8,98859
2qp9r 2823 9,01315
30jjf 2822 9,07865
30kQk 2863 9,36495
33wVb 3528 8,82580
38dcC 3119 9,27315
38pu6 2363 8,80504
45Qv6 3613 8,98143
47MCK 3254 9,33323
47MZi 3182 9,28630
4JduV 2749 9,34516
4aHEd 3772 9,04209
4aYeo 3205 9,16607
4aZXd 2882 9,35599
4apfc 3138 9,08032
4nM3t 3654 9,04686
4prv8 2867 9,35257
52I2W 3198 8,80743
586zP 3176 9,27379
58x8O 3139 8,99697
59JOG 2897 9,16445
59mQ6 2859 8,96177
5Ay6Z 3183 8,93450
5aIq2 2853 9,28598
5d89A 2929 8,92992
5dcKp 2754 9,36740
5eOIf 2856 9,37237
5g8D4 3165 9,30493
5gaXg 3177 9,36734
5goUb 2863 9,20539
5kyF5 3288 9,24981
5nyhw 2867 9,36676
5o6NF 3288 9,24285
5uF8w 3187 9,14582
5vBON 3146 9,03275
5vkIq 2855 9,33452
5vstg 3185 9,09928
5vukc 3175 8,99858
5xqG1 3209 9,24046
6AejQ 3100 8,52847
6HeCc 3085 9,25407
6L4Nc 3369 8,99942
6LNCG 3101 8,07977
6LO7G 3060 9,13125
6LPoM 3225 9,22970
GUJc 3063 8,64110
GXdt 2702 9,37378
Keto 2999 8,71505
h4uP 2781 9,36121
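The cluster averages quoted above the listing can be checked against the rows themselves. The sketch below parses rows in the `ID length pI` layout, converts the decimal commas, and averages both numeric columns. Only the first three rows are embedded to keep it short, and `parse_row` is an illustrative helper name. The listing appears to hold 53 rows, so the stated total of 54 presumably also counts 4nMyg itself.

```python
from statistics import mean


def parse_row(line: str) -> tuple[str, int, float]:
    """Split one 'ID length pI' row; the pI uses a decimal comma."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# First three rows of the cluster listing; paste all rows to check the full table.
rows_text = """22KDh 3401 9,09870
1O09D 2875 9,29694
1O7GF 3101 8,98839"""

rows = [parse_row(line) for line in rows_text.splitlines()]
avg_length = mean(r[1] for r in rows)
avg_pi = mean(r[2] for r in rows)
print(f"{len(rows)} rows, avg length {avg_length:.1f}, avg pI {avg_pi:.2f}")
```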
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1882 (467FX) 127 26,8% 3816 0.000E+00
2 phalp2_37182 (1NZLQ) 22 25,0% 3763 0.000E+00
3 phalp2_23044 (49sYk) 5 21,9% 3639 1.736E-156

Domains

No domain annotations available.

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG [NCBI]
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (22KDh) rather than this protein.
PDB ID 22KDh
Method AlphaFoldv2
Resolution 46.35
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
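The confidence legend amounts to a simple threshold lookup on a pLDDT score. The sketch below shows one way to express it. The function name `plddt_band` is illustrative. The legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    # Assumption: boundary values (exactly 90, 70, 50) fall into the lower band.
    return "Very low"


for value in (95.0, 80.0, 60.0, 40.0):
    print(value, plddt_band(value))
```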