Protein

Protein accession
1expc [EnVhog]
Representative
1expc (this protein)
Source
EnVhog (cluster: phalp2_38446)
Protein name
1expc
Lysin probability
93%
PhaLP type
VAL
Probability: 98% (predicted by ML model)
Protein sequence
VADEQIVTSIVAKADLSSLVSEVHRASASLQQLQRELLASNRAISSSTKLANNLFRDTLTGSGQYSSHFVNLNSDVDKFGKNLDSGRLKLKNYFQTFREHATTQKGMIRELAKEQVMLQNSVLQPLGRNAQGLMQYNVMIPRGLDAIANSGKLARMEMQIMNRALSEGAGSLINWGKNTQWAGRQLTVGLTVPLTMFGAAAGKAFKEADQELVRLTKVYGGLAATSATDLKAIREEVIETAKVLSKTMGASFKDTIALGADIAATGQTGNELLGSISETTRLAILGEVDRQDAMKATLSIQTAFKQNTKELTESINFLNAVENQTSTTLNDLVEAIPKAGPVIQQLGGDVKDLALYLTAMREGGISASEGANALKSGLASLINPTKQTVGMMSDFGIDVMGMVAKNTGDTTGLLMDLQKALDSLDPLSKARAMEQMFGKFQFARMSALLNNLGKQGSQTLQVLDLMKASTAELADVASRELKMVTESASGKYKRAIEGLKAELADVGEEFLGVATKLINAASKILDFFTKLPDPIKKGLTFLAGFTALVGPLIMLTGLLANFFGYITKGVVQLRAFFMKANGWKMLTPEIIAAQKAAELVENAFYSDAAAAQVLHNALQKLVLDYQNLQAASAKNAIPINGGVSTVAGNPVMLAGRRVVDPSDPYVGDPNTRAMSHINPRDPNRPATIFGGVPGAIPVNRGISRTPQMYMHDRLPNIEGLTSVKGISTGIVPGEAAKFHALMATLGMQTEQEVAALKKTIMMGGTVSRELLDTFDDILPITQRFADSAATQSGLIVQQMRNAEITVEQAKARILALNAQIEADMGSAVGLYAAGRGRTLDLTRAPMMDQPVVDANGQFTLRDLYKKKTNAAVMTEFGRLRGIRTFGAPYSMQTTRMPRFNTGGEVESFSSNKTVVSGPSSIGYDDRLGSVPLGGYVLNQQAAMDPANAALVAMAPSTYLNNGGSITAALTPREVVFGPQIQKMPELYAAVDAANSGYNFGGQIMKGTYGYGRESTLSIWARLVNSKDYPKVIKAATVASDAAILSQLTGMDIKDATRRTYADYEAATKHARKASKANGTTKTEEFVKARTAQLINLSKQYPNANLILDAYSNKNKYDQNAKASGHKNIDTASYSNAMKSVVKDLVSRGILTPAQANKVFGILGGVTDGYDPIHKSHFITAKESASVFKPGELEKLLVSNGMSKAEAAKYASITGVRSNNYIGHSTGLMGSFNTMGNWLQGRTGGAFRDQVSPFQDEAKGNFIRGMKKLKSDLGIKGKTTMPEIIKQLTLMKMKKSYGMGFKMIPTLGGRLGQVHWMKQLTGPMVGFFNKGGVIPGGSISADRSSYGVVPPLSARLEAIRLAQQAQYAKKREADMIKYPWIKEAVEGRNRGSSISPLLKLLAPGKQLDILEQAREMSRATATGSFKDLPPVKYGHMVSPSSGMSYPIPGVSGLYRDSEGRLKFFKGVPNEISAKAEVYGTRMAREVFGLDAPEQTIKTISNPLDPSGKSKLLGVESPFDKRFTTGGTVFNEDEMIRQTIASLVMNNKDLSPSNVFGNVLADVGAAGVFAKASRNTEHAKSLPSMIDQAMINLLGVKGGARKDFAVNTAPIAAGMTAKQYNRKMKAAMKKMHPKLVKFVAGLPREDRAPYIKLLKRFETGMDDATDFGPLHAVHVAAKRNSGGPVGGSIQRGRYSYGKTGARRPGNPAARAAWEAEQRAQRERDAEAKRSRASSHQVYGQQALTSGLGREAARTGTTSFYNPGSLIFPQIGAQIKTGLSSVSNAVLKGTIRMNLELLAATNRFGNAYRKSSDLLMNSTKAISAKIISNATSSALKIKSTGSRFYNAIVREQNAFAARNYPAGHAVPSQGFFGPGFIGKYKDLGDGLQSRKVGSLGFRKTEYLVNGVNMTEKQARAAGQQIPTRANGMSMGAQMGIGMAGSMAGMQLMGREKVTILGQEMSGMTAGMSLMAATSILPMLPLGRAFKAVKTAAIESKLAVKGFSLSAKGLSSGIAWVGRFAKFLGPIGLIITGLTTIFDIYKKIQNDQQDSKMANSITAKGAEEAGIKYFNLQESMQGYLDKQEAVAIAAKASRFNSIGMPGLPKSVEDMKKVKEEGKALKEVIESLNRSKSIEETKILMANQKAQYIAGGMSVEEANRKLYGALLNSEKAFQAMSILGDGAFGKIVDRATAAEFAVGNLINTINSGPGATDDWVGDINAGFEGLINTFSAATDGLIGTKDKFGNVIDEYKAYEMVMSSSEKMYPQFNEAIGYEVLADITSANKLLGSLLNREDSIKGVIAKWTLFTKGFTQDLNKIDSALAIKLAGFTEALGTSITKLTDSKDSSTTFGSVGSVLEKLKKSMASVSAATQRSAAAAQRSVQEELKLIAKKIKLIDDEKNKKLEALRATQDASNYALELQKLEIEYQDALSRGDMAAANRAQLDIEQLTRNRQSQLTQQAIEDAANKQKAPLEKDAAAIQEKADRDAAALAIARDNAASSSEIAAKITDFQGEYNNLVERGITATFLPDPERKAEEAAVQAALLNLVKEIQTSGTGNTALAKSIREAFQKLSLFDKNGNPIPTVTTPPKTTGYPTIGADGKVVNPAGTINLDILKQFNKDMKSVSGMAQEITGGTSLDRLRIEMKTALGLLNTNKKQLSFDSDTNKTQIFDGSGLTVQKSPTGNSFYINKQDFVKQGLAFLPETILSINGERWVTKQTFQGKVYVYKMADGGKISGPGTGTSDSIPAMLSHGEYVVKASSVAKYGVRTLDAINSQKFHKGGPVHAHLPDGTHPPSQISLGNNRYVPILPTGTVWRSKDSGMYDSDKSIAPPWFTINPNEVRNNAMFLTPPTKNYGIDKIKSPMGSLQRFLAIAKSQIGAGHDYRLHTKDNPTGAAGDIPGIDAGLSQVNKFTLWGKKKFGLTSDSSWWCGEFVAWVAEQAGVEISNKMQSAWQATQQYKKKGLFNDLTKKDNFKNVKVGDLAWFDYDGMEQPRDGVPDHASIVSRVIKDAISVIGWFGDDAVLQRTYSKKGLQDFFGSVSPEFKKPAPKIITDPSPLSPATNTNQSSDTSTALSPTPTPTPTGFARGGLAALPHLKKKIFESFPKKPYPTRNPGGMGIDKSTPPLVGSGASAYFGGLLGGGGGMMVGYHDGGPVHPHPHNSLPNLKKKIFESFPKKPYPTRNPGGMGIDKSTPPLVGSGASAYFGGLLGGGGGMMVGYHDGGYVHPHPHDALTNTLIQAPPKGTTAYAKYAYEMARNYGGAAKTAAKLTKPIKVSDGPAYSWIGSDGTVGYSRHDKYKYIDAGMGSASEVESQILSDLSKTAMRAKIIEMNAKRFHDGGLAHPHPHKPFSWKQNVPTSNSSGGPGSYNSSVGKGMWDGFSVPWLDSLGVKGIQQTFNKIFQGGKNKGVNDAYSSSPNKGDYAMAALFPLNFAGMGIGKKLTGQIPNQIYESAIEKGLRLQKGEAGINETYHAILDGVSGFYKPQLGNIETKREMFGSIFSRALGLLAPANIPVVTGNSLRPSGIFSPDVAAQGGSLLKHIQGASTGLWGGYDKVSSVEDAISSGYRAAMMTAMRYSDGHDGNLVLNTTTMQPGLIDFGMILDFLPKGHGINIPSMYSPLRQTVPSLVDAVSDPRRQAFFEGLLKAKDVLSGLSRSDIKDMLKSAGYKGIDLQKKLKIVTSSIKDTINAMPSAVDDMSFKVPDLPELSGDIMVGPFSSPSIKGFPSISSLPNSIKNVRKLWSTPQESTGLEWGDVAGRIIFDSSPINPVVKNIRKLWNAKGFHDGGYVGHKHPEDGVPGISPDGKLSTMLSAAESMIGYKEGKEDNDTFFGKWQNKNDNDISSNYIAWCGAFMNWVANASGVPLDNMVYTSGGANKFKEKGDYHTTNPQVGDMAFFNYQYDNDQKGQNRIQHIGLVRKILSDSMLETIEGNTSEVEPPLVFPKDYVDEGYYDKDKLNRIRDGVFRRNRLYNHPGASIVGFGRPKYKGENTMLGQLVVDGKIPSLVNPNIDKHSLSASTLNRYKGYKDGGVVDSNNSPFLFNAETHKINNPLIRENYKLEAEGKLRPYTPVDLPDRSAITEEEIKLLLKGKTLPGTWQDYEKLSLIQMRQDGLRDIEFIRKNLKKATASKLPYYNMSPAVGRAAKQLAQGGRSEEFIRHVLGINDLSIPVPGSQKVAKPGQAPQSPNVSAEGEKVKKFGFGGIAGYHAGGAVGHKHGNENGNFFSRFNPVNAFSSMLTGMFNFGASNVIKKSTNIQSQITQQEKDYMALQTAQMFSGYTSAFNLKNKTSPEIFGSQNLGIAADLLGVLPGLGAASLAPLAAVKARKALSTSAATKAVNRQGVQVFDRRLPSVVNDLMQVTGVKTKKLTKFDETLTSFQKANELALADPLSSNVTPKIISGKGVVNLYDPKQWDLRTAWNGGAFKTKDLINVLRGKKLIDNAAYADPRTSSVYGPNTMGVKTLFHELGHRDLFLESPTSRTILGDDVVANARAGVHEIYADRFSGASRKVLGRYWDKEWTQANFKDPFADGNSYVTNPRTAFLDDFFNYVEGINTGRANQVTPQFLHDYVRNSGNSLSPEALNKINKYQFLFQEYRKIASANKTSLNYELLELLKSFKNSPETFSFVPDLPKFEAGINMVPADMLALIHKNEAVVPAHMNPFNPNATSSAIASGSVYNISVELNGTNLTAQDVATQIHKEMKLKEMASGVNRRVGSK
Physico‐chemical
properties
protein length:4710 AA
molecular weight:508123,6 Da
isoelectric point:9,49
hydropathy:-0,31
Other Proteins in cluster: phalp2_38446
Total (incl. this protein): 7 Avg length: 4111,3 Avg pI: 9,48

Protein ID Length (AA) pI
2XYIz 4096 9,52206
327qD 3751 9,44708
48Qnq 4572 9,39970
5eq1p 3861 9,50085
5w4U0 3871 9,48409
5xkrO 3918 9,51600
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_20087
1NMbG
38 59,5% 3010 0.000E+00
2 phalp2_20547
4plAY
48 32,1% 3356 0.000E+00

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.