Protein

Protein accession
52HJM [EnVhog]
Representative
4Co56
Source
EnVhog (cluster: phalp2_30333)
Protein name
52HJM
Lysin probability
58%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
VAEVIKTVIDVELNSGQFASQLRSLQQQINSFNLTLNKNQVTQGQAAKFFANELSNAVNSSKFFRAETVKMQTAAAALDSTLRKGQGTLGQFFSAAFNKRSAMAAETFALAAERARTMQTQFIATGKASKGMQDALAIRPLTAFISESSIAGQRMQILNSMFKQGTTQLINFGKNVQWAGRQLMVGFTVPLTIFGTTAGKIFMDLEKQVVSFKKVYGDLFTTPAELNENLEAVKGLAAEYTKYGIAVKDTISLAAQAAAAGRQGAELTDAVTAATRLATLGQMDQNAALDATIALQSAFQIGGEELADTINYLNMVENQTVVSLQDIAAAIPRVAPVIKGLGGDVRDLTVFLAAMQEGGVTAEQGANALKSGLASLINPTKAAKETLSGFGINLQEIVDVNRGDLMGTVTTFATALEKLDEFSKQQSLEKVFGKFQYARLGALFENIVRDGSQASQVLETMEYSTEELRLTAEKELNTISQSFGVQLIAAIEKFKLAIAPIGSLFVKMAIPIVQFITKLANWFNSLPEGVKNIAAIATVITGLVVPAATMMFGLFMNLIGTLAKMSQGAAIFGATLLKKGPIAALKTLSQSTKYLSLAEIDAANSARQLGSATGIANAALLQQVGAATSANIAIKQLTTSYQMLIAQQLQAGSTQPLAFSAGKNASEIAKARPKTRIKGVGFFNKGNIVPGVGNTDTVPAMLTPGEFVVNKESTKQNYDLLTAINNKQVSGFNKGGKVPGMQYFAKENTQRVVQNQDFIPPSQSNNTRGPDFIRPSRISSNIQNPEQINTLVSQHFRGYIFSSEKAREKAASAIEFLQDKPNDLEAFANEVKRLTRNRVEGKQFNINTTSLNSAMTRFAVDPSSVVKATEKSHINSVIHTLEKDLLVADGRTIPKGTKVQFLNNRVLDFSDKLNKKLSKGGATVAEMQAEIAEDIFQTMNFQAERYADELRITGKARQTFMADFLQSKNKVRASYQDILSKLPQNARITDHGLPVDGVGTIKLGDISDVAFSEFGSKTRSKRQLQLFNEALSQNTNVRIAPMQEIANDLYELKGKTGNVNQFAKMSKLELIKETYKAQNIDIKEEPYASFIKSITKISKSTGEEVFDQKQYTSANSFDPNDKAFVKLSERAKNLISKLVFRGSGNIATVVRTAFNKGGQVPGVQYLNGGGRVATGLWEAAMAGITTRRKYDFDKFPIQTLKSGMTAGRRTSGTRPLNVEIYDPITKKTIGQYDVDPNNALEKRSLLNTLQVEKSKGFKEAYIRGIGEGFNKGGMIPGVKYLNGGGPSGKIKSTKNAVEEIFGTDSANALSLFGISLGTRTGSKALGKGKPKDLLDLISSEEKRYMVKIGDQRIASFMKLKKENGKIKKEVIKNDGGYPVRDYKVWEELNPDDILEIFSEKSAIQKFNRGNIVPGLGNTDTVPAMLTPGEFVVNKDATQKNLGLLHAINAQKLNVGGKVKGGIQYLNNGDLVAQVKEIEKYKRENPGATTPQAVRALGIQTGNTTKAVRPRGAGAAGMAAGVAASLPFMGEGSQEQFGMTGSMVASFIAFEAARSLVTKSFITLSKNAQAGAKGLGIVGKGGQAIQKQFVQMRKYMQFFKMSNPYVLAVVAATAAVVGFTKLHNNMKQAVSKYNNALYGTSESLENIAKAMGRTTPGQREAQRTSELISGQRATEEEKAMSSQLMQSEQGQSMLQDLQTVKKMGGDQASALRNQLTQAVMSGVLTPSEARQVATDIGIALKNETVGINAVAQITQLIGPEGELIKGNRIEILSQIAAPTNPEEITEQVNKELKRIDDAPFWNVFSKQYDFGLAMQQIVKVLVPNDVLAAQMKIETLQNNIIENALKEKEVRASLREEYVKGAISYEEFANQTEQLNTAGGSLGIEKQLDALNAKNLSFNAAAGAFGTGNLGLFRDPVARLTGAGPMALNLAGKDVNVKAAEDLANKVTRQFNEEMTNVGIDEAEQQEILKFVTDEFGGNLADQAIFFGDILSGKMSLAALT
FVQEMSKTNTDFANLVEKNGLEATTSAFSPLSSNPELMQEMAQRASVSGNLGTVEKVAENYKMLEGLSPEITKSLKIDITNPDVVNKFGPLAEDLNRNWNMLDSLNPQLDKEFVFSLIAVDANGNPKTAEEIEKDAIRLNKAMADLSSGTKEEKRKAQLELTALYNGKDVSDPAVQENLNQLKKDIKGFDDFDAATQSKLIEIDSQIDAKGIEISALQALKDRGVATDEELRKLAQLASEKEKLEKDLASTASSARVAGAGEGDKGQESAFQKTKKSFEETIKYQKAIVQLTKQGVSGENIALVDQASLLEMSTFERKKAINMMKEQQDVQKMLSVMLLSDEEQRIRLQQNAVKVKDKEISTLNRQIEDVERLNRVENQRIDKLQRQNELDSRQVAIRNKSLETLAEKEKDVNSAYDLRTKALDRVAQINDRLSRQRQDRIALASALTSGDFGAAASAAETMSSNFASSQVEDTKAALENQREKELSSLTAEVNGQLFTRAEIEQQIDDINERVYQRNLSIQNLQDIIFNREQELEPIRDAVFAREGERLQLARDLEDAEYNKWKTEMDGINASILRWNQYWKAKRGEGGTVLKDEYVKASSTSTNKKTEKKSFGGFIRAAYGGMINYRGSKESPPAIRANLGMQVPGTGITDKVPALLTPGEFVVRKSVAQANLPFLKSLNSNVFPEMSSIEGGGLAPVISDNSTSTMVSPVYNNYSVNVNVAGTDASPNEIADAVMNRIRMGQQRQIRGIRR
Physico-chemical properties
protein length: 2754 AA
molecular weight: 299830.5 Da
isoelectric point: 9.06
hydropathy: -0.32
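The hydropathy value above is presumably a grand average of hydropathy (GRAVY) score, although the record does not name the scale that was used. As a minimal sketch, assuming the Kyte-Doolittle scale, the score is the mean of the per-residue hydropathy values. The function name `gravy` is illustrative, and the example runs only on the first ten residues of the sequence above, so it does not reproduce the full-length value.

```python
# Kyte-Doolittle hydropathy scale (an assumption; the record does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in sequence) / len(sequence)


# First ten residues of the 52HJM sequence, used here only as a short demo input.
prefix = "VAEVIKTVID"
print(len(prefix), round(gravy(prefix), 2))
```

A negative score, like the one reported for the full protein, indicates a sequence that is hydrophilic on average under this scale.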
Representative Protein Details
Accession
4Co56
Protein name
4Co56
Sequence length
3100 AA
Molecular weight
336670.93210 Da
Isoelectric point
9.31937
Sequence
VAEIIKTVIDVDINTSGAAAELRSLQQQINAFNLTLNKGQLEQGQASRVFAEGLRNSINTGGFFRAELVKMQTAAGALDSTLRKGQGTLGQFFSASFNKKGGMAAEVFALAAERARTMQTQFIATAKASKGMQEALSVRPLTAFSADAAVSAQRMQILNSMFKQGTTGLINFGKNVQWTGRQLMVGFTIPLTIFGTTAGRVFSDLEKQAVNFKKVYGDIFTTPAELEENFKAVQGLSREFTKYGIAAKDTLGLAAQAAAAGRKNTELTDAVRESTRLATLGQMDQNAALETTISLQSAFRLSGQELADTINFLNMVENQTVVSLQDIAAAIPRVAPVIKGLGGDVKDLTVFLAAMQEGGVSAEQGANALKSGLGSLINPTKAATDVLAGFKINLDSIIQTNRGDLMGTVTAFSDALSTLDEFSRQQALESLFGKFQYARLGALFENISREGSQAQQVISTLGYSTEQLAKSAEKELTTVEQAFSTQLTGAIERLKIAIAPLGEIFVKMAIPLVNMATKIFEAFNKLPEGVKNVVALATALGGVLLPAATMIFGLFANLIGTFAKFTHSMGMFGVTLLRKGPLAAIKSLTQSANYLSLSEIDAANAARQLGSATELANAALLSQVGSANSADIAIKTLTNSYRVLISEQTRASQVQPFLFGTGQAASSRVGGSAAASVATTMATRGKSRIKAVGLNQGGSVFTPDNSSTVPGVGNTDKVPAMLTPGEFVVNKESTKNNLGLLHSINAQRLNVGGKVKNGIQYAMSGFLIGNTPGYSAATNAVAAKLVGRSQVAKTVGAKASNKVEQLKNNLVYLLTKSNNQQLTGVRTMKEFDKRGVPLDVLKQDMVSEEGLWMIDAFAKGDWKKDAYKATLENNFANIASKYPGKEIRFLDSAGLERVQPLLGKPEYENVQFVSVKELQNKELFDNLTPEHLETLMTDGTQVSKELKYSKSRGTTTLKDVYRGIPGMESYTRIQKNWRGTPKPKNRQGAHMYNMGGKVDGVQYAMAGQLIKKGTKKFKTGKVKHVRKNLKARDKEILQDMVNNVTWEKRVQGARLNDGISLYNQGYSLEEIDNILIEIGYFKKGLTDLAPMSDAVKKFAKPEKIKSSQIKNKLRQAAPTEESRKAVSEYFLKKGYIDEELANLINGPTMGAGVRSHYAEDAIANAKKSSISFPGYLGQAIIEPSKYNSVYNQIGVSGSPKNKADHDKLMGYVQELLGIDKRLIKADFDVFSMMNKGGQVPGMQYANTGKIIAAMFKSKGRNELIDRLSLSNLPSDVKNKIINGEIKKRGDFTLRADRSLFDNSISRTISSNPSKDDLLDLLTTDYNNLYSNSHLYSIALNKNADNDVLNVLATQYRNRLRYAKPEEVGTHLNQTAKIIQDKGIKLNKGGMIPGVQYANKGKQIQEVMSKVFGINADEIAPMFGALPQLRTQFLNRPKPFKKISEYVKDKSFIDNDIYFIHPRGGEPHEARIFMGTDGQLRKQRIGYEPHADSMKDPNFRHGGDPRRYKEEILDPNFEAAIYNKNEYKNQGAETLRPKKFNKGGMIPGVQYLNLGGIGSAVKRSLDTPSRRSRQVLRSLLKDSDEIAKSKGSKGIFDDVKKSKQNFLDTPLSAATHAVGQAYANQVMPVYQSLLQKGIIKPGQFNSFDDFMNFLSKKSLGNRSTSGQKQDPLISKFNEEFMKINHGLGPDDTLRFYRNSRPGGRQAEGSNPKVGYYSLDREMGWGYGLASDISIGGGKRYQIDLRPGDIPGPIMSGGYADEFAINLDAILANKAKEVGEMLPNVSKRLTGQQGMTRFFNDKKFYKDNDLSTMKFNKGNIVPGVGNTDTVPAMLTPGEFVINKESTKKNYDLLTAINNGKVDGYNKGGGVANPMRMFYGNYTKEEIKARAKSIKKAGGVGTLPKGIGAVGMGAGLAASFLPSMFMSEETSMKASMAASIAGYAVASKAATIGMLALSKQTITAAKVFPQLSKGIPALTSALNISAATLFGVIGPIALLSIGI
IALNKMANNAVQSGGELIKSMYGSSDRMKEFAKAFGRENTQQTLARQRAELAAGAPIGQEAQQYSTQFLESKAGATLLKDLQNVSKVSGTAGRDQALLSQLTRGIVTGSITAEEARAIALDVGKKLGDQSIGIKLGAQISDLVGPDGKKLTDNILKINAVITPTIDMNEIGKNVSKQWENLGIGSKLGMILTGKGTDDMIVDALAQAAIEANAITSEQLDLLRLELESGNISIQEFTSKKDAITAQSNKTTLDAIETINKKYGEGTDKALNALREFRKEFEKEFKTKISVLSDEDEKNAQKAQDILRKGEAASTNPLPLGQQVVANVATKTTDEEKRKYLTELGLDPSMAVLDPQQMLTAIEQTFTADQKLAKIVSEKLFGPQAKQELGTKAADALALLDPQFNAEIGKLLDEDKTGQLKEKLVNLFEVDPETFAKFQNLFDVGGPDALTKALTMGEDELSKYLDNVSKLSALPKELGIDTTKLISDPEKLDAFVKNIDMATKSLDLLKEGAKKGNITQKYVLETAFAGNQEAQNLLNYYLKTKKFKDIDLNMVLGASMNPEIAKALVIMKKLEEGEGANVSLAEAVWASNTLSNAQVAPGGGKSDNPYVDDGSGTKSALQTAKDAARQTKEVIASQSKLLAAGISVEALEGLSPEAIIELGKQSGKQLRDNIKLFNDQAESIRINKLVLQQLALEENPIKKALEDSKKTVEGYDDQIKELNKTIEKTNRANELDRRRIEDKNKALENLTKKEKNVNDQYNERIKALDKVANANDRVAERQRQQIDLATALTSGDIAGAAQAAAAMTQTGAQNQIEDTKTALEAQRQAELDGLTESVNGRLMSRKDIQAEIDAIEETIYQRNINLRFENDEIYKIQEKINQEKVRQQELNDVLAEQDRAEAKLATNKLTNAKNITAETKKQRDNAIAAAYADYKKLVDPRGTNKNLTLSNYKTDILGLAFGGMIKKYAFGGNVGYKGSREAPPRLKMARGSLVPGLGNTDRVPALLTPGEFVVRKSVAQANMPLLKALNSDVFPSMSLNTPDVAPTVTSSTSTILNNTPVYNYSISVNVPNTTASPDEIANVVVSRIKRSMDTNIRSNRY
Other Proteins in cluster: phalp2_30333
Total (incl. this protein): 9 Avg length: 2700.2 Avg pI: 9.27

Protein ID Length (AA) pI
4Co56 3100 9.31937
1APQy 3069 9.18425
4bt8Y 2652 9.40763
4pgsJ 2370 9.46965
53hLy 2474 9.21345
5dX4V 2637 9.51407
5yywl 2784 9.02404
jVHI 2462 9.24124
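The cluster summary can be cross-checked against the table. A minimal sketch, assuming the reported averages include this protein (52HJM, 2754 AA, pI 9.06) alongside the eight listed members; the variable names are illustrative.

```python
# (protein ID, length in AA, pI) copied from this record.
members = [
    ("52HJM", 2754, 9.06),  # this protein, assumed to be part of the averages
    ("4Co56", 3100, 9.31937),
    ("1APQy", 3069, 9.18425),
    ("4bt8Y", 2652, 9.40763),
    ("4pgsJ", 2370, 9.46965),
    ("53hLy", 2474, 9.21345),
    ("5dX4V", 2637, 9.51407),
    ("5yywl", 2784, 9.02404),
    ("jVHI", 2462, 9.24124),
]

total = len(members)
avg_length = sum(length for _, length, _ in members) / total
avg_pi = sum(pi for _, _, pi in members) / total

print(total, round(avg_length, 1), round(avg_pi, 2))
```

Under that assumption the rounded results agree with the reported total, average length, and average pI.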
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23044 (49sYk) 5 26.9% 3441 1.473E-308
2 phalp2_35139 (5Di68) 10 31.4% 2245 2.209E-282

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome [NCBI] UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.