Protein
- Protein accession
- 52HJM [EnVhog]
- Representative
- 4Co56
- Source
- EnVhog (cluster: phalp2_30333)
- Protein name
- 52HJM
- Lysin probability
- 58%
- PhaLP type
-
VAL
Probability: 99% (predicted by ML model) - Protein sequence
-
VAEVIKTVIDVELNSGQFASQLRSLQQQINSFNLTLNKNQVTQGQAAKFFANELSNAVNSSKFFRAETVKMQTAAAALDSTLRKGQGTLGQFFSAAFNKRSAMAAETFALAAERARTMQTQFIATGKASKGMQDALAIRPLTAFISESSIAGQRMQILNSMFKQGTTQLINFGKNVQWAGRQLMVGFTVPLTIFGTTAGKIFMDLEKQVVSFKKVYGDLFTTPAELNENLEAVKGLAAEYTKYGIAVKDTISLAAQAAAAGRQGAELTDAVTAATRLATLGQMDQNAALDATIALQSAFQIGGEELADTINYLNMVENQTVVSLQDIAAAIPRVAPVIKGLGGDVRDLTVFLAAMQEGGVTAEQGANALKSGLASLINPTKAAKETLSGFGINLQEIVDVNRGDLMGTVTTFATALEKLDEFSKQQSLEKVFGKFQYARLGALFENIVRDGSQASQVLETMEYSTEELRLTAEKELNTISQSFGVQLIAAIEKFKLAIAPIGSLFVKMAIPIVQFITKLANWFNSLPEGVKNIAAIATVITGLVVPAATMMFGLFMNLIGTLAKMSQGAAIFGATLLKKGPIAALKTLSQSTKYLSLAEIDAANSARQLGSATGIANAALLQQVGAATSANIAIKQLTTSYQMLIAQQLQAGSTQPLAFSAGKNASEIAKARPKTRIKGVGFFNKGNIVPGVGNTDTVPAMLTPGEFVVNKESTKQNYDLLTAINNKQVSGFNKGGKVPGMQYFAKENTQRVVQNQDFIPPSQSNNTRGPDFIRPSRISSNIQNPEQINTLVSQHFRGYIFSSEKAREKAASAIEFLQDKPNDLEAFANEVKRLTRNRVEGKQFNINTTSLNSAMTRFAVDPSSVVKATEKSHINSVIHTLEKDLLVADGRTIPKGTKVQFLNNRVLDFSDKLNKKLSKGGATVAEMQAEIAEDIFQTMNFQAERYADELRITGKARQTFMADFLQSKNKVRASYQDILSKLPQNARITDHGLPVDGVGTIKLGDISDVAFSEFGSKTRSKRQLQLFNEALSQNTNVRIAPMQEIANDLYELKGKTGNVNQFAKMSKLELIKETYKAQNIDIKEEPYASFIKSITKISKSTGEEVFDQKQYTSANSFDPNDKAFVKLSERAKNLISKLVFRGSGNIATVVRTAFNKGGQVPGVQYLNGGGRVATGLWEAAMAGITTRRKYDFDKFPIQTLKSGMTAGRRTSGTRPLNVEIYDPITKKTIGQYDVDPNNALEKRSLLNTLQVEKSKGFKEAYIRGIGEGFNKGGMIPGVKYLNGGGPSGKIKSTKNAVEEIFGTDSANALSLFGISLGTRTGSKALGKGKPKDLLDLISSEEKRYMVKIGDQRIASFMKLKKENGKIKKEVIKNDGGYPVRDYKVWEELNPDDILEIFSEKSAIQKFNRGNIVPGLGNTDTVPAMLTPGEFVVNKDATQKNLGLLHAINAQKLNVGGKVKGGIQYLNNGDLVAQVKEIEKYKRENPGATTPQAVRALGIQTGNTTKAVRPRGAGAAGMAAGVAASLPFMGEGSQEQFGMTGSMVASFIAFEAARSLVTKSFITLSKNAQAGAKGLGIVGKGGQAIQKQFVQMRKYMQFFKMSNPYVLAVVAATAAVVGFTKLHNNMKQAVSKYNNALYGTSESLENIAKAMGRTTPGQREAQRTSELISGQRATEEEKAMSSQLMQSEQGQSMLQDLQTVKKMGGDQASALRNQLTQAVMSGVLTPSEARQVATDIGIALKNETVGINAVAQITQLIGPEGELIKGNRIEILSQIAAPTNPEEITEQVNKELKRIDDAPFWNVFSKQYDFGLAMQQIVKVLVPNDVLAAQMKIETLQNNIIENALKEKEVRASLREEYVKGAISYEEFANQTEQLNTAGGSLGIEKQLDALNAKNLSFNAAAGAFGTGNLGLFRDPVARLTGAGPMALNLAGKDVNVKAAEDLANKVTRQFNEEMTNVGIDEAEQQEILKFVTDEFGGNLADQAIFFGDILSGKMSLAALTFVQEMSKTNTDFANLVEKNGLEATTSAFSPLSSNPELMQEMAQRASVSGNLGTVEKVAENYKMLEGLSPEITKSLKIDITNPDVVNKFGPLAEDLNRNWNMLDSLNPQLDKEFVFSLIAVDANGNPKTAEEIEKDAIRLNKAMADLSSGTKEEKRKAQLELTALYNGKDVSDPAVQENLNQLKKDIKGFDDFDAATQSKLIEIDSQIDAKGIEISALQALKDRGVATDEELRKLAQLASEKEKLEKDLASTASSARVAGAGEGDKGQESAFQKTKKSFEETIKYQKAIVQLTKQGVSGENIALVDQASLLEMSTFERKKAINMMKEQQDVQKMLSVMLLSDEEQRIRLQQNAVKVKDKEISTLNRQIEDVERLNRVENQRIDKLQRQNELDSRQVAIRNKSLETLAEKEKDVNSAYDLRTKALDRVAQINDRLSRQRQDRIALASALTSGDFGAAASAAETMSSNFASSQVEDTKAALENQREKELSSLTAEVNGQLFTRAEIEQQIDDINERVYQRNLSIQNLQDIIFNREQELEPIRDAVFAREGERLQLARDLEDAEYNKWKTEMDGINASILRWNQYWKAKRGEGGTVLKDEYVKASSTSTNKKTEKKSFGGFIRAAYGGMINYRGSKESPPAIRANLGMQVPGTGITDKVPALLTPGEFVVRKSVAQANLPFLKSLNSNVFPEMSSIEGGGLAPVISDNSTSTMVSPVYNNYSVNVNVAGTDASPNEIADAVMNRIRMGQQRQIRGIRR
- Physico‐chemical
properties -
protein length: 2754 AA molecular weight: 299830,5 Da isoelectric point: 9,06 hydropathy: -0,32
Representative Protein Details
- Accession
- 4Co56
- Protein name
- 4Co56
- Sequence length
- 3100 AA
- Molecular weight
- 336670,93210 Da
- Isoelectric point
- 9,31937
- Sequence
-
VAEIIKTVIDVDINTSGAAAELRSLQQQINAFNLTLNKGQLEQGQASRVFAEGLRNSINTGGFFRAELVKMQTAAGALDSTLRKGQGTLGQFFSASFNKKGGMAAEVFALAAERARTMQTQFIATAKASKGMQEALSVRPLTAFSADAAVSAQRMQILNSMFKQGTTGLINFGKNVQWTGRQLMVGFTIPLTIFGTTAGRVFSDLEKQAVNFKKVYGDIFTTPAELEENFKAVQGLSREFTKYGIAAKDTLGLAAQAAAAGRKNTELTDAVRESTRLATLGQMDQNAALETTISLQSAFRLSGQELADTINFLNMVENQTVVSLQDIAAAIPRVAPVIKGLGGDVKDLTVFLAAMQEGGVSAEQGANALKSGLGSLINPTKAATDVLAGFKINLDSIIQTNRGDLMGTVTAFSDALSTLDEFSRQQALESLFGKFQYARLGALFENISREGSQAQQVISTLGYSTEQLAKSAEKELTTVEQAFSTQLTGAIERLKIAIAPLGEIFVKMAIPLVNMATKIFEAFNKLPEGVKNVVALATALGGVLLPAATMIFGLFANLIGTFAKFTHSMGMFGVTLLRKGPLAAIKSLTQSANYLSLSEIDAANAARQLGSATELANAALLSQVGSANSADIAIKTLTNSYRVLISEQTRASQVQPFLFGTGQAASSRVGGSAAASVATTMATRGKSRIKAVGLNQGGSVFTPDNSSTVPGVGNTDKVPAMLTPGEFVVNKESTKNNLGLLHSINAQRLNVGGKVKNGIQYAMSGFLIGNTPGYSAATNAVAAKLVGRSQVAKTVGAKASNKVEQLKNNLVYLLTKSNNQQLTGVRTMKEFDKRGVPLDVLKQDMVSEEGLWMIDAFAKGDWKKDAYKATLENNFANIASKYPGKEIRFLDSAGLERVQPLLGKPEYENVQFVSVKELQNKELFDNLTPEHLETLMTDGTQVSKELKYSKSRGTTTLKDVYRGIPGMESYTRIQKNWRGTPKPKNRQGAHMYNMGGKVDGVQYAMAGQLIKKGTKKFKTGKVKHVRKNLKARDKEILQDMVNNVTWEKRVQGARLNDGISLYNQGYSLEEIDNILIEIGYFKKGLTDLAPMSDAVKKFAKPEKIKSSQIKNKLRQAAPTEESRKAVSEYFLKKGYIDEELANLINGPTMGAGVRSHYAEDAIANAKKSSISFPGYLGQAIIEPSKYNSVYNQIGVSGSPKNKADHDKLMGYVQELLGIDKRLIKADFDVFSMMNKGGQVPGMQYANTGKIIAAMFKSKGRNELIDRLSLSNLPSDVKNKIINGEIKKRGDFTLRADRSLFDNSISRTISSNPSKDDLLDLLTTDYNNLYSNSHLYSIALNKNADNDVLNVLATQYRNRLRYAKPEEVGTHLNQTAKIIQDKGIKLNKGGMIPGVQYANKGKQIQEVMSKVFGINADEIAPMFGALPQLRTQFLNRPKPFKKISEYVKDKSFIDNDIYFIHPRGGEPHEARIFMGTDGQLRKQRIGYEPHADSMKDPNFRHGGDPRRYKEEILDPNFEAAIYNKNEYKNQGAETLRPKKFNKGGMIPGVQYLNLGGIGSAVKRSLDTPSRRSRQVLRSLLKDSDEIAKSKGSKGIFDDVKKSKQNFLDTPLSAATHAVGQAYANQVMPVYQSLLQKGIIKPGQFNSFDDFMNFLSKKSLGNRSTSGQKQDPLISKFNEEFMKINHGLGPDDTLRFYRNSRPGGRQAEGSNPKVGYYSLDREMGWGYGLASDISIGGGKRYQIDLRPGDIPGPIMSGGYADEFAINLDAILANKAKEVGEMLPNVSKRLTGQQGMTRFFNDKKFYKDNDLSTMKFNKGNIVPGVGNTDTVPAMLTPGEFVINKESTKKNYDLLTAINNGKVDGYNKGGGVANPMRMFYGNYTKEEIKARAKSIKKAGGVGTLPKGIGAVGMGAGLAASFLPSMFMSEETSMKASMAASIAGYAVASKAATIGMLALSKQTITAAKVFPQLSKGIPALTSALNISAATLFGVIGPIALLSIGIIALNKMANNAVQSGGELIKSMYGSSDRMKEFAKAFGRENTQQTLARQRAELAAGAPIGQEAQQYSTQFLESKAGATLLKDLQNVSKVSGTAGRDQALLSQLTRGIVTGSITAEEARAIALDVGKKLGDQSIGIKLGAQISDLVGPDGKKLTDNILKINAVITPTIDMNEIGKNVSKQWENLGIGSKLGMILTGKGTDDMIVDALAQAAIEANAITSEQLDLLRLELESGNISIQEFTSKKDAITAQSNKTTLDAIETINKKYGEGTDKALNALREFRKEFEKEFKTKISVLSDEDEKNAQKAQDILRKGEAASTNPLPLGQQVVANVATKTTDEEKRKYLTELGLDPSMAVLDPQQMLTAIEQTFTADQKLAKIVSEKLFGPQAKQELGTKAADALALLDPQFNAEIGKLLDEDKTGQLKEKLVNLFEVDPETFAKFQNLFDVGGPDALTKALTMGEDELSKYLDNVSKLSALPKELGIDTTKLISDPEKLDAFVKNIDMATKSLDLLKEGAKKGNITQKYVLETAFAGNQEAQNLLNYYLKTKKFKDIDLNMVLGASMNPEIAKALVIMKKLEEGEGANVSLAEAVWASNTLSNAQVAPGGGKSDNPYVDDGSGTKSALQTAKDAARQTKEVIASQSKLLAAGISVEALEGLSPEAIIELGKQSGKQLRDNIKLFNDQAESIRINKLVLQQLALEENPIKKALEDSKKTVEGYDDQIKELNKTIEKTNRANELDRRRIEDKNKALENLTKKEKNVNDQYNERIKALDKVANANDRVAERQRQQIDLATALTSGDIAGAAQAAAAMTQTGAQNQIEDTKTALEAQRQAELDGLTESVNGRLMSRKDIQAEIDAIEETIYQRNINLRFENDEIYKIQEKINQEKVRQQELNDVLAEQDRAEAKLATNKLTNAKNITAETKKQRDNAIAAAYADYKKLVDPRGTNKNLTLSNYKTDILGLAFGGMIKKYAFGGNVGYKGSREAPPRLKMARGSLVPGLGNTDRVPALLTPGEFVVRKSVAQANMPLLKALNSDVFPSMSLNTPDVAPTVTSSTSTILNNTPVYNYSISVNVPNTTASPDEIANVVVSRIKRSMDTNIRSNRY
Other Proteins in cluster: phalp2_30333
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_23044
49sYk
|
5 | 26,9% | 3441 | 1.473E-308 |
| 2 |
phalp2_35139
5Di68
|
10 | 31,4% | 2245 | 2.209E-282 |
Domains
Domains
No domain annotations available.
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Unknown from Metagenome [NCBI] |
UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available.