Protein

Protein accession
4Co56 [EnVhog]
Representative
4Co56 (this protein)
Source
EnVhog (cluster: phalp2_30333)
Protein name
4Co56
Lysin probability
58%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
VAEIIKTVIDVDINTSGAAAELRSLQQQINAFNLTLNKGQLEQGQASRVFAEGLRNSINTGGFFRAELVKMQTAAGALDSTLRKGQGTLGQFFSASFNKKGGMAAEVFALAAERARTMQTQFIATAKASKGMQEALSVRPLTAFSADAAVSAQRMQILNSMFKQGTTGLINFGKNVQWTGRQLMVGFTIPLTIFGTTAGRVFSDLEKQAVNFKKVYGDIFTTPAELEENFKAVQGLSREFTKYGIAAKDTLGLAAQAAAAGRKNTELTDAVRESTRLATLGQMDQNAALETTISLQSAFRLSGQELADTINFLNMVENQTVVSLQDIAAAIPRVAPVIKGLGGDVKDLTVFLAAMQEGGVSAEQGANALKSGLGSLINPTKAATDVLAGFKINLDSIIQTNRGDLMGTVTAFSDALSTLDEFSRQQALESLFGKFQYARLGALFENISREGSQAQQVISTLGYSTEQLAKSAEKELTTVEQAFSTQLTGAIERLKIAIAPLGEIFVKMAIPLVNMATKIFEAFNKLPEGVKNVVALATALGGVLLPAATMIFGLFANLIGTFAKFTHSMGMFGVTLLRKGPLAAIKSLTQSANYLSLSEIDAANAARQLGSATELANAALLSQVGSANSADIAIKTLTNSYRVLISEQTRASQVQPFLFGTGQAASSRVGGSAAASVATTMATRGKSRIKAVGLNQGGSVFTPDNSSTVPGVGNTDKVPAMLTPGEFVVNKESTKNNLGLLHSINAQRLNVGGKVKNGIQYAMSGFLIGNTPGYSAATNAVAAKLVGRSQVAKTVGAKASNKVEQLKNNLVYLLTKSNNQQLTGVRTMKEFDKRGVPLDVLKQDMVSEEGLWMIDAFAKGDWKKDAYKATLENNFANIASKYPGKEIRFLDSAGLERVQPLLGKPEYENVQFVSVKELQNKELFDNLTPEHLETLMTDGTQVSKELKYSKSRGTTTLKDVYRGIPGMESYTRIQKNWRGTPKPKNRQGAHMYNMGGKVDGVQYAMAGQLIKKGTKKFKTGKVKHVRKNLKARDKEILQDMVNNVTWEKRVQGARLNDGISLYNQGYSLEEIDNILIEIGYFKKGLTDLAPMSDAVKKFAKPEKIKSSQIKNKLRQAAPTEESRKAVSEYFLKKGYIDEELANLINGPTMGAGVRSHYAEDAIANAKKSSISFPGYLGQAIIEPSKYNSVYNQIGVSGSPKNKADHDKLMGYVQELLGIDKRLIKADFDVFSMMNKGGQVPGMQYANTGKIIAAMFKSKGRNELIDRLSLSNLPSDVKNKIINGEIKKRGDFTLRADRSLFDNSISRTISSNPSKDDLLDLLTTDYNNLYSNSHLYSIALNKNADNDVLNVLATQYRNRLRYAKPEEVGTHLNQTAKIIQDKGIKLNKGGMIPGVQYANKGKQIQEVMSKVFGINADEIAPMFGALPQLRTQFLNRPKPFKKISEYVKDKSFIDNDIYFIHPRGGEPHEARIFMGTDGQLRKQRIGYEPHADSMKDPNFRHGGDPRRYKEEILDPNFEAAIYNKNEYKNQGAETLRPKKFNKGGMIPGVQYLNLGGIGSAVKRSLDTPSRRSRQVLRSLLKDSDEIAKSKGSKGIFDDVKKSKQNFLDTPLSAATHAVGQAYANQVMPVYQSLLQKGIIKPGQFNSFDDFMNFLSKKSLGNRSTSGQKQDPLISKFNEEFMKINHGLGPDDTLRFYRNSRPGGRQAEGSNPKVGYYSLDREMGWGYGLASDISIGGGKRYQIDLRPGDIPGPIMSGGYADEFAINLDAILANKAKEVGEMLPNVSKRLTGQQGMTRFFNDKKFYKDNDLSTMKFNKGNIVPGVGNTDTVPAMLTPGEFVINKESTKKNYDLLTAINNGKVDGYNKGGGVANPMRMFYGNYTKEEIKARAKSIKKAGGVGTLPKGIGAVGMGAGLAASFLPSMFMSEETSMKASMAASIAGYAVASKAATIGMLALSKQTITAAKVFPQLSKGIPALTSALNISAATLFGVIGPIALLSIGIIALNKMANNAVQSGGELIKSMYGSSDRMKEFAKAFGRENTQQTLARQRAELAAGAPIGQEAQQYSTQFLESKAGATLLKDLQNVSKVSGTAGRDQALLSQLTRGIVTGSITAEEARAIALDVGKKLGDQSIGIKLGAQISDLVGPDGKKLTDNILKINAVITPTIDMNEIGKNVSKQWENLGIGSKLGMILTGKGTDDMIVDALAQAAIEANAITSEQLDLLRLELESGNISIQEFTSKKDAITAQSNKTTLDAIETINKKYGEGTDKALNALREFRKEFEKEFKTKISVLSDEDEKNAQKAQDILRKGEAASTNPLPLGQQVVANVATKTTDEEKRKYLTELGLDPSMAVLDPQQMLTAIEQTFTADQKLAKIVSEKLFGPQAKQELGTKAADALALLDPQFNAEIGKLLDEDKTGQLKEKLVNLFEVDPETFAKFQNLFDVGGPDALTKALTMGEDELSKYLDNVSKLSALPKELGIDTTKLISDPEKLDAFVKNIDMATKSLDLLKEGAKKGNITQKYVLETAFAGNQEAQNLLNYYLKTKKFKDIDLNMVLGASMNPEIAKALVIMKKLEEGEGANVSLAEAVWASNTLSNAQVAPGGGKSDNPYVDDGSGTKSALQTAKDAARQTKEVIASQSKLLAAGISVEALEGLSPEAIIELGKQSGKQLRDNIKLFNDQAESIRINKLVLQQLALEENPIKKALEDSKKTVEGYDDQIKELNKTIEKTNRANELDRRRIEDKNKALENLTKKEKNVNDQYNERIKALDKVANANDRVAERQRQQIDLATALTSGDIAGAAQAAAAMTQTGAQNQIEDTKTALEAQRQAELDGLTESVNGRLMSRKDIQAEIDAIEETIYQRNINLRFENDEIYKIQEKINQEKVRQQELNDVLAEQDRAEAKLATNKLTNAKNITAETKKQRDNAIAAAYADYKKLVDPRGTNKNLTLSNYKTDILGLAFGGMIKKYAFGGNVGYKGSREAPPRLKMARGSLVPGLGNTDRVPALLTPGEFVVRKSVAQANMPLLKALNSDVFPSMSLNTPDVAPTVTSSTSTILNNTPVYNYSISVNVPNTTASPDEIANVVVSRIKRSMDTNIRSNRY
Physico‐chemical
properties
protein length:3100 AA
molecular weight:336670,9 Da
isoelectric point:9,32
hydropathy:-0,36
Other Proteins in cluster: phalp2_30333
Total (incl. this protein): 9 Avg length: 2700,2 Avg pI: 9,27

Protein ID Length (AA) pI
1APQy 3069 9,18425
4bt8Y 2652 9,40763
4pgsJ 2370 9,46965
52HJM 2754 9,05666
53hLy 2474 9,21345
5dX4V 2637 9,51407
5yywl 2784 9,02404
jVHI 2462 9,24124
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_23044
49sYk
5 26,9% 3441 1.473E-308
2 phalp2_35139
5Di68
10 31,4% 2245 2.209E-282

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available.