Protein

Protein accession
6m6kz [EnVhog]
Representative
1emlW
Source
EnVhog (cluster: phalp2_28353)
Protein name
6m6kz
Lysin probability
97%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MIVKVGRKMKEPFEKFGKFIEKTETLEEFWQGLKRGAASAGKDLDKLKAKFQKFSGAAKAAFHTLRRTTFPQLNTIYHAFNKLSLATKQFGANLVHVSKREAKFMLDAFVSLGNAIVTSFHTAVFKARVALNKLSGVGKTVLGFVKSGLSNIGPLLGQVLEAAFFDAYKGLSRLSNWFASLVANPLIQGLPRALAPMASKIAGVFALLSLEAFRHLPALGRMFTALGQHVTTFVRSARAKLARFFAFFGRIGSFILRPFRAVFGKIRGLFSSLIATAKRFTAPLIGAFGRWAAKSKAVFGILKRGFARLGQYVAGFARMAVGMFAKIGGILFQAIMPALMAVGAGLGAMAGQAAIGMVMSLANALISVAGGAALIAPGLLMAAGISFAALKIGLDGVKEGVKAAFSAESPEEFEKAIEKLSPSVQGVARSLREFKPMWDDIKKATQENLLQDLGPEMGKTLQNLLPTFGEGLKGIATAWNGAFKGAFAELQTDQARTGLQTIMQGATEMANNMQPVLANVIAALGSLGEQSAKYLGGIGTYFADLSQRFRDWAEGLKQIDPSTGMSKFDSIIQSAQKNASLLKDIFGGIFGVIGNILKASSEGGGGMLAGLAEGAQKLKDITAEGTPGFQALVGFFQQASNAARELATLIEPILTIATSIGSALAQVAAAAIPGIKVALDALASGLQPLMDIAPRIGQMLGDAFAALGPALQGLGAALAPLIEGIVSGLSIAVQGLGQALTPIMNALGPAMEALKPVLESVGQGLSAIFIALEPIITSSINLISQLMPVVQTVMDLLGQIAAKALEVIAPLFTGHDSVIAQLVQALEPLAQVLGDAILKVLDALAPVIPMISDGFGRLLAACIPLVDPIKEVIDLLGRMLVDAINWLKPLIPPLIDTIVAIGKACVDLVVPAIKIFMGIIQAAWPIISSVIEFAVKTIIAPALELIAGACKILGGVFGWLVNNVIIPLIDIWKAVMKGVGEFISWVIDNLITKPIEGLEGIFRKAVDMIKGVWNGLKKIFSDPVEFLVNTVYNDGIVALWNKVAGFLGMDDKKLEKFSYASKYASGGVLPGYTPGTDVHKYYNPYLGWLYLSGGEAIMRPEWTQAVGGPAAVEAMNKTAREGGVGAVRRMLGQGAAYKKGGTIDLDKRIAELFRELKPEHGKPYQYGGTGNPSWDCSGIWSGITQFLNGGDLRGGRIFNTESNFESYGYVPGLSGRVTIGVLSGQGGGTNGHMAGTIDGVNIESGGSNGVQIGGLAIGSDNGMFNHTYTLKEFLGEFVSGGHGGGGFVNLVLQQVMHAITAILDPIENLIKEKLGGNGWKDLQAGLAMKILSGVKDFALDKAKAFGGSAGVAGNAESWREMAKAAMRRVGFNADDPRQVQAMLEQIMDESSGDAGTAQRIVDVNGTGDAAGVGLLQIIPSTFEAYRDPSLPNDRRDPMANMVAALRYYRARYGDDLTTRWGHGKGGYDKGGHAVGVGYMPKYTLEPERVLSPAQTRAFDVLVYRMLPAYIDEAKKKPFDFDSNFKLLVKELKGLRSDLDRDRDKWIDAQSDKILVDYRNHAEKKVKLDPVDLEKLKNQDKKEVEKAERHWKKADTAVRTATYDPQAYLKAEEEAKKRLDKEQDEKKQKEREARKEQRKKEREARAEERKKIREGIQEQRKKEREARTEQRKEERKERAAERRQESKERRNQRREERKQLQGDRKKLTDDEKKALEEKTDAENDALREQREAENDAISAQHKAENEAISAEHKAENDALKAKWKAEDEQIKKQEEAEDKKREKEEKEEDERINKLKETGEYYYGYKVLSADGNNPYAHEETREEKIGKATVQKVGESVGLGGLANALVEMYNIVVDTNNDVQAAMPAWKAAAAGDPSGLAHNSAVIAEKNNKQLESDLEGFIPGAIASSLEFAFSGSWKAGREAPLVGTINTGISKAELRQELDYLHSKQRRSVARVR
Physico‐chemical
properties
protein length:1957 AA
molecular weight:211537,3 Da
isoelectric point:9,10
hydropathy:-0,12
Representative Protein Details
Accession
1emlW
Protein name
1emlW
Sequence length
2198 AA
Molecular weight
239870,81750 Da
Isoelectric point
8,49186
Sequence
MAYLLGEAAIRISGNAQRLHADIRSAVEKGKKNQVEIPITGDSDSFDMTVQDVEDEINELESREVEIPIYFYIDQASHEHVIRKIDELENRDAKIDVFYDVHNDYADMAVQDFIDKWASDRGRKIKLPVETDIKMPKMPKRFPGAIPNSAQQEMDEYFEKFFRYGDRSMNVLNVWEHAWARLHSGIDRGIYQVYNSYFKLFNDVGAMIAHPIDSFRKMKELAYDTSDENKSLARDIREVSQEWKRLGARIKNAMVGAKGEIGITQRAFQRLKASLADRGVFGTFKRGMMKPIEAVGTGFDKLRVKWQKFRDAFAQFKAPPGLATISRLFGRGFSTAIDKTTASFRRFGSIFPAIGRSSKRAFSETIESMKAATSMAARMTGAGMENMLLGASKFSKKLKKSALGGADFMDNIFHSGKYFNSMVHTASGAFAQIKTKAFNSLGFVQQFMTSIPKNFRGMRRLFGGALTAFNRRMGRTFRTVGAAARTTAGLMGHYFGRSLGAITKLSGRAFGGMLRQFGRIGRNVKVVAGAMTTLFGGAVARVGAMMARMAASKTFSLMGRGIRAGFRQMSKYAAGFGNIAMGVFARTAGILMQSLVPAIMAAVGGLALMGGTAILGGILALAGAIMSVVNGALLMAPALVAAAGVSFATLKMGLKGVKDGVQAAFSAETVEDFEKAIEKLPPGAQEVARSLRDLKPVMDELRTNIQNNLTEGLGEPMRQAFGNMLDAAKPGMEKMASSWNKSLKNMFNEMGSDRASAGMTTIMENMNKMSTNMEPVLGNLAAAFGSLAEQGSKFFGGFGEGLANMSESFMNWAEGLKEIAPGEEMSKFDKAIESAKKNAKLLGAIFGGLFGTIGNILHAAQEAGGDGGPLYGMAVAMQSMKQATEEGSAGFEKIKGLVYASGDMAKELGKVIGPVVSGLLDAATILMRIGQGALPGIAKVVAGIADGLATWKQYAADFGKNLGDGLGSLAPFINNLIAALGPALDGLGEGLGKALEGLAEGLAPVLNGPLMEMGKNLGDILNVTGGVLGDVFGALAPYLESTFNILNALLPVITDVVSLVGDVLVGALDALSPIFTSHDDALKSLLDSLKPVVGIIRDGLMGVIEALKPALAVIGEAFGKIIEAAAPLIPLLGEALKQAFVMIIDVINWLMPFIPPLTDIIVDLIERGVKILINVFQWLLGVVQTVWPALSTVIQFAIDYIIKPAFDILMGAIRVLASVFEWLYYNVVQPIFTALGWFIEKTFQAIAWAIDNVVKPALHGLKRAFELAVDGIKAVWSGLKRIFSEPVKFFIDVVINKAVIGAWNKVMGWLKQDDKKMQPVGNAISEAGFASGGVLPGHSRGKDNKRFYGEDGSILNLAGGEGIMRQEFVDAVGGRKGIARLNDDARHGRLNRINPKAMSDKELRLDRMRKQLLNAHRAHGAHEQGYAGGGVYQPTSTEMAQLGGGIINRSLLIALKTAFPNAVMTAAKTDHENDGGYHPRGAAIDIGGPMQQIANWIFKTYPQSAQLIWGPGPMILNSATNGRIPNSDQAGIRAAYTEPVVALHYNHVHWGADGPIDSDGKMVSMDGVSGGFFDVAQMVADWIQEHVISKIVDPVKKQFDGKLAEWGEYGHMAIGAGKQVLGSTVDYLKDFAKEHNPFGSISGAGGANIDISGVAGSNLQIGQELAKRAGWTGAEWEALKTLWTGESGWDNNAQNPTSTAYGIPQFLDSTWASVGYQKTSDPATQIAAGIKYIKSRPDYGTPSRALALWNSRSPHWYDRGGMAGGIGHMLKNTLEPERVLSPAQTRAFNDLVYNFLPKLGNQILRNPADFNGHRRMILNKMDEIKEELAKERRQRVMDMSRYVKGVFQDRIDGKPRQNRLDDLPQGLLDMPKSFDDLGKKINQAQGWADANMPRLQKNLEGAYKQAQMVTYDPLGYLKAEEIAIERIEKEEEAAKKKAQEEEQKRQQEEDQKATEDRNKAKEDELKAIEDRQNKELDGVDNDDEKKRIEDKYKKEKEAIDKKYQDESDAISERRNKERELEQERKQAEEEHIKRLKETGEYYYGYAVRQQDEDYKYEQGWQEKMGRTIGKSVADVFGLGDAYSSIDAQISELQDIGEQVKVVAPSWWAAANGDYSGLNHNIAVASARSDERNRQALEDIAPGAVAGIIEMSSTAAQNKQYAPFIENVYTGASGPELEQALSHYEGVQARRNRGTTRTR
Other Proteins in cluster: phalp2_28353
Total (incl. this protein): 45 Avg length: 2104,0 Avg pI: 6,94

Protein ID Length (AA) pI
1emlW 2198 8,49186
11uAD 2130 9,40557
1HJvY 2074 6,09816
1cxao 2074 5,96687
2biGm 2074 6,03774
3ceMj 2135 9,40337
5KjXt 2125 9,36063
5QWQa 1551 5,85847
5Twfr 2125 9,38236
5oNnX 2559 5,23620
62nQs 2127 9,35161
6YeZX 2085 5,98591
6c84H 2192 8,42062
6e4dI 2074 5,96471
6eis1 2192 8,42455
7ABjA 2143 5,43059
7ABl8 2143 5,58877
7AevS 2143 5,50244
7Aexq 2143 5,44548
7MW2c 2074 5,94845
7MuqR 2193 7,06090
7NACU 2018 5,84228
7NADO 2192 7,82422
7Nu7N 2132 9,37114
7OPDN 2074 6,00188
7OXrT 2137 9,37836
7PxEo 2127 8,23901
7Zyyl 2137 9,43380
8LmbN 2143 5,42752
8r3n0 2136 9,39428
HGPG 1526 5,35113
HMJA 2074 5,89741
HNeY 2074 6,00091
K64x 2074 5,96613
Kyby 2074 5,96613
NaGi 2192 7,82422
Nu1h 2074 6,00131
A0A3G3LWE3 2143 5,43059
A0A4Y6EKH7 2143 5,58877
A0A2H4P8D5 2143 5,50244
A0A2H4P8I4 2143 5,44548
A7IYB6 2127 8,23901
A0A2H4P994 2143 5,42752
A0A2H4P971 2143 5,42752
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_3995
7AGUw
47 21,1% 1585 1.893E-76

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1emlW) rather than this protein.
PDB ID
1emlW
Method AlphaFoldv2
Resolution 45.07
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50