Protein
- Protein accession: 1gzyW [EnVhog]
- Representative: 8N0Ls
- Source: EnVhog (cluster: phalp2_14941)
- Protein name: 1gzyW
- Lysin probability: 98%
- PhaLP type: VAL (probability: 99%, predicted by ML model)
- Protein sequence:
MALNKRYQAGTAWIKVAPDFRGWKDDLQRQVENSAKQLSARVSLDTNDARDTLRDFERQNAKVEANVKLSQSDLARVREQLAGLHKHKRDVTIRTELDNKHFREQLREVRHGFDKLAIPADVKLNKKSVTRTYDRLQKDFAQRAAEGVSFDSKSLGKAFESALRVDKSRADIARVTKDLEALNKVEEEFERKRKDYALGRINDFKEVTRLAEEYEAHQAQIANAQKRLGKYTRQQNAAQRKLNKDLKVAEELSRSSAQGQSGVDPRVTRRVRTMSARRLEMQATEKDLARLEKAQERYNQKLRQYQKNEISWSEISKLADEWAIQEQAADKARKKLTEFTEAQNTAQRKLNRELQKARENSEQAMAARRRPELAAHPEDKDVVYVDRVREAARSSKVGNSKLEGDVVALRNATQSARALDSANHNLARSVLTLEKNQLALAAAQLEVNNLRARGNISSKNGAAALDRESRAFHAVTMSARQSISARRAVDGLRESYNETANVLQRRIDLNPIRRFGDVVQNTVARGISFFTDRLILAGRLISSVGAVGMAAGAALAGLGLVNMIPLVGSLSQVVGVLGLIPGLAATAGAALGAISVGMSGIGGAFKAAGKLSDALGAGGSDSRAAQTARKELDKANRAAAKTAVQGARSIADAEKGIQRAQKSSADAQKDLSKARKDAQRQNEDLTESIRDMAYEEEDAALSVEEARERLGEVLRDPESTANQRRRADLSYRQALEQQRDLRREHGRTKEEYADAQRKGIEGSDVVVEAQERVREAAEGVADAQQSLAQAQQDAAEANAEAMERVAEAQENVVAAMGGGDAAAKALEEYNRELEKLAPNAQGFVAGVYGMRDAWMDLRNAVQQRLFRGIGEDVHYLAKEQLPLLEVGLGKTAAAINVGMRHALDYLGSQRATMDFTSIFNNSAEATEGFARSTSNLLQALIPVAEVGTRFMPWFGRSLDDTTRRWRVMAENAQDSGAMEAYFARSITRTQQLGKLLGALGGIAKEAFRVTSGLGLESLTSIISRLQQVRTDLQGESGTRLASNFAGVREAFHSALNLAGALVQIVINDVVPAFRLIADVTGPIVQAIANLGVLISEHIPFVRTLLSLYLAFKVINMTMGGLGRAMDRLRPKVEGHVPLTQRLRDSWNAATKSKERYSSQTTRLGSGLQAFGVQNGKIQALGRSWDNAANSAGRMSTVMGGVRGAASSLFSFLGGGWGVAFAGLAVGYMLWDSHRRKIKEAEEAARKYGVAMEKAGIKIREAIRESRGDTGSQVLTATEEEFETRLTEIERTAKDGGPSFLDRVNRVVNGSNALDKYLLTDRAQGLTRQVTTEFDAKVDAKKTAEAQRNLLAKLKYDASDLARAVAGTEQEWRKYDSTLRANGETGKQLADIIAERRGQIQDEARLVRDLTPGYLDLVDALDTLGDSSSSTADKFDALRRAFDSLIPGNEQQEALAKYGETIDEIRKKVEQVDPKGGMGAELFDNGVLDVANSANARTLNQSIMEGQQAVAEAQMQNVDPEKLKQLWANYDSAIRTLGGSMGVADSTMDDLLTKMFSTPDQVTSVVVLENADPVSRDLGAISLMLQSLHNQGKPIEGQIEINSTEAYEALEATGAQLEHIEGKQWAVKFDTEESFEDFQRLQAMLTTWNTVKSSATVDLDTDTFKLKEADARRLLGVLDQYKSEPWATLLHQSLIDNQGGAIQLLDELAGHPAAATARLELDQLRRDVLEATEIIDPRNWVLLQGGAFTGKPEDKSPVTDPFDNDTGIELPGWMRDGDQPAPGANKPANKPAPAAAEPAPAPAPAPAPTRAGDVPAGAHRRADGQLGYVKNPKTGYAAARGWEWVPKGDGFVQKKRDDYSKVLKQRYLGGRLPMLGLGGRMPTSGPGTERRDGFLAVGPDGMPQARVDGGEWVINSRSSEEWDWLLAAINSGKLRGFAAGGSLAGRKSGSFGLGAVGNPFGSMANGVMGLGGAFGQAIDGALPAWKQFGSTLASTAANFIQ
PALSSVNSAITQMGIQFPLIAQTRVTPAWLNMANLLNTAKQSIIDPMFAGVGTRLDDLTAHFPAAAAIIEPTWLNLTSRILAAKTGTIDPAFTGVKEGLRNVQSAFGIAVPAIATQWDGIRSATADPVRFTINTVFNDGLVGMWNSVSDLIGSKPMPRYTAKFARGGVLPGNTPGVDIHTFYSPTLGELHLGGGEGIIRPEAVRAMGGARAIDQINKDAREGRLHPVKGNPAHYANGGVFGRFARGGTISGGHIQGGAEITSPIQRVMWDAVRTAFPNVNLVSGTRYADVGSGFDNHMGQRALDLTGPMPQIARWIYQLNKTQPVEELIHAPLGGWQNLKAGRPLNYGAGTDADHYDHVHWAMAAMRSFAGRLVSMAGGGAPGAPMQSPADMVKDLLKPMKAEVQQKISAKKFPGMMGQLPGLIFGSLSKQIESQAVNLAQMSGMYSGPAVAPGGSVERWRPMVIAALRRNGFDPSRRNQDLMLSQIQSESSGNEKAIQTVIDVNSGGNEAGGLLQMTPGTYEAHRDPSLPNDRFDAWSNMNASLRYYRREYGDDLGTMWGKGHGYDRGGMWENNTLGWNTSGKPEAVFTNKEWLLVDKLVRALTKPEMFRALTAQPTPVAMGATPTPEKLRYTPPEEPKDEEVVEEKASPIEFAPAKIDPETSQPYEVDPETGVPIDPKTGTKFTVDPVTQQAITTDDRGLYIDPETGKPFYGTIGEKAQNSATTKPAIEYTPAKVNPETGQPYETDQNTGVPIDPKTSKPFEVDPATGKPIAKDERGLYVDPETGKPFYGLDAERYELNQSDQDDTFTFENNADELGAQSDWVDEYTGPMSDVFKKGKLLGQLGHALAQPGAIQRIGNDKLMNTQIQDAKKRQDELNLYKSDQAQKINELRAQGKHDEAAAIEREAREKIAGMSEMPGTSKGTQAMLATMQENPGAQWRRESEDEWRKWAGDNWAGMAETVVAAGAGGAANSGAGQIAGTVNINTTDLSGAFREMDRRTKRAARSNSRVVRR
- Physico-chemical properties:
  - Protein length: 3014 AA
  - Molecular weight: 327762,6 Da
  - Isoelectric point: 7,01
  - Hydropathy: -0,50
Representative Protein Details
- Accession: 8N0Ls
- Protein name: 8N0Ls
- Sequence length: 2765 AA
- Molecular weight: 297270,64770 Da
- Isoelectric point: 8,21651
- Sequence:
MADKRYQAGTAWIKVSPDFRGWTKEVKDQVNDSLNNLQARASLDKSSARATRRDLDKALEGVEANVGVALSKRELKAAKKDLDGIAGKHRRIAIVQAELDDARARKQLREMGNDFGKMVLSPKVQLDPKSISKEFDRLQNQIAKRANAGMVLPPKVTRDVLDSALRADLLNKKLADARKEVEKIEKGRDKIRKALNADRDQGKYSFGEVRRMIEDELKLERALKKARLEFDRAIKSQVAANNTLARSEAKMIRSLKNESARVRKPKRADSIIAQPGDQANFSDRVRQRRESSAGKSSLKEDVVAFKNAADFAQKLDREIAKLEASEVSLAHANRTVIETQNRLNEVIEKGISTSKQGVAANIAHTRAVQQLERAAVAKDNQELVTTRMREDYNETAKVLQERLDTNPISRWTQQLELASSRSITAFTNRLVYAGRVVSSFVVLSSAAGAALGTLGLINLGPLVGTLGQVAGALGILPGLAVAAGTAIGAIALGVSGIGGAFKSATKLSDAMAAAGGPGQSSDNSKAVRNAQRGVAQANKAAARTAANGAKQIADAERGVQDAQKSSQRAQKDLTQARKDAAEQNEELKESIRDMALEEEDAALSVQEARERLGEVLRDPDSTGTQRRRANLSYRQALEAQRDLRREHQKTKTEFADAQKKGIEGSDVVVAAQERVAEAAEGVAEAQRGVAEAQQNAAESNADAQERVAEAQEALSEAMTTSAAGGSAAQKALEEYERELAKLHPQARALVKQVLGMKDAWMDVRNSVQGRLFGGVATDIQELATKQLPLLQKGLGETAEGINVGLRAALDYLKGSKATADFTSIFDNSAEGVEHFSVGLTNLLRALVPVAEVGTRFMPWFGRSIRDTTNDWRIMAEAAQADGSMEDYFIKAITRAQQYGRILRDLGGGAVAIFASVTGLGEDTLNRTEVRIAAWRERLETELGGEGVVNWFTKVRDLFSTILEITGDVAAFIIAHVVPAFSAITDVVAPIVGGVVHLTTAFADMFPIVQHLLTLFLALRIVDTVIGVGRGAVVRMNAALATQTTAMTTMRTAWNQATVGVNGYTGALARAHIMQQRLAMSPNPMLRQVGQYGAMTRAVGGATAALGGLRAAGSSLVGFLGGPWGVAILAVIAGLGMWYSATQKTKQETQELEERTKRLTKAHKDYNRAVVESRGRRDPQVMSAAEAIATERIDQVDSDADDKVGFTDRFGDLFTKEYWSTDTFFGDDMQSKNIEKKEQEHALAEKQKQLLKDLKIDAAEVGRALAGTDEDWVTFRNRLTASGEAGRDLAATLDADRVAMSEQARVIEQLDAGYLDLHDALSVLRDTQADATDKATALRQAFDALIPGNEKMEALSKFGETMKSLRDAIASVDPEGGFGDELIGAGGQLRTTFANANTLLQLIKDGQQSIAEAQMAGATTEEVTQMWDDWRGSVLQFADAVQIPRDRMELLLKQFAATPEAVSTVINIEGAGNAAATLDEIKLKMDTLFAQGKEGKGTFAIDERMRNTLREMGAGVRDINDVYAEVTFTNDGVYQSFLKLQQAVQGTNTDLDWLASKVQGMPAGKKIIIEDNSPATVARLDALGFKVKTLPDGRIVAIAETETAQTALRALEVSRSTTITANVVKGSGWEHIHDMMLLTENPEEWKRRHQQRQQNQQMPGWGGPWFNNKPPGGYRGMRLPGFANGSRMPGTGPGTEKRDGIYAVLPGGMPIAMVNGSEWVINDKSSEKYDWLLNLINQDRLPGFAVGGPMGGGPKKKQPGLGAIGDPFSAMAGGVMGLGDAFGTAIGTAIPQWQQFGTTLASTTSNFITPALTGVHSQISTLGSLLPTITQSQILPPWMNMATQMANAKATYIDPAFQGITSNLTNLSARVPATAGIVSPTWQSMATQIMGAKTGTIDPAFQGIQGGLGNVQSSFATAVPNIASQWNGMREATAAPVRFTINTVFNDGLVGMWNSVADIIGGKKMNPYAAKFARGGVLPGYTPGRDVHKFISPTGGELHLS
GGEAIMRPEWTRAVGGPAAVDRMNREARGGKGPGSTKPGEMHRANGGAVSVGYGMPAGTNISYGGPGFPMWVYKLAQSYGVQASTYAGHQEDNRGEAGYAPNPQGLNRGIDWSGPVPAMQGFAQYLLGIAPRTPALEQIIWMNPSTGQRIGWAGRSPDISGAYYASDYGGHQDHVHTRQSGPLLPGMAGMAGALMGMAGMGAMDIGAMVRGQMDPKAEEIRKKIKGTPFAGMVGQLPSQVFESAYKSMSKKAVDMAEKSGLFSGTVAPGGGVERWRPMVIAALKRQGFDPSRRNQDLMLAQIQSESGGNPTAINLVDSNAMAGMPSQGLLQTIPPTFQTHRDPLIPGGITDPWANMNAALRYYKATYGNDLGARWGKGMGYDQGGIFPHKTLGFNMSGKPEAVFTNRQWKLLDQLVGALLKPKMFDNLTNEAALGVQPKQAPDTKVITMPEEPKDEEKKDDEAGTDDPTSAAGAPAREYTPAKLNPETGSPYETDPETGVPIDPATKKPFTVDPVTKKPITAGDDGLFIDPETGKPFYGSKAEILDQRATTTSPEDTFTFDNPADELGVPEGDWVDEYSGPLSDVFKKGKLLGQLGTALAQPGAIQRIGNDKLMNEQIAAARKREDDLANYKTDTAAKINELRATGKTAEAAALEKEARNKIAGMSEMPGTAKGTQAMLATMTDNPGEDFRRKSADEWKKWLGENWAGIAESAVAGGMGVAQQGAGQAQVIVQGGIHTTNWTAARRDLERRTARQSRANARVGRR
Other Proteins in cluster: phalp2_14941
Total (incl. this protein): 31; average length: 2675,7 AA; average pI: 7,32

| Protein ID | Length (AA) | pI |
|---|---|---|
| 8N0Ls | 2765 | 8,21651 |
| 7AoJi | 2936 | 6,83866 |
| 7PJJJ | 2931 | 6,72010 |
| 7PJMg | 2944 | 5,71535 |
| 7zSrK | 2839 | 6,44363 |
| 8N0Un | 2836 | 6,34195 |
| A0A142K904 | 2765 | 8,21651 |
| A0A7D5JKS3 | 886 | 5,69802 |
| A0A345MIN4 | 2936 | 6,83866 |
| A0A160DI33 | 2931 | 6,72010 |
| A0A8T8JJ17 | 2145 | 7,34743 |
| A0A1B3B241 | 2146 | 8,52983 |
| A0A142KCJ3 | 2139 | 7,70182 |
| A0A160DGU3 | 2915 | 8,76004 |
| A0A160DHQ8 | 2931 | 6,72010 |
| A0A166Y7S1 | 2931 | 6,79717 |
| A0A1B3AZG2 | 2931 | 6,72032 |
| A0A2D1GG46 | 2931 | 6,79717 |
| A0A2H4YHA1 | 2931 | 6,72010 |
| A0A345KTX4 | 3016 | 8,89034 |
| A0A410TBY1 | 2931 | 6,85219 |
| A0A410TD13 | 2931 | 6,72010 |
| A0A4Y5NYU5 | 2147 | 8,24926 |
| A0A4Y6EGC4 | 2147 | 8,24926 |
| A0A514A2X3 | 2146 | 8,52893 |
| A0A514TZ56 | 2914 | 8,85049 |
| A0A7G8LG74 | 2139 | 8,02581 |
| A0AA96GPD9 | 2931 | 7,67999 |
| A0AAE7F8K1 | 2931 | 6,79723 |
| A0AAE7WEI6 | 2931 | 7,26586 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_12415 (8Itvn) | 7 | 27,9% | 2135 | 2.553E-236 |
| 2 | phalp2_29973 (7of38) | 4 | 24,8% | 2186 | 7.964E-176 |
| 3 | phalp2_36243 (7zfxz) | 7 | 22,5% | 2027 | 7.718E-117 |
| 4 | phalp2_12266 (72uge) | 6 | 22,3% | 1999 | 1.090E-106 |
| 5 | phalp2_18324 (5inLv) | 5 | 20,9% | 2027 | 7.011E-87 |
| 6 | phalp2_6553 (11DGv) | 4 | 21,9% | 1987 | 4.646E-65 |
| 7 | phalp2_3732 (5nCFz) | 2 | 22,2% | 2105 | 1.480E-62 |
| 8 | phalp2_3994 (7AFQR) | 49 | 19,8% | 2043 | 9.431E-46 |
| 9 | phalp2_27297 (4grXe) | 6 | 20,2% | 2035 | 4.942E-40 |
Domains
No domain annotations available.
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Unknown from Metagenome [NCBI] | UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available.