Protein

Protein accession
1c3gz [EnVhog]
Representative
1ep70
Source
EnVhog (cluster: phalp2_32490)
Protein name
1c3gz
Lysin probability
90%
PhaLP type
VAL (virion-associated lysin)
Probability: 98% (predicted by ML model)
Protein sequence
VATAAGWAALPVSVSLQGVVGELQQALVKPTEQAAKAAGAAINKNIAPEVDKLDKAVRSAQYRQKKATDEQIAAERKLVDAKSKAEQAVRAVEVAEANRAVAVSKAGTKVADAEAKYKAVLSDSKSTAEQVERAEQALNNARSEEKSTILTQENKLASARDKSRTAALGVEKAEQGVVDAMSEAKRAADSLEDKQKSLKDAQDGVSDSASDSAQAVEGFASSADKADGASGRFHLSLKNVAVGAAAISAAVVGAGKAAYELGSQFDDAYDSIRVGTGASGDALAGLEDSMRKVAGESIGVGSDIGEIGSTLADLNTRLGVTGEPLERLTTQFQQLKGMGIDADINDVTGAFTQFGIEADKAPEALDDLFRISQATGKSITEITSNLDKSGPALQQFGFGLTESAGLLGALDKAGLDSEKTLGSMTKALGVFAKEGKDPQEALWGTIDSIDELTKAGKNAEAIDLANSIFGAKGGAGFVAAVQSGKFSYDDFMGSIGASGDTISGVAEETADFAEKWDQFKNKAMLAIEPVATAVFNAMVPALEKASGAITGLFNFLTQTAIPAVKDFGSSLQDAGKWVEDNKGKLLGFAAAVSPVLVPLLVGLGIHWTALGTKATLSAVAQVRAWAMTKAEAIKSAAISLVNLWKLGAGWIATGAQAMLGAGRVAAAWAIAKAQAAGALVASIAAVGAGWIATGARALIGAGQVAAAWAIAKAQAAGALVASIAAVGAGWIATGARALIGAGQVAAAWAIGLGPIAWVTAAIAAVVGALTWFFTQTETGKAVWASFTSFLSTAWQATVDALIGAWNWVKTAVIDAWNFAWAGVQANWALVTGALSSGWTWLKDTFVAVWTWIKTAVLDAWNLYWGLVQSGFQITMNALTGAWNWMKDMLSAGWNFIRDAVINAFQNALNVFRGVFQGIMNALSGSWDFLKNAMMAGWNWINSNVLGGFRSGLDALKGWFETTVDAIGRTWNRVKELTAKPIKFVVDTVFNNGIRKAWNAVVGFIGMDDKKMSKVELGELGKYATGGVLPGYTPGRDVHNFVSPTGGRLALSGGEAIMRPEWTRAVGGPAAVERMNWAARSGKLTKHGRESAAFASGGVFDLGAFAGGGIIGAMTRIVQQKYPMLQMTSGLRPGDGGNHGAGLAADFSNGSGNTPAQLSLARDIAKTYPNSMELIYDSPGWSNNIKNGQNVGPFGQFYTMGQAGPHHHHVHWAMNTPPTMPFGGGVFAGGSDGGGGGGGFFDFIGKLVKPAWDKVINAIPKYSGSGGTVAETPAAFLKTGAKLAWDFVKEKASLFGGSGPGYSGPVGAGVEQWRGLVKKILQDKGLSLSFTNSTLRRMNQESGGNPRAINNWDSNAAAGTPSKGLMQVIDPTFAAHKDPGFNDIWDPESNIRASMNYALSRYGSLPAAYDRPGGYAMGGVLPSDLSMKLYDDGGYLKPGDVGTNASGKPEPVFTAEQWAVLRGNILTNAQAQNWQGIAKDLKTIARGYQKWWATSDEAKQLRESAQKSAEDAAVSGAKSALAPYGLDPLVDLGTGVAKRVESAWDASGMDVGVRGRSVVVNIDAEEGQDTIAINQLQRLEKDVDWLKVNVKRKPKAAVTTRGGVM
Physico-chemical properties
Protein length: 1604 AA
Molecular weight: 167399,3 Da
Isoelectric point: 7,29
Hydropathy: -0,04
Representative Protein Details
Accession
1ep70
Protein name
1ep70
Sequence length
1151 AA
Molecular weight
122116,53720 Da
Isoelectric point
4,85265
Sequence
MASVGYAAMPVIPSFAGISQQLQREVGGPLEKMSLKFGKSVEKGIGDGAAKAAERVEKANFRVKKSAEELTEAESKYRAEKLKQEAADKALESAAKKLADARAKGGDAAEKAEEQYLRAQAKAETASNNSRKAHNKLLDAQEESARAAKRLADAEEQAANGADDSAKSLRGAGDAAGYAAEQSKLLEDWQLKAAAGAAALVGGAVAAGKSLVDMGSQFDDAYDTIRAGTGASGEAFEGLQDSMRKVAAESIGVGSDMGAIGSTLADLNTRLGLTGEPLEEMTAQFLQLQQLGVDADINEVSQALNGFGIEAKDMPNALDELFQVSQATGLTVTELANSAVKAGPALRGFGFSMADSAALVGQMDKAGLDADKTLQSMQRALAEFASEGRDAPAALKETIGSIEDLVQSGDDAAAIDMASSIFGTRGATQFVDAVKTGTLSVEDFMDATGATEDTIGGLAEETADFAERWDQFKLQAMLALEPVATALFNSIAPALDVVAGKFEVVSAWLVDDLIPAFQSFGEWAQKNQAWLAPLAVTLGTFAGSIAAAVGAIKAWNAAVGVYQAVTKIATAETKLFNLALKTNVIVAIVSAVAALAAGLVYFFTQTEKGREIWETFTGSLKTFGEAAFNAVGDGLEWVQEKWDAFTNALSTAWNSYIKPVFDGMLEAAQATIGVIATVVLAPLMIQWELLSTAIQFAWDNVIKPVWEALVGFAQNTMQPIVEGVFAWIGDRWQAMATAIGEVWDWLKEAMQAGWEFIDAYVFQPWRVALGLVADWFRDRIDAIVFVWDGFQTLLRAGWEFIEGNVFAPLGKGLDTLQDWFSKGVEGIGRIWDGLRAVAAKPVKFVIDTVWNNGILKAWNAIADFLPGIDTVNKVDLGQLGAYAQGGVLPGYTPGRDVHDFYSPTGGMIHLSGGEAIMRPEWTRAVGGKKTVDRMNAMAKAGKLDKDQMAIGLSHGTLGAFANGGVIGAMANIVRAKYPMLQLTSGYRPGDSGMHGAGLASDWSNGHGNTPQQLALAHDIAQTYPGSAELIYDSPGWSGNIKNGQNVGPFGSFYTMAQAGPHHHHVHWAMTTAPNLQFGGGVFEGGSNGGPGGPLGGLFNWIADKARGVWAKIVDPIKGKIDGVREDSKFWDIPFGFLETIKDKTWSFLSSK
Other Proteins in cluster: phalp2_32490
Total (incl. this protein): 5; Avg length: 1395,4 AA; Avg pI: 5,95

Protein ID Length (AA) pI
1ep70 1151 4,85265
1cmyt 1600 6,44170
1k5vQ 1026 4,71300
7ubtQ 1596 6,46085
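The cluster summary above can be reproduced from the values on this page. The sketch below assumes the averages are plain arithmetic means over all five members, including the query protein 1c3gz with the length and pI listed under its physico-chemical properties. Decimal commas are written as decimal points for Python, and the names `members` and `cluster_summary` are illustrative only.

```python
# Length (AA) and isoelectric point for each cluster member, copied from this record.
# 1c3gz is the query protein; the other four come from the cluster table.
members = {
    "1c3gz": (1604, 7.29),
    "1ep70": (1151, 4.85265),
    "1cmyt": (1600, 6.44170),
    "1k5vQ": (1026, 4.71300),
    "7ubtQ": (1596, 6.46085),
}


def cluster_summary(members):
    """Return (member count, mean length, mean pI), assuming arithmetic means."""
    n = len(members)
    avg_length = sum(length for length, _ in members.values()) / n
    avg_pi = sum(pi for _, pi in members.values()) / n
    # Round to the precision shown on the page (one and two decimals).
    return n, round(avg_length, 1), round(avg_pi, 2)


print(cluster_summary(members))  # (5, 1395.4, 5.95)
```

Under that assumption the result matches the reported totals, which also confirms that the "Total" of 5 counts the query protein in addition to the four rows listed.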
Similar Clusters (pHMM search)
#   Cluster (protein ID)    # Members  Identity (%)  Alignment Length  E-value
1   phalp2_23844 (1pqSu)    4          32,3%         1026              1.542E-142
2   phalp2_30576 (5QqLi)    5          32,3%         817               8.022E-124
3   phalp2_26022 (6Tdzz)    16         32,4%         756               2.128E-107
4   phalp2_27799 (7ym26)    26         27,0%         1218              9.911E-106
5   phalp2_32224 (7cxS3)    54         28,2%         1258              1.294E-100
6   phalp2_24521 (4Tmyk)    3          28,5%         858               1.447E-83
7   phalp2_6241 (6BCsI)     53         27,5%         1045              5.772E-78
8   phalp2_7805 (7vHjO)     1          25,7%         926               1.334E-68
9   phalp2_25748 (4M8O7)    4          24,1%         1093              1.024E-63
10  phalp2_24626 (5tJdD)    1          22,8%         1086              4.853E-61

Domains

No domain annotations available.

Taxonomy

Phage
  Name: Unknown from Metagenome
  Taxonomy ID: UNKNOWN_ENVHOG
  Lineage: No lineage information
Host
  No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1ep70) rather than this protein.
PDB ID 1ep70
Method AlphaFold v2
Resolution 61.19
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
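The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows. The function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of scores exactly equal to 90, 70, or 50 is an assumption here: they are placed in the lower band.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band,
    since the legend does not say where they belong.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 55.0, 30.0):
    print(value, plddt_band(value))
```

Applied per residue, this kind of lookup is what typically drives the colouring of a predicted structure by confidence.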