Protein

Protein accession
6yqWr [EnVhog]
Representative
5Di68
Source
EnVhog (cluster: phalp2_35139)
Protein name
6yqWr
Lysin probability
73%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MANTRVNIDVNINTGDAARSLRQLQAQINSFQSALNTNNRLQGDASRYYSQQIKDLANQSGFFTAETVKMRTAASQLDQTLSKGQGTMRQFVSAKFLKDSAAAAQVLSLANSRASALQTQFVATGAAANGFREAVAIRPLQAFNSEAVVSTQKLAIHRAMLSQSTTSMINFGKNTQWAGRQLMVGFTVPLTIFAATSGKVFREIETEAINFRKVYGDAFTPPEEMEANLEAIKELAKEYTKYGIAVKDTIGLAAQAAAAGSQGADLIDATRESTRLATLGQMEQSQALETTIALQTAFGLQSKELTKTINFLNMVENQTVVTLQDLAQAIPRVAPVIKGLGGSVEDMAAMLAAMQEGGISAAQGANALKSGLASLINPTNRAVEKLGELGININAIVDANRGDLMGTVQGFARALATLDEFGKQQALEKVFGKYQYARLGALFNNIIKDGSQASRVMDTAALSAAQLAASATKELGAIEDSSATKFVAAMERLKLAIAPIGEMFVELATPVLGFLASLVDKFNSLPDFAKKFIGFGTVITGIIIPAGTMFLGLLMNLAGTLTKFGLVVGVAFKGFAQGGLKGSIEAVSQALNYMSLTELDAANASTQLGTSTGIVNQALRDQVPAAAGADAAIDSLSRSYAALIVQMAEAAQLSKVAFVAPGAAMSGAATTGAEAGRRRGGPRIRRNSGGTIPGSGNTDTVPAMLTPGEFVVNKQATAENLSLLHSINKGKKTLGFNKGGQIPGMQYFGATNPQRVVQQASRFWNADSAEFAGMFSTGSKKIIRSSDTPSETRRTVSVAFQDSANGRLTSKQLSQLLGKPERTIQEAATSLGIEQMKLPGISSQLVRAWKKEDVDGYLASLNLRAEKAHFGTKKATKSDIDAAIRLRGTGERAQFAKRLFDADIPVTALTDDWATFSSSLNQAANRGNTRISIEQAIDDILKNKNILELGPAITNAGVSKEQFLTSLIAELKNKNGKEIFTDTILTSARNDALRKLGKSVTPSTTGLRIPAKADLQEIVAPIVRKANPEDVTFDEINKHFLNKKGFEILSAKDTGTGYSAIKEKASGKFEKLAASTGYNNGGMIPMLGAGGIALAANAARGARGRIPANIVEWFSRMTTSINQSNRDDMLQNLPANIKRGLSTSRKKIDITRGSVDPISGRTLKDVSSWSTGQGVGRFMESSRLFDTMNNSKFQIAQRSQGMKRDIDWLRSLGPDGPHPLKTWSPFNQENGALGLRLTPEEIQASGGVIWKANKKKLEERIKRDQEFIAGHQKIIDDLQESNIPTIYKTSIKKGEKYFDVSGRIVPRDEQRINNPAASNEILDEKEVALLGARLTGRPKTRIVEREEAQARYDKAKEEIKKIRMSIFNKEMSAEKAKPLLDRLHDETENMAAFVNWGRGREVSYPNVVSNLSMGGILPQMRNIGGEIFDSTKTSKTIVPGVGNKDTVPAMLTPGEFVINKDSTRKNLDLLRSVNNGTIRGYAKGDLVDFGNGEYMVEEEDGNFSDPMSKKEAKKRSRMIKTKMSGGRGMGISMGMGAIGGAMMMAPMIPAIGENATLSAVAGGGGMALSVASALPMLAPMLAPLAAIAVPAGIVAASLAVVGVALYAWRKTVDDAAVKAAEFGANIGGTANALNTMAQVLGTLTPAQARQKAVMGISSEEEATIANEFSGVFESGAGQKFIEDLQNSTSAERFQKLSDYLKTAIASGMMDQKAATTFAKAVGLKLGDSVLSNQTVSSISGQKAGTSAIKEIADMRKAAVDADGSLAKLAASSGEASLSNEAAGRAIGAATQIIQDYGNVIGVAEQQYAEGTIGWNEYHTAITEARTAQEQYTNVIKKSVSNAAEQGAARQALDQQMVAAGYTEDQVGAITNSTNIGTSSLGFSESKDYLRPALSLLSGNLVGAALGMVDAITPSQSAYQKEFVKQQADVRFDKSFGELDSTQQTSLVEESKEAQDLAREVDLEGLNEAIKASVLAGDMTAEQAAQFGQEIMYNVDALKALRDGASIATAQFITFAQNIPGITKGQSSDVVTIGKEFEAAGGNTALIENYIASLPQNLLKTGIETLKSAGIQTGDTGRNIISPEEQERRRTDIAFGLQSLSGSVGVEQAVAIQKSDAYGKAVREVDGSEKINKAFARAAEGGVAERVIRYVIQTEGENTPAEMISSITKLTIAQEDIMSLPDEMIKGLKIDTTDPADLLKFGAAAKTLKDQWKIIENLNPNINMTAAMEFLTLDAEGKPLTPEQVSKNVIKLNKAMKDLESGNSEIRKRAMIDIAVAYSGKNIAGGTPQEAAANAIAGLREQFKNFDKLDPVVQEQMIEIRMGYEVKNLSLVMQLNSAMAALKTATDAESRQIILKRIASLQSQIGTAEASVKANLAPIPGSGSSSGSSGSGEKSIAQQLKEQAKLSGETFKGISNLQNEKGFKNFIAGPFAPEFLEYLRSQGKAGLKLIKGGLEKVRAAYADYEKTKISDAAAMAALTPQALVKESKKTALSSKYNKQLANKGMYADDIDAIRQLISDEDLLYAQQLRRKLKDKGTSKKEKREINKELEAFKGEREAAERDAPIINQAQKISGSMVDMKTQTSLLVAELGFINEGKTPELATALAAAGLKAGDATGKVELFTDAYNKLKAAEYARDPGRLEQEKRDNLMEINDLNRQIEEINLIRPLEDQVEVQQKIIDGQQKLVRVKQQEIDAIGRTIELKQRELEPLDDQIEKLQELSDKTSEAYDAQIEALDEVYRKEENIAKLKQGQLDVAQALSRGDVAAAAQGVLGMSQELARQSREEARSALESQKDLAIKNIQNQILEIEKQKKKINQDIEDLQMRQRAIQDDIYTIQSTKILPAENEIYKLQGMINVEADKLKGKYSSAAIEMENLIRLLKLAKEEESRLNNIILGPNDEEKKAAPAQPAPDPYPIGQSGMLLNPAGPALTIGGGHLGGSGGGGSNSVNTPPVGPPGPPAPASGLSPAQKAEAARLQAAAAAAERQRAIDAGAAQDRAKAAANTAAAAAERQRAIDAGAAQDRAKAAAERERAAQAAAAAARAEATRQAIARAAAAERQRAADAQRAADAQRAAQNAAALRANSAARRAFGGFISNYAMGGMVNYKGSTERAPGMMYGGSAKKYAYGSTVPGRGMTDKVPALLTPGEFVVRKRVAEQYGPLLKSLNGQVFPKENFDKNNDEYARRKRDMDMMRITEKKSSSSLKDAVQNAARGGLNKINKNPYEYTQRLIQPVMNKEDRLRSGLEPIFRIQPMPRKPDASIMPMPRKPDVSIMPVPNDKDASSLKDALDKIFAINNQNKKDRARLAVEPNRFNADKLNDFRAMINQKVFPTMKTNSFNSSNKTEKSGSMYNYNVNVTLNGSDMDANDVANAVMQKIKMTENKGIRSNNIRG
Physico‐chemical
properties
protein length:3390 AA
molecular weight:363642,7 Da
isoelectric point:9,10
hydropathy:-0,34
Representative Protein Details
Accession
5Di68
Protein name
5Di68
Sequence length
2642 AA
Molecular weight
286800,26290 Da
Isoelectric point
9,55346
Sequence
LADVRTIIDVDLNTSAATAKLKQLQRDINSFTLALNKGNILQTGTVKQLKEDVLDLANSSRFFTGEVVRLRTAASRLDSTLAKGQVTLGQFFSARFKKDSFAAAQVLQLANERVRSLQTQFVATSGAAGGFQEAVAIRPLAAFNKQVAVSTQQLAIQRAMFRQATTSVINFGKNVQWAGRQLMVGFTVPLTIFGAVAGKAFRDLEKEAINFKKVYGDVFTTEAETEKMLEEVKGLTKEFTKYGVAAQKTMQLAAVAAQAGQQGAELISATREAIRLSTLGDMEQEAAMRTTISLQSAFKLSSDELAESINYLNYVENSTVLSLEDLSAAIPRVAPVIVGLGGNVRDMSIFLAAMREGGVTASEAANGLKSSLGRLITPTAKATAMAESFGINLEKIVQTNRGDLLGTVMELANAMKTLGNIEKQQLLSTLFGKFQYARIGALFENITREGSQANRLIQNIEVSTQELAQTADKELSAIENSIGVKFTKAMEDLKIAIAPIGELFIKLSIPIVKFLSKIAEAFNNLPDFAKNFIAGGVAITGIFIPAIVMVFGLISNLIGQLLKMGGIFGMFVKGLATGGLKGALESVRQSFQYMSLAELDTAVASQQLASATNIVNQALLDQVLRSEGATAALVELGSMYDLVAAKARNFASAQGAALFVPGKGMGISQATKGVTKRNTGGTIPGFGQNKDTVPAVLTPGEFVVNRDATKENFHLLKAINNGDVQGFNGGGIPGAGPGLFMQRIQKTQQMRTQMAKQQAQASQLQDTHIYFSGKRRSIKYGNIDPRIGGSQVEDIKTLAKLGFPVEQLDDKYLRLPFNRDLVNGIKGVRAKQMLEELDPDVDILGPAFKRANVNQRQLVDNLIANIDERTTYKDFGPNAFGQIVSKSLGTTRVNGLPSQLRLGTLSSIKEKFPNEVALAMASPGKRIPVGNTGASLLYEDNKLFAITPDGVKLMSSGTRGHRMNRGGGVARSGRMFYGNVLYPLFKFGTTRLRPRGGVGPDAVDVLKSVRGKIVSQKVQREMAAERFAAGDARIAAAVESGRRSAEAQRLREEIFTKIGKQRRALDLYDRNKIQMTAVLSESLDAPVHNTTLRKLLESAMRDPDLSHAPFISMAQKLDDGSVVVFNVPRGTLGINREGNLSMMDRKAAMPGGPGIAVPKNQAIIGQIAGGTRTDLHGVNMDKKGKFAILAEVIEFMNKGGVAGVSSMQRFALGGSVFAEQLRRFAAMRKSGSTPEQIKSEYKKLAKQYHPDLNPNNSEYMKQITAAYDQLMPKRKIKLRNYFGGKIISLDEADRDFDNFKRNNLTGKFNLSENSIRTGMRNGETYVAYDDIRTGSFANNVRAGHYVITDKNGSLGYTVPGGMRYGISSPFDKGRKHNVFPLSEFIEGYNHGGMIPGLQYFQKGGIAQLVQNIARERLMMSGARSLKDLGSRTTRSDRQIFEKENFARWSRTKGAGQRGSGYADAVAAKYITQQELEAMSGRVVAHAVGGAFRRRMESLGLKSDQLGSYPTGTFGQIARALAAKGPFTPDGRSRKSFERQAKDLAQVLPSTGVSVPLKFNNQIKNGVGTVKALGALGPQHMVFLMASLLQRNVPPAVARAVLTWAAVGLKRNLRAYDGKINESIFYDELTKAENKALIRYEDDFIKLGIAANSGGMIPEYGTGGNVMKSFISGRVFDSARRNIVPGTGDKDTVAAKLTPGEFVVNKKSTKENLGLLKAINSGQMASMNVGGQMIPLQKFIKGYALGDIVSADQSDVKTRESLYYIEGRDGRRFGPMTYDQALRRQKQLTFASARKRLVPPPNRQLQAMGGMRGFGVSTALGMGGMGIMMVGQQMQAAAAQRNETSAMGGALANVGMGASLLSFLPMIPKIFSPIGLAVTGLVAALGVGVIALKAWRESVDQASRAAAKIGSNLGVTANSINKAAELLGLQTPAQRATAIAIGFSQEEQQKAQQEFANIFQSEEGQKFIEELKSATGAERFKKAGDYIAAAVATGMMEKDVAYRFAKSLSMILVDPTLAPKLSGVIATQETGSKAIISMMQKREKEITNTKAFRTVQQVQETEKQRMRPSLFFPDETVLKRGYLPIETGSKVLGSAIQIIQDAANAQALAREEYSKGTISLKEYRDVVDQSRTTQEKYGNVIQTILDRSVDPGATNQAVKSAIVDAGLATSQEFDATRSAIEKAFASGGAASQVENRNFNFGQDEAMRIMTEGYASGMGQQEILSIIGSLLSSRGIIASGYSRAQEEGLGVFESAQFGILSQNLINMQGLSAPEISKAGLTTQGIQDIAFKFVQEGGTSQEFQAFLESLPPDKRLEILIDFKESGKSPREYVTEFQTLQSLIGTEMAFKVKTKFETEGMSKEEIDSVVAGMKDLEKLPPFVEKEAFFKIVTEDGKKLLNEKQMEEKIKQYNKIIDDLSSGDKEVVRKAELALATFYNGNFSDIEAIKKDFENWESMTPDQKEEVILQRTQIKNEILATFGIDIDKKSLEEVRKVIKNIQTSYKSMAIKAEIKGSPDLVKFANERLAEANAIAAQLDVLSNLEPSAIGEGSQTEKAGGGTKPLLQQLKEDAKLAQNIYAGLANLIDEKGFKKFIAGPFAPEFLQYLRSQGEQGLKIIQGGLNKVREAYKLYQQQQSSDLRSGLLS
Other Proteins in cluster: phalp2_35139
Total (incl. this protein): 10 Avg length: 2968,8 Avg pI: 8,06

Protein ID Length (AA) pI
5Di68 2642 9,55346
2ObET 2714 5,19965
4A6iQ 2560 5,82147
4W8ci 2575 5,56655
4YeKM 3181 9,20198
55xbY 3410 9,03391
5c6Eb 3453 9,02217
5u7mO 3267 8,65541
6U7uD 2496 9,41343
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19451
300HN
2 23,0% 3601 0.000E+00
2 phalp2_23044
49sYk
5 27,5% 2333 4.980E-298
3 phalp2_30333
4Co56
9 29,4% 2289 1.022E-292
4 phalp2_32961
4623L
1 24,0% 2314 3.801E-206

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5Di68) rather than this protein.
PDB ID
5Di68
Method AlphaFoldv2
Resolution 42.89
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50