Protein

Protein accession
4bdjJ [EnVhog]
Representative
Ghy7
Source
EnVhog (cluster: phalp2_27950)
Protein name
4bdjJ
Lysin probability
99%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
LDSIRTTLRYDADLGAAMAQVKALTGQVGALNLAFNSLDKNAVGVRNTLANTFAANLSSMGGFRTEMVNLTGQTEKFGRALAANKLSMREYFAEAYRGYTRQSSMMRQLAREQVKFQEAIAIPMGRNAAGQMQGLMAVPTQVDFKNASTRMKLLSQEFNIFNQLVRNGADELINMGKNTQWTGRQLTVGLTMPVVLFGATFAKVFMDIDKQMTRFAKVYGEDVIGTSARATEDMKKQVMELAETISSQFGVAASETVGLAADIAATGKEGQDLLDTVAQTNKLAVLGEVDRQEAMKATLAIQSAFKQNTDELAESINFLNAVENQTSTTLNDLVEAIPKAGPVVKGLGGSIKDLSVLMVAMKEGGIPAAEAANAIKSGLASLINPTKKASEVAKQFGVDLVGIVEANKGQLMPTIMAVQTALNGLDAFSRSKIIEEIFGKYQFARISALFNNLGKAGSQTQQAMELAGASTTELAGIANQELKAYTESTTVRFQRMVETVKNQLVPMGASLLEMLTPALDKLSGIIEVIRNAVGNLPDFLKEPLKIIGAFALFAGPILMLVGLFKNLIGNAIKFSMSIVGLGAKIAGLDVRKFELLDAETAAATMQIDNMSTSFVEQKVALSNLNVELGKYITQLRTVATQNPNLLVRAPASPPLIRRSDGSRGPEIVPGGYGGGDKIPALLEPGEFVMNKEAASRFGSILNAMNRGTIQKFQEGGLARSHVEEVLVDGKKQFGGVISIENQDYNQILNSHKDLFPGRKQPVTMADFYASLAAARAYAQNNQVTPSFAAGLAAMEHDARIINNDENTFRRMLAERMALGTLSVLPSSVVTRTGLTVDQIFNAERDRFLGILTSNTDATSIRAALVESSSGTLRANGVTYAGEGRERSLSNIRGGGRRGEGETRYSLGLSGRLMPLYQKDENRRNVPSLSTVTRLANSAFAPFAPEYFRGPGQGQPVPPSMARPVGPGRRPLVGVMPGELLVPPRMFADNGAIVDENGRVRPLPPAHPESPLMQRLAARAGRTTSSISAPGTSQASQAQPDVERSRGSLMGGGMVGGMFGLSMAGSTMSMFMDSTSEATQALSKFSMGLMAVSSIMMMMPSRMPSNMLGLGTLGGRMSAAGASRAAAGATGLGTSALLRGGAALSMLGGPVGIAAGIGVTAAIAGFVMYKKAAEEARQRAISAFSDPVKTAEYFGKRVEDVTDKIKQNTLELQAGADSVSQIDESLREAIRQDYSSLIERLKYSAAEAGAKQLAIAFNKMVSSGLSADEARSAIQAIAEESGTAGGQAFGIAMRQSMLREMTAPELIASTRSLFDPSQQTALADSMREQQQQMRTEAGALMLAVAKEQATFRAGLEDPVSAIVNMAARTGNVPSWAEWLMSPNQQALNEKSKALEQLASDLEQLSDIDSSQLIGITEQLFSNFEKAPREAIKAFDDLAVAARESGGIAFDPGPVAEYLKELDDISGPMLARFIQNNEERAQAVMRAITAGLTPQEIQKALAEGGLDELNIRIGIQIDIQEAKNRVEEAAQAVKDLASETGTMTEALTKGQQRLSNLIQQRTKAVDRFEADAETMAEAFKIQQQAAEDEINALEKEQNQIEKSADSYIKSIENRRQADKFYADQRRSSLDALSALAEGDVFGFLTARNEMQANAADYAYEQEINRIEERKNLENEVLQARIDKRQEEMDLASRAHDAAMQAMEDEKQAFFTSQDEKIAKQKTANEEMSKMIDGIKNGEIEAIDAVKTYFSKAAAAKYEEAVKYQMKETYALLQQQILKGDITKEQAGIQLQQMMQALFPRVQSGAFNVSQTIEETLKYLQLTPFQSQIQRSIESNIGDAAATAAGDPPVFRAAGGYISGPGTSTSDSIPALLSDGEYVIRASSVEKYGPEFFDQLNAAKFALGGMVEGYGEQQSRRASARFVGRTPLGGRRGGTGRGTGTDQTSMAGQGTDGFLAVEFAKQQLGEAYSLSPNNINSWGCSSLTATSWNQGVPNGSKKYGMVSYSATQIANSRQTAVRSSGEPGDGSAPPIPYSTMRIGDIVYFKNTGLAPSGQHVGLYAGAGRMIHAGNPVGYSDLSSDWNRRYFSSAGTPIAKFAKGGMVGRAFGAGGLAKRNMGYNLGGMVDARTVNSSNPMYNITINADGIKDPAIVAEMVVRKINTENSRRSHGRVI
Physico‐chemical
properties
protein length:2169 AA
molecular weight:232574,3 Da
isoelectric point:6,05
hydropathy:-0,23
Representative Protein Details
Accession
Ghy7
Protein name
Ghy7
Sequence length
1975 AA
Molecular weight
210018,42020 Da
Isoelectric point
9,42085
Sequence
MAEIRSTFIYDGDFSSVQSGIKTLTAQVNLLNRSFNSLDLNARKVQSDLARSFAANVGAIGGFKTAMVDVATATERFGNALQKQKLTLREYARESVNAFRKASNARKLAEEQVKRMQSTLVPVGGGKGMLVTPLTLDTKDFTTKLAVARQQYAIFNKLVSDGTTQLINFGKNTQWAGRQLTVGLTVPITIFGSTLSKTFREVDAELTRFAKVYGSDLVNNNKYATDQMRGQVLQLAQTIAKEYGIAAKETAALAADLAATGLEGQALMDSVAETTRLAVLGEVDKQEAMKATLALQTAFNLNTQELAQSIDFLNAVENQTSTTLTDLVEAIPKAGPVVKGLGGDIKTLSVLLTAMREGGIPAAEAANAIKSGLASLINPTRAANEQLANFGVDLQGIVERNKGQLLPTILEFKTALDTLDEFSKAQAIETVFGKYQFARMGALFDNIGESGSQTVKVMELMGASTADLARIAESEVSAVTNSTSAKFNRAVESIKASLIPVGEALTNAVIPFLTKLSDLIAQITEKFDKLPGPIQNIIKALGVISVVAGPVIMLVGLFANFVGYATKGAMGIVNLTRRMMGLPVDQFQQLTNDQLAANIATDTLTGSFASQRSAVMSLNAALEDYIISLRQVQANQPNLAVAGKGRPPIRKQKGGSIPGYGGGDTVPALLEKGEFVVNKAAARFFRPQLEAMNARKMQDGGGTEGLTAAQKEYMQDLKDNPYSRSHLTPFETEDGFKVFGGTTAASTGPYNSALNNITQILKDPDSENSKTLKQAANIVAGKTGVSAEFLLSGREPGNLQEANAQRAFLSHPLVATMKTKKGEQAIKGRRAAVGFLRSYTGGIKGTPDEKEAIFRQRLAERMALSSASLYRDATPEQIKAIYDAEKRRILPILERTHGDPDERGKQLRDSAKVTYYGNTVGGKLLSTPLLYKGKSRGDSAGSARSMERTMREVGGTSFKFQGVGMGFKSQTGVPVTVAEKRPEVVTNKPKEGTSKTITTQSGRVFIGPIGKRPGLEEEGVVNPDGSVERLTKKERMARVGASAGNYGMTASLLLNTMSAMSGQMSKTTENLIKFTTAIGVAGMAMQAASGVGGMFGKSGKLFGAGQKVGLQGALLKGGAKTAVGRGAGAAMMGAGRALSFLGGPAGIAVGLGITAAIFALNKYKQSLEEARQRSVAAFAQSAKAAEYFGVKLKEATFVRPQESKLFADNAEREAMLKIANEDYKVLVNKLKTQNNADAISELQLTYASLISQGFDQDQAQKIIKGIADAANKENIVIDVIANVKDIKTPKDALNAVGNQAVRTLQAGGVENQGAFAAAATMSIAKNYQAAPLEALGQIKDIVAEIDSTEFSGFRTGLYDSLKESQPELAEFIKGLNNSEQAGNALLLALTGLDYTLASGATSARDLAGALAQAEARTSAASVIDEEIKKEQERFKVIEKGYNDAIDARQYELDHLQENTEKKIKALEKENRALQKKQNNIRKTTDAYLKALEEQTSADKFYQDQAKSGIGAAAALASGDVVGFLQGRADMAQQGAEFSKEQSITAIKDRSDAEIQLLQDRIDANEESIRKAREYAEIREATLKKEITAQRKALGAEREANSARIDALTTLKNTDVTLNNVTSTLEKLKGKAKMGSDEIQRIATSLSSAYGNAYGNAIKQAEVSMNLPAGSLSDLIKNEVSRRMSTPTYPTPSNVSGGNGGKGGDASSGMSGAGLGSVASTGAMTDLPLADILRSVGSTFGGMPSGGDYRNPLSGSQYRISSNPGERINPITGRREVHKGIDLAAAQGTPIGAFKKGTVKFAGLDPAEVMGNKVVLDHGNGMTSVYGHMLPKLGVQNGQQVDAGAILGRVGSTGRSTGPHLHFGVLKDGESLDPRQFVSLRKGGIIAKDGTMAKLHKGEMVMPKPIAENMFRQSSPRFNPAVPTAPRMSPSSTQSRQTNVGPTTAYISFNGNMKDPVAAANAVYGKMMSMNNRRPK
Other Proteins in cluster: phalp2_27950
Total (incl. this protein): 42 Avg length: 2390,1 Avg pI: 7,94

Protein ID Length (AA) pI
Ghy7 1975 9,42085
15fIs 2871 9,11726
1AD9m 2220 9,06401
1esGq 2149 9,21416
24i6H 2991 8,64368
25wp5 1957 9,00960
27QhP 2621 5,93674
283Rj 2132 6,54333
38jcD 2532 8,74586
38kL8 2205 8,88556
38p7f 2056 9,08954
4aSIO 2723 6,39714
4aaop 2608 6,27874
4b8T5 2161 5,91332
4pn2S 2140 6,22742
4qnJ0 2590 6,39538
4zTee 3092 6,74715
50MPt 2160 8,55890
51z3U 2182 8,69912
53Pfr 2242 5,58121
53t1s 1246 9,09348
5HBrG 2739 5,40484
5bNCK 2227 9,04016
5bj2w 2224 8,82619
5cWN4 2465 9,03797
5fa2H 2530 9,03507
5kFsT 2391 9,29101
5n7ga 2037 8,81794
5tBM6 2599 9,11365
5w9ua 2569 8,42229
6Ghn6 1928 6,14682
6LJcR 3384 6,29216
6Lmpx 2501 9,04744
6MJZI 2646 8,46671
6z7VC 2162 6,35786
6ziPC 3017 8,80427
6zqeO 1618 6,24532
83XFU 2434 9,05737
MiPt 2085 9,12165
S02T 2452 8,30831
XaVi 3353 8,93901
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_17593
6xCjI
21 26,2% 1709 3.753E-166

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (Ghy7) rather than this protein.
PDB ID
Ghy7
Method AlphaFoldv2
Resolution 54.57
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50