Protein

Protein accession
1Ozea [EnVhog]
Representative
467FX
Source
EnVhog (cluster: phalp2_1882)
Protein name
1Ozea
Lysin probability
71%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
LADVNSNININFNTAAALAQLRQLQAGLSKFHQTLAEGNLAAANAQKGLNAQLLQSVGATGKFSASQVKVAGSTLAFTSALEKNKLSLREYYRYTMAAATANTRVMGKAFAQEREIINRARRDRVKALQSQYIQMNKANAGFMDAIRIMPKSLQMASGKFTELGTRIQYAAQRQQFLNQLLKQGSTQLLNFGKNTQWAGRQLMVGLTMPLALFGAAAAKAFKDLDAEIVKFRRVYGDAFTNDAEVNEAVENIRRLGNEYTKYGVSVTKTMEMAATAAAAGFQGDALTAQVETATKLSVLGQIEQQQALETTISLQSAFGISSEDLAKKIDFLNAVENQTLLSIEDLTIAIPKAAPVVKQLGGTVEDLAFFLTAMKEGGINASEGANALKSGLASLINPSKKSAAFLADLGINIRGIVEGNSGDIKATVVGFAQALDTLDPLNRARAIEQLFGKFQFSRLSTLFQNVTKEGSQASRAFDLAGSSIEELAVLSEREMKKIEDSTGVKFQAAIEKFKQEIMPLGKAFLEALTPIVKFFGGLFEKFNGLSDQTKKVVATIIGVVAGLGPIVLMTFGLLANGVANVIKFFAMLRGGIAKLNGQTSVMGAGFNYMTQEQIESATASQQLHQTHTRLIEVFNVEKASVNNLAASYHALSTQMRSMASQNPALFAGGLGGAKVAASKLPPVRKYIDGVISVPGPKGAGDVVPAMLSPGEAVIPTETTDKYRGLITAMFQDKVPGFMAGRLPGGPGRGIPLSSGPEAVRKAQQAKYTRQSDARQGYDGPHPERKTGPVFVGMPKTFKEASQSRQMLDKISERTSLGIFGTMPPSNFGRKLQGFKGYSFPDTSIGGVYRKPNGKVVVIKPTVSADTALAEVRMAEIEAARGMIVPKQQIRTMIDPTDPTGQRKFIVLESPYDRRFVDMDGKFSKGDMVKQLVGSTLRGDKDLQKSNVSGNRVPDLSNAGVFDRASGLREYADVMPSMKEQAMINLLAVKGRARKDFALSTAPIAAKMTPKEYDDAIKGEINRTIPIVERIIKSWNLTPDEQIVYGKMLDRLKAGAKVEWDQFQPIHARAGQGIVKADDGIGSGAKGSIVQKQIEKYLIGSDDIKTKQISDLSESVEKEFASKVKELKARGEAQVGGTSAENRARIVQAAWNGDVVPLPPSGKDKNWSARQTMFQNEIRSMIPINVDGVTKYLHPDDMDSFKSDPDHRAKYARTRQEVIDNVLYRMGVIPNKDGVFVDKGGFKGRFANKTEFAKDLRSSNKEAGGGINMTSKVVKPFAKEEIKEYNARVGNPLKTQTGQWMLANGWSPDEVRDRLKENLSHIEKEVTGSTAGAEKMKTGKALYDARLLNNYLNADRRSSNILAWNGTGPNNILGLSDTEIEEYKKAAAFMERERHPQNPQERALLSKAVELDQKVIEHKSKGNYVPSNLLIQDQRVPKALQMLLKDPKIQPGRLIDLAATTSDKVFVNPGNEFKVNTKSGKESKLTQANTNKPPANAKPSSGSIRDTRTTSLGAHETVATRRQLMNFRRGFTPPPGFSMPGMVYGDPRLDQQSTRGGSLSKAAQGRVTNAMQKQAQLLKQRNNLTQKEINQALTQYRTRLIAAEVEKRNQHEKQKAMADEKRQRAQSSQTAKEIARQEKADAKRDIKEKRMARQQKIGGMSGGLSMGLGTAGMGLMMAGQQTAGMAVMGASAVAGLAPMFAGMGPGGLIATGVVAVGASLWALDKHFENAAKKQAEFISSVSATTKKMDQIGEVTGKVGASRAMEQFRSKGSFGDYNDVSRAGTQFGDTFLESEVGKEMAKGFVDNMARFGSKQASQDFGLQLATYISDGVLTSDQAASIAEQIGVELGSRSYTTNIQAEVRSLVGPNGEDLAKNALTVRTNLVQEANKRSVAELNALQQGGGRDNAARVAGLSLNNIEIARAQNDATEWQYKKQIQIFEKQLLQTTVLEKQLELKTKISELESKMEADTKVTTDLLNKQIKAEMDVFANNIQKNRSALTAWFLPGDDEEDAYFTGLKARVQERFKDTFPGLTAQLQKDLAGLSNRTDLGIKGFTGGQKEAQQFEVFANLMMGQGFTNPQQFKEFMTLYAGNEGKLSKDLQIAVDAQGPEKAFEMLQFFTEYADKPFAQKIYADIITKDPKQFDKIGKALAQLSVLDSKEVNVEAYLKLVGADGLEKLANDLAIIDNIPDVTTKEAILKYFEQGSGKGMAGMDKAGLETLMAKWKNWDKLPDVAKKEAISKYKTIYETVFADEAARIDYAQKIAREKANAGSDGGRQTYLYEFIYRSTYETLMKGDKNQQASAIASAGAYEYVGEKGVVDPKVPGVVPDSSGTGTKVDPYANLLKRLKEVRNAAIDASGGIKALNKALAEGNVTSVKNKYQGIEQQLQGKGYNQGFIDFIQGMDPKEQAKFMRTASKATKGKYKGQVVNPFSKKPAAKGKTPFKAGQVFLSEDAKSMGSGFDALVSGEFNTQADRQIKSLKYQEDAYKKLRAAGYDHLTSQKIVSDEYLAQSISLGKITTEEIAKNGVLAKEIVTRQKINGLIQSGRDKINEQNMYLAKSKTEVSKVQELLMFMGSKNIKFGEGAIRDILGDPETLSALIMAMDDVKNNTEGAAKAMEDLVAGVKALKDNSDIELAIKIVTQTATETVSMGAQAAQRINDALKGVYKNLTASELGKVTSFNPKTKVTKNTGAQAIENVGKRYAAAGATVPKVDPNAKPGSKGFDSADTVRSVQKERAELGKTISIAQSNANAAQQAYSNIMDSIRNADENLRKNIDGITERYKTQVGGLDDQNKKIKDAEKILDGFTEKEDELNKNNAMYSNDLALIDNLSDDINKKYDKQVEALETVNKLNEQIAQSQQQQLDLADALSQGDIAAAARAAQEMRATSAANQGDAMMQGLEKSRQGELGALVGPESGMTRLQIEKEQFRISQELYKLETHPDRKAANEAIKEAKKAIAKLEEDMAAEIAKAEEDHKAIVESLKAAAENAKKIADDAAATVKSLEAEDTALASIEDYLTAIAEEAAALDESTGYTLEDYLEMLEKMPDLVKLAQDYKEAMVAAELASGNMSKSWTEILASINALPETINIKSVLDIVENITRNITEYITTVNLKSGASSSSSSSSSSSSSTGGAPGKAWLKDGNGVWIKPTKPFGDYEWNDNEGWTKETKIDPNDGDTILKNAAKAAAEKAAAEAKAKAEADAKAAADAATNNLLNGGSFFDFSDGYRAKGGLINPMKFAMGGFARGTDTVPAMLTPGEFIMSKYAVQAHGIDTMKAINSGKTTGGAVYNNTYALTVNAKTDANPNEIAQAVMSTIKQVDDRRIRGVYLNAR
Physico‐chemical
properties
protein length:3330 AA
molecular weight:361251,2 Da
isoelectric point:9,12
hydropathy:-0,43
Representative Protein Details
Accession
467FX
Protein name
467FX
Sequence length
3224 AA
Molecular weight
347019,08270 Da
Isoelectric point
9,39242
Sequence
LSDVNANIGINFNTADALAQLRTLQSGLSRFHQSLAEGNLAASNAQKGLNAQLIQSINATGKFSASQTKVSTSTQAFTTALEKNQLSLSQYFKYTAAAATANSKTLTGMFAQEREILNRARRDRVKALQSQYIQLQKANGGFVDAIKIMPKSLTMANGQFTELGTRIQYAAQRQQFLNQLLKQGSTQLLNFGKNTQWAGRQLMVGLTIPLGMLGSYAARTFKEMEAATVKFQRVYGDAFTDSRTTDIAVENIKRIGMEYTKFGIAVKDTMDMAATAAAAGFSGTALDAQVKQANKLAVLGQVEQQQALETTISLQNAFGISSEDLGKKIDFLNAVENQTLLSIEDLTIAIPKAAPVVKQLGGNVEDLAFFMTAMKEGGINASEGANALKSGLAALINPTKKSSEMLAGMGINIKGIVQGNAGDLKGTVVGFARALDTLDPLNRARAIEQMFGKFQFSRLSTLFQNVTKDGSQASRALQLSGASMEELAIISEREMGKIEGAVGVKFQAAVEQFKQTIMPIGKQFLEALTPVVKFIGNLFEKFNNLSDGTKKFVTILTAVVAGIGPIFLMTFGLLANGLANLIKLFAIIRGGIAKLNGQNKMLGGGFDYLTQQELENVASSNALHGSHEKLIQVFNVENVALQKLANSYANAASQARALATSSPGLFASPGAGAAVAKLPGGQGRPVRRYAEGVLQVPGPKGAGDIQPAFLAPGEAVIPADVTAKNIGFLHAMMAGKTPGYMAGKIPARPAFHAKPDELQSGAKFVGMPKSIGQVTQSRQIADRIAESVTKSQFGKVPPTDFGTLLQGTSGRSFPIPNVGGIYRKPNGEVVFVKPAVDATSALAEQRATIIARDVHGLKAPNQTIKTMLDPTDATGKRKLIVLESPYDPKLAEASGKFTKKQMITQLVASLLRGDKDLSKSNVFGNTLADVGPAGVFSKASGFRDIQSSMPSMKDQAMINLLGVKGGARKDFALSTADIAKKMTPQEYNSAISGEIAKVLPKLKKTVAGMNLSPADASPYNAMIARLEAGQKTDWSSFQKIHAAAGSPVKKLMAGFIPELSVSQRSQSQVEMQKFFDWADKQVESSKLDGTFKDRWKKSTGPAFRQSLMEKMIYDSNSKTFWTNQGGTKGVNLERMQERFNYRFGIAPDAQTGKYVKSNLFNINKFLSNVTSGGQSKRGGDIGANNPKAKKVWDEIKVAAANPGGIKSDKTIRKYTDFLANMKDGAGNETQLAKDLKSDSPLVRKSAMESVDKMFRLDASHAQAVRVYDSLENAQAGKSRPASKSEKYSLGQMGPDYRVINEFVKTNDPGNKSRFEKILEWNNKNGNPLSIDGTQNRNLQSAIKSIIKEENHPFDPQNQKHVKALAELEIKARDLMTVDPSKAKGLSFLSKSSSAYINARLTDNIMSERISDSNWFKRMNENKMFMNTSVLDSQGRPVRGALETVRKSEYYFDSKTGTFIPYTGQNTSAVATGKVTGKGTETRTPNASKTGGQPTATRSQARAFAVRRDAGDPRFQGLGTESTLSKNAQGRITNAIQEQARLLKLRNNLSAKEIDQALSQYRRRLILAEKEKAYANAQTLRMREQAKIDSERVVNSRAIAKQERAQRQMARQEKVGRYSGGASAALGGVAMGAMMTGADSKVTGGLFAASAVAGMAPMLTNPYVAAGAAVLALAGSFFIADKASKAAAEKISKLTDATSATTEKMKSIGELTGKVGASELYSRKRSTSSSDRYTTGFERGKQQFGSTFLDSSAGKDVMTGFTESLKSGTDTAAKQMAVQLSGYISDGVMTAEQAHSVASQIGINLNNQTLTSQISGQLLSLVGPNGEDLLKNPLEVRVKLVQEQRNVTSGLKDQLQGSITDTRNSLDSRLSQASKVVSPFFPSAFIGQKAPGFTAYSAKEMFGTTKAESQAAAGAASGAQNLEFNQAQIDSLNVQYDKELKILETQKAATVDAAKRKQIDDQIAALETKRVSGIQTLRKSNSDILKDQIDLFKVAQQRSAVENAFFDSLKNQVRTKYKGTPQEAFVDPLLKSTADLKSKNLEVKINTIVASGQLPPATATTLLEMFSGDEKGLNTFINTSLKIHDPGKLSELINSLGGIKDKKVVKTIITNIANKKPKEAERLMSTIALMQKMAGKEINIQAFFEQKDAMAKLEKLQGKLEEVEGMPTPITKEAIALINTDGNSVTQDMGALLAVWDQWGNLPDETKKTVIQEYITLLKTITEGDVDAEIKKRIAAAGGASTVADYYSTSAGRDAARSGLAAARTMQQTKQDIASAKANASNADSTSGSKKADPINDILARLKQVRLASLNATGGIKELFKAVGDGKKVTGVIGDLFNGMQQQFVKKGANQQFIDFLTSMVGDPAELAKYMKTAIKASSGSNKGKVVDPYTGKVIKGGKVGDVVLSDKGKAAQAGLTKAIGGDYNLTQLKSIKNDQDRAKVMERISTLAKTNNKFVIDNQTLQNILNDEYYVTELAAGRITDAEFETNTLLAKQAELRQRINGIVSDGLSAKQELSDKGRIGELLKFMTQTTDLPKLSSNALLDMIKDPNQLSAAIAAMDMYKSGIEKVPSSLQKVVDGLNAVQKNAKIQGYINFASQTVPEKISQGAAAAQNILGVKARLREKMTVSELRKYGTKANPTMGEKAYQGAIGSTGGAEITGGGKSLSQIQIARQGLGSQMNLVQARSNQIQNDISAKEDELNRAIELKNKYYDDLINTEKDSIDTNESLLKKQFTDLIDTKQTESGKLSNDLAIINHQEEEINKVYDERIKALTETQQINQRLIEQQKAQLGLADALTQGDIAAAAQAAQEMRSQNASAYAQSTSDALTQARDNQVKGIKGAESGMTKEQISERQYKISQEIYKLETSPERLALTTAIEASQSKINGYEKDRSSAIKTINADYSLQISTLETSLKSQKDILDKLEKQDLELAAQEAEMLLILDSLINMDDFAGKTLQDFEDMVLKAESMATSLEEDIVKAMMAIEEDSKSTSGSWTNIVDKINALPDSITIKSIIDEVRNIVENITRYITTIENGSTSSSSSSSSSSSSSSDAVAALDKALQGKDGRDAAGGSSLGFYNSLGSDGEKGSTSVANSQADAATLGNQTAQGYANKIAAMDRAMRGKDEYMASGGIVPKYFAVGGLSRGTDIIPAMLTPGEFVMSKYAVDTHGIDKMKAINSGSDVSSSVYNYELTVNVRSDASPSDIANTVMSKIQQIDSMRLRGNKL
Other Proteins in cluster: phalp2_1882
Total (incl. this protein): 127 Avg length: 3165,7 Avg pI: 7,81

Protein ID Length (AA) pI
467FX 3224 9,39242
15s96 3228 9,21874
163ri 3036 6,12050
18P1Q 3332 9,24465
192Ad 2860 8,63149
1DIba 3207 9,14589
1NL6D 3332 9,38771
1O3kS 2865 5,39393
1Q8W5 3241 9,35302
1QGwq 3623 9,15562
1QkIc 3241 9,33491
1WGYg 2786 9,28804
1XQG1 3255 9,40408
1Yjpq 3241 9,34400
1ixwj 3277 9,38068
1zAhi 3292 9,35605
20yr1 2885 6,55276
24vz9 2934 8,92463
25utO 3940 9,34149
27BQk 2926 6,26550
2WFjp 3408 7,30997
2ZwLF 3183 7,30719
2hDky 3166 7,99074
2hO9D 3023 5,67920
2hSiw 2923 8,83812
2qoKI 2883 6,52929
2qq2O 2951 5,38648
31C49 3266 9,38629
35Sxb 3508 5,49698
37qmD 2783 9,35818
38mZ8 2791 6,56430
38nGe 3214 6,57396
3UrIZ 3131 9,04841
3Xqtq 3232 5,48078
3Y2Zk 3127 9,03404
41FzH 3127 9,04841
41XC9 2990 9,08754
46LxM 2855 6,05451
46U3B 2908 6,36633
46UDw 2990 5,68233
46wiE 2862 8,95796
47dE1 2870 8,84095
486Fa 3022 5,62919
486cx 3063 8,94146
48Cjj 3138 8,55910
48Dn8 3000 5,85722
48NVe 3250 9,21319
49011 2782 9,00257
49I8H 2548 5,92441
49MgZ 3917 8,91296
49v8A 2894 8,71311
4CRPc 3922 9,31499
4Rr9k 2834 9,10411
4aEFB 3948 9,28185
4aET9 3120 6,07253
4aLid 3928 9,28275
4aOB9 3435 5,69398
4ad2F 2795 9,25961
4afzb 3183 9,30351
4agGO 2786 9,28811
4al5L 3228 9,30951
4at5h 2970 5,73093
4atsF 3540 5,79169
4bUv0 2780 6,74704
4boAH 3133 5,41149
4c0Cs 3978 8,84888
4lb11 3305 5,74315
4lh53 3016 6,26175
4mm4y 3209 9,14866
4nHZ9 3145 5,50249
4o6F2 3240 8,92650
4oXtL 3337 9,17941
52H0Y 3480 6,04059
52Ubm 2988 9,07349
53qCL 3127 5,64556
56fOC 3126 8,55387
56p6S 3341 9,40408
57aXQ 2859 8,56774
57rwV 3852 9,25078
59apB 3052 5,65022
5CsQV 3401 5,95891
5DHbn 3118 5,34675
5HBdc 2999 5,65061
5HzlE 2861 6,19343
5cEAa 3227 9,16491
5dXkL 2786 9,28449
5dgiO 2950 5,88013
5fqgN 3114 8,88743
5iTXd 3420 5,83977
5iUjt 3490 6,08060
5kTRB 3081 6,39356
5kvZJ 3030 5,45480
5kwTm 3195 9,27147
5lbK9 3957 9,21616
5lcv1 3211 9,16884
5mpiH 3468 5,40876
5un3d 3031 6,35906
5w6jC 3519 9,14492
5xgyq 3138 5,46111
5znr5 3071 6,43238
6GUb8 3228 9,24794
6Gd1k 3318 9,03997
6GlSx 3229 9,32859
6Gzzg 2928 7,99796
6Iuyy 3020 5,84790
6Mks7 2952 8,13702
7JTJI 3121 5,67272
7JVHS 2786 9,28791
7LgY0 3029 5,93583
FFh7 3139 7,09313
Fflz 3180 7,30719
G4kE 3681 9,35541
GEMX 2973 6,30142
IMuj 3145 7,71876
SJRF 3327 9,42162
UtUs 3182 9,25903
X5YR 2975 9,32614
XaVr 2987 9,19914
avxS 3154 5,48067
azrq 3116 6,01967
h1IW 3127 9,04841
hZ8A 3350 6,44380
iRJ2 3181 9,09438
qCMY 3291 9,06769
x50Q 2786 9,28449
xopG 3254 9,30171
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_37182
1NZLQ
22 42,0% 3384 0.000E+00
2 phalp2_23947
22KDh
54 26,3% 3803 0.000E+00
3 phalp2_35064
51kDg
15 32,3% 2282 0.000E+00
4 phalp2_17496
5lNm7
2 20,9% 2315 2.936E-237
5 phalp2_32961
4623L
1 22,3% 3552 3.363E-199
6 phalp2_8136
6Lxlw
6 21,1% 3815 4.933E-185
7 phalp2_23044
49sYk
5 21,7% 3482 1.340E-155

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (467FX) rather than this protein.
PDB ID
467FX
Method AlphaFoldv2
Resolution 51.40
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50