Protein

Protein accession
1WaJu [EnVhog]
Representative
4M6gI
Source
EnVhog (cluster: phalp2_19720)
Protein name
1WaJu
Lysin probability
93%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAEVKAKVNKAVVYKMISYKGKTGANNYTPLTAAQRLPKHERSIATGFTTLIAGLNSLGATLNSIAANVQGMLEFWRDAVRTQIKDTKALVKEEKKTDIKEEKREKKKEKQKFDQRKLAEREKDQEALKPKPKKFKSTLAKPAKKKRRPWWDKLLEFVGDMVKIAVFKVIADNPETINKLVKIIIAIGTFAAKAIEFLGGLALDGIIDFLENPISLKGFFGIFKFLIGAVPLFLAFKFLKDPGSLVATVGKVMGGILEGLKSMFGMNSKADKLKNFKLKKIAGQKGNFFTTKAGKIATGLTAGVGTGMAIKAAGGSDSEAVGGGLGATGGQMAGAAIGSAIGGPAGGMIGGVVGTMAGGAVGKAIGGLVEPIVGPIGDFFKQIGEVFNSVVADIKKPLEEFFSTLGAFLNGILDAVEPHLPLIGKIISVGLQVMFAPLFLGIRALTAVMKLFAGDKGGSDEEITPPKKENKEGGRSKRIGEGGGPTEKVTKQVLIAGMPVGKTLTKEQMGVITMSRMMDERNYQNLSPHIRTMYENQSAESSSDTVTPAPEKKAAGGWIKGPQKGYPVSLDGRSTSFIGHGTEWVGRKAGGKAFVVPFDTPATRQNKRLTGRRIGEAKRQGYSLPKAFDSYKSGGLVKPKKGFFGKAKDFLNKTPQVKLAKWLGGKASNLLGGKDEEGKPAGAMRWLAGAADQATGGFFDFDKRGHSLHQPAGVLQKSGEILDNLKQKKNEERIKKMQAALDGAQPTVINSQKPAIQTGGGQSDSFPIIVPSDHDLDQDKYIMPKFGLIAEFKTDPVEFM
Physico‐chemical
properties
protein length:800 AA
molecular weight:85589,6 Da
isoelectric point:9,83
hydropathy:-0,25
Representative Protein Details
Accession
4M6gI
Protein name
4M6gI
Sequence length
820 AA
Molecular weight
87650,94710 Da
Isoelectric point
9,95323
Sequence
MAKVAPKVAKATMYKMISYKGVTGVQSKYTPITAAARLPKVENSVNTGLRSLVSGLNALGATLNTIAANTQANLEGWRDNIRGQVKGANALQKQEEKTDKKDRKRKLFKDKQTEKRRKFQLRSTKEEKSEKKKNKKKMGFAERGIGAAKKVGGGLFGILGNLFGLLMDAIKFKIFEWITQNPKKVLKLGLVLASIGKFVFNVVGFLGGMSLDGLVSFLENPISLKGFFGIFKFLLGAVPIFAGMVLLKNPKLLLDGAQKVIGGIVGGLKKLFGFQSKDQKFKEYKLKKLGGKKGNFFSSKAGKIATGLGAGFAGFTAAKASGASNTEAVGAGAGAAGGQAVGAKLGEMTGIPGAGAIGGMVGGMAGGKIGQAVGGLIEPIVQPISKFFKMIGDTFGGIVAEIKAPLEEFFKTLGGFLSGILEVVEPHIPLISKIIGIGFKTLFLPLFLGMKALTAVLKLFTGGKDGKKDVKPGGGGDDPLKGDKKGNKTVVKQTATVEKGKVTSGNMSNDDIVDYKIRKLERRKHPGMEKWEVENIDQQIANLEKNRKTGGVTVSDGSEPDPFGFAKGGWIKGPMSGYPVSLDGRSTSFIGHGTEYVGRKSGGRAFVIPFNTPATQKDSSLTGRRYGEAKRGGYSLPGYAKGGTALVKPKKNKAFGWLKKKFDKLPQVRAAKWLGNKVGTKIEQAHKMLGAKDDQGRPQGIARWLAGAADTATGGVFDFDKRGSMLEGASRLKDNVGQKLEDAKQKMQQQRYEKLKNSLQDSASTIMIDQGESPMSMPGGDMTTDNPIVIPGEDHHDADKYIQPKYGIIAEFMTDPVEFM
Other Proteins in cluster: phalp2_19720
Total (incl. this protein): 67 Avg length: 781,5 Avg pI: 9,80

Protein ID Length (AA) pI
4M6gI 820 9,95323
11273 827 9,98198
112Th 727 9,70341
113T2 794 10,01073
114fz 721 9,90255
1STRk 959 9,83454
1TW78 697 9,79334
1TYot 822 9,94098
1TasO 584 9,97057
1TtLp 815 9,74718
1U02n 793 10,07346
1U0uY 727 9,71734
1UD6h 582 9,96367
1UfHh 451 10,02711
1Uvyc 791 9,40737
1VHso 770 9,82126
1VP4X 697 9,77349
1VZ4L 794 9,91422
1VjZQ 721 9,95316
1Vn2p 799 9,45772
1W2lw 930 9,69613
1W3vX 730 9,77072
1W9IE 959 9,79502
1We8h 789 9,39686
1WeMa 712 9,78812
1Wg0Z 813 9,76736
1WiSN 730 9,77072
1WmuY 882 9,96638
1WpAL 712 9,79541
1g3VN 764 9,87909
1g4Ar 831 9,63424
1g5B3 710 9,70380
1gdDF 790 9,96651
1whNB 562 10,16249
2Qcdh 723 9,85343
2SX5k 964 9,78541
2UijE 794 10,01834
2Uoat 786 9,91280
3EKzc 821 9,72572
3FTr6 959 9,84008
3HWUn 727 9,70328
3Uo1B 815 9,72991
4M1Co 939 9,77020
4M6hl 740 9,94336
4unWP 464 9,84782
4vG0g 810 9,39706
4vHbO 816 9,75176
83ycG 832 9,82049
847zh 959 9,84008
8B4BK 553 9,42078
8CQrH 836 9,77168
8Cb0Y 890 9,92518
8ENft 959 9,84008
8GSnZ 810 9,35683
8u2zc 764 9,84189
8uvKQ 810 9,36231
8uvXf 836 9,77168
8y52d 832 9,82049
8yYKX 730 9,77072
8zyD0 821 9,72572
YDaE 786 9,84137
g237 964 9,83441
id6L 891 9,94053
paWb 626 9,80624
pbQA 730 9,79489
pbcn 770 9,83757
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5745
4SbyE
114 33,7% 989 4.992E-220
2 phalp2_17849
icYV
5 31,9% 520 3.707E-112
3 phalp2_8494
1gcVP
15 27,6% 692 2.941E-74
4 phalp2_1947
4upJo
19 23,9% 648 2.056E-33

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4M6gI) rather than this protein.
PDB ID
4M6gI
Method AlphaFoldv2
Resolution 49.00
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50