Protein

Protein accession
4EXJQ [EnVhog]
Representative
1gvsd
Source
EnVhog (cluster: phalp2_23809)
Protein name
4EXJQ
Lysin probability
80%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MIDAGDESAKDFGEAFKERLKEVLDTIPDFKMKGDASDIDLKVEELRAKLKDLSEQDIIGSKKAIESLTIVETDVKHLADMSKDIKLDFNTKDALAKVAELRLESEKMLGGVGVGGGVAGKKKGITLTGLESLVGTGTGGASAGSGILGNIMNFFGGIKGFFGGGGGGGIIPAAGSAASSGGAGGGGGLSLAGLGGMAAGPAGIGAAIAVASPWIGQLISGAIVGALGSALVGIGIGGAVMTGKLGAAWKSLTSQMSADMKKIGTPFIAPLKEVLRAIGIIAGPLTKVFTTASKILAVPFQKFLDAIVKAFGSPAIAKSIEAVANSFMTILTAITPDIAPDMTLIANSITTLANTVAKNPQAMANVFQYIVDIINFLIASAGDLARIATDVEKWQHGAVGRAIGATTRHTTEAVKGVVDVNNALWDLGTGNIPATGRAAGAAGRALNQATTFRDNTTPGWVQAFSNQSMSHITDNVRIFFVSLAQNIGHWSMVAWEAMLPWIVHTWERGHNLMVAWGHDIAHWFDDVKQWFIQGWNSTYNSTVQWITRTWESGHNLMVGWGHDIAHWFDDVEGWFESGWNWVYEHTVDSITHLINQSEMLFNGWYQNAVNWMHNVENAMSGAWNWIYNNVINTISNLITRAMSLFTGWYHDILNWGKDAKTWLLQTGKDIVAGFQIGIIDAMKNVAKWGYDDIVKPVINFLLSPAGFHIGSPSKKMEPIGKQIITGIIHGMMGEGKNIGTFVGKVFGSWPNAIMSYLSKGLISMGQILKLPAKAISELGGALGFTGVAGKVASGVSSLWHAIVGGGAGGNVSQWAGMVSKALTMLGLPQSLSGQVLYQMQTESGGNVNAINLSDINAKMGDPSRGLMQVIGATFSRYHVPGTSGNIYDPLANIAAAINYAMHTYGRGLMSGGRGMGSGHGYDTGGWLPPGVTLAYNMTGRPERILSPTEMAAAALGGTQYHAHFDGLTLAAIESQVRTAFNMMSMSQGNMYRQGRRS
Physico‐chemical
properties
protein length:997 AA
molecular weight:105382,5 Da
isoelectric point:7,14
hydropathy:0,10
Representative Protein Details
Accession
1gvsd
Protein name
1gvsd
Sequence length
1046 AA
Molecular weight
110815,05130 Da
Isoelectric point
9,69232
Sequence
MAEIFVGSVAVGVVPDARGWNTRLRAQLVPSSEEVGREVGNNVSSGIVKSLDDNKARMARAGEANASAFSTTFRKRIEAAMKALPAAEIKADSTKAERKVAELRAKMIELLSKDIDVNLSSKEAMAKIAEIDAGLKILQEDANIKIRFDARTARAELTKFRAEVSRASGGGGGGILGTLSRLVPGFGPGLGGGGIGAGGAGQAGQAASAAASGGGGLLNPWTIGIGAAGVGAALPFLGQAAGGLLTGGLGTGLAGLGVLGALYGNIGKQVTVTNQQMRASHLQLAAATQRQTAAQDNLNKLEASGKATAGQLAAAHASLDSAMAGVATAQGNLNKLQQQNQEARQTARVKDMQQAWTNLGKDAKKSIAEIGAAFVPVMTNIFKTADKVMKQMTPVFADVEHLIAGPFQTFVDTILKAFAQPAVQQSIRDVANAFVEILKAFTPDIPGIMKSFAEAISRMAQAIALNPKATADFINFLFQVIIAIIDVIAWLTVAANWLESHWPTIWKYAGAVIMGFVDVFKVGFAILNGVFGFFLSLIQGHWSDAWNHLKDAGKRIWNLIKDEAGRVWNAILGVIDSITGAIGHNIANRFDNIRHGIAHVWGDIASSTYRVLADLGHNIAAHFDQIRHNIAGWGNTVLHTIRIIWNDIYGATIGALIRIGHNIEVQFNNMKHWILTFFNDAIHWLPRAGHDIINGLWNGLKAAWNFLWGWFTRIDNAMIGYFSGAVNWLKDAGGKIIHGLLAGIWNAMRNVASWINQMVVQPIINAVKRFFHISSPSQVMMGIGKNLIQGLIHGLLTSGRSLAGLVSHIFKGWPQALASFVSKGLVDIAKLPKAALNALGKVGGFLGGLWKKIAGGGGGGVQQWAGMVMQALAMLHLPGSLLGQVLYQMQTESGGNPNAINLTDINAQMGDPSRGLLQVIGSTFAAYHVPGTSGNIYDPLANIAAAINYALHTYGPSLMRGGMGMGSGHGYDTGGWLPPGVTLAYNMTGRPERVLTWEQAQGGVGGTEYHAHFDGLTGAAIESHVRTAFQAMSITQGNLGRQGRRT
Other Proteins in cluster: phalp2_23809
Total (incl. this protein): 54 Avg length: 1014,2 Avg pI: 9,50

Protein ID Length (AA) pI
1gvsd 1046 9,69232
11FC2 1111 9,33833
15Fsr 947 9,78561
16hqi 1137 9,23434
16izU 874 9,45205
1IcOY 1338 10,09054
1NCPB 901 9,44773
1NHJ0 861 9,43297
1gKhP 1304 10,03691
1pJe1 916 9,49473
23Vas 879 9,51684
23X5W 865 9,19914
23XEA 820 9,17728
2535M 1360 9,73016
254Ek 820 9,06872
25kht 1046 9,65983
2SfXY 897 8,73484
2VHYC 779 9,90694
2pMEs 1336 10,03110
3Xaxx 981 9,78290
3grNs 1080 9,95761
4C3aU 844 9,60780
4E1cA 1028 9,24543
4E6m1 1036 9,45372
4ECLS 951 9,64139
4ECU7 1021 9,53618
4EIht 948 9,57499
4EKeM 914 8,64265
4EKtf 836 10,29768
4EL0g 997 10,56787
4EONe 962 9,62012
4EQrO 873 9,57286
4ESAb 946 9,77594
4ESrf 976 9,71701
4ET51 1014 9,98901
4EYpk 1007 9,20552
4EeRn 957 9,84344
4EmeY 1099 9,33214
4Eo0e 1044 9,75653
4F0fZ 1113 8,73213
4F2Wo 951 9,48622
4FdCW 818 9,14866
4Hzkn 1043 10,27047
4viMo 846 6,76994
5nKk2 975 9,88547
5nM6P 1368 9,99739
6EqEI 1360 9,97863
6FzRU 1316 10,01170
6H9i1 1329 10,00280
6I36F 1024 8,42275
jEcF 953 9,77252
jGvz 1024 9,70283
loTe 901 9,44766
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19683
4E9jP
2 33,9% 1069 8.921E-212
2 phalp2_31893
4IKSQ
4 27,0% 792 2.216E-77
3 phalp2_26986
4JjfT
15 21,3% 829 1.742E-34
4 phalp2_12555
1cxmn
2 22,8% 699 6.383E-24

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1gvsd) rather than this protein.
PDB ID
1gvsd
Method AlphaFoldv2
Resolution 55.15
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50