Protein

Protein accession: 4EjfM [EnVhog]
Representative: gdKu
Source: EnVhog (cluster: phalp2_2849)
Protein name: 4EjfM
Lysin probability: 92%
PhaLP type: VAL (probability: 99%, predicted by ML model)
Protein sequence
MYGVKASFDQVAQAGLDVASKIAVPLNQIQGGLYDIFSSMDVNLSQAKFLLTNFSKEAVAGQVDLSTAERASIGIMNAYQMKVSDVTQVQDIMFNLVKYGVGTYADFANSIGRVTGPAVRANQTFQQTAALMAFTTRNGLSASNAASAVGRALDAIGKSRDKIQNFGQIVVNTLGPATAAKLGITAQSMIKMTDAGGKLLPINQIMTEMGTALKGLNPTQLNDVLTEMFKGTGGTIQAMRFLDIAVKNYKQLNTITKEMGNSKGALQAAYNVMANTPAAKIQLLKNNFHVLMIEIGNVLIPIVKKGATILAGIFAWIAKLNPAIIKWGVIILAVVSVLAIFVGVLVAVTGAWIVLSTVMAASEIGLAPIIAIILAVIAVIALLALGAFELYKHWNPISTWFHNMWFDMWHWIDHVWQMIAGSISGAWDKIEGVFKRIEDWISSNFDKWWSTHGEAVEAVWRTVWNNVSMVFSTFWDILVSILKTEFDLFTSIFKIYLNTLIMIFRVSWSIIQGIFVFAWDVIQAGWRVFWAILLASAKIFWATIQFMFRQVWDTLVVIFSVFLDILSGHWHQAWVDIRSFGVQTWNNIKALFTVIWHGIETVANAVVGAMENLFFGAWHSIYNTTHGIWGSIKSYLGSIWGDIVGGAKIFVSNLGRVWNTIEGVFKSPVNFLIQYVYDDGIRGLWNTVMNAVGLGKLDLPSVKTLAAGGHLAGFGGGDRIPALLEAGETVVDKHKSRAYAWLFKMMGVPGYQSGGVPGPGIGRGGALGAGNPTGPTFGPLQGLLGLTGAAGKMLLAAATGNSTAFANALIGATGSGNAGGSFASMIAALPVALFKHVIGKVWSMITGSASTATPGVGAGPGGGSVTANMRLAQQLMPAWSSGRQWASWMSLWNQESGWNQFAFNSSSGATGIPQALPYTKMPRAAWLPGQGGSANVRAQETWGIQYIGGRYGNPANAWAHEVGFNWYGNGVNGTFNKPTVIGVGDGGPEDVTVTPHRRGGTHGPVQQFFITTQEINPRMHAAQLGFELSRRSG
Physico-chemical properties
protein length: 1033 AA
molecular weight: 111199,2 Da
isoelectric point: 9,49
hydropathy: 0,29
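
The length and hydropathy values above can in principle be re-derived from the protein sequence. The following is a minimal sketch, assuming the hydropathy figure is a GRAVY score (the mean Kyte-Doolittle hydropathy per residue); the record itself does not state which scale was used. The helper name `gravy` is hypothetical.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues.

    Whitespace is ignored and lowercase input is accepted.
    A non-standard residue letter raises KeyError.
    """
    residues = [aa for aa in sequence.upper() if not aa.isspace()]
    if not residues:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


if __name__ == "__main__":
    # First 10 residues of 4EjfM, used here for illustration only;
    # paste the full sequence to compare against the listed values.
    fragment = "MYGVKASFDQ"
    print(len(fragment), round(gravy(fragment), 3))
```

If the result for the full sequence differs from the listed hydropathy, the database most likely uses a different scale or averaging scheme.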
Representative Protein Details
Accession: gdKu
Protein name: gdKu
Sequence length: 1085 AA
Molecular weight: 116014,12810 Da
Isoelectric point: 5,44247
Sequence
VALSTREILLVMRAQDEISGVLSKLVGSLGSVDKAAQSAAKSQMATGVAIAGVGLGMASVGAKTLESMKTATDAAKEFEQGVASVATQVTTTKASNQELGDTILNVAKNTAVPIKDLTSGLFDIFSTIDVNVPQSQELLTAFAKEAVAGQLDLQTAGRATMTVMNAYHIPIDQVNKVLDINFQLHRVGVGTYQEFAAVMGQSVPSAIRAGQSYETLAGMMAFLTRNGLSAGAAAAAAGRSLDAFSNPKVVDRLKAMGVAVLDSKGGFNDMSVVMEQLQQKLAGLTAPERSQALHDLFLGAGGTIQARRFFDMVTSGTESAKQFTGFVDDMKNSTGAFTDAYNEMADTTQNQSQQLENEWQVMQIKIGQALIPVMQELIKILTGVLEWWNNLDDGLKQNIVRWVAIGAAILVVLGVLVMIAGAFVTLGGVAALLGISLGALLGIFAAVVVGIAAIVAVVVLVVQHWTEIKNAVIPIWDAVLAKLQQVYDWIKSQIGDKLAALWKDITNTIRAAWSPVADFIGGIWQKIADWANKIWPDVKKIIDPIIQWFKDIWPYVKDIVSANLNAMADALTFLWGIIKAVFTAIWDVIKGVVSGLVSIFQGLIDFIVGVLSGDWGRAWNGIKEIFSGIVDILDGIVQGLWDLIKGVFSAGVDFVVNLAGDFGKLLAGVWNVISTTAVDLWTNFWNWLKKLWNDAVSWIMDVINGFTKAVSDSFSWVIDKITQIWNGLMDIAKKPVQFVIDVVYNNGIVPLWNGIAGLFGLGKLDPLHLADGGHVNGPGGPRDDVIPAWLSNGEYVMPADKTSRYFGALEAMRAGRFADGGLVGDIGSFFSSAGSWLAGIGSSIADFFSDPVGSVKKMFQGPIDQVKQIAGTKWGQALAAVPGKVLDGAVHKAEDFAKSLLNLGGSGGNVDQYSPLVLQVLAMLGQPASLLPNVLRRMNQESGGNVTAINKYDINAQRGDPSQGLMQVIPSTFAAYAGPFASLGIMNPLANIYAGLNYALHTYGSIQAAMDKPGGYKNGGWLKPGQLGYNETSKPEAVFTQEQLAGLLNHKKSSQQVSQNFYITTQEIDPVRHAADLGWELAKRS
Other Proteins in cluster: phalp2_2849
Total (incl. this protein): 25; Avg length: 1150,3 AA; Avg pI: 8,09

Protein ID Length (AA) pI
gdKu 1085 5,44247
11FI4 1483 9,97444
16LyO 1112 5,37000
1fBpD 1196 9,74254
1fvZL 1075 5,64476
1zFMt 1138 6,63802
2SaJF 1099 9,35199
2Sqd6 1085 5,20545
2VHYw 1119 9,74564
2VoxE 1484 9,84705
2mM3P 840 8,69777
2oTkq 1468 9,95142
39lbl 868 9,03010
3fEEx 1170 9,77884
4C2SS 1105 9,55139
4DXnI 1061 5,50420
4EmXV 1121 9,48228
4EzHo 1403 9,69529
6Ep5l 1122 9,52045
6HRIG 960 9,71450
6Tcyk 1481 9,56564
6sGp 1102 4,89119
ghz7 1063 5,26070
hKQZ 1085 5,15629
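
The summary averages can be checked against the member rows. The sketch below parses rows in the layout used above (protein ID, length, pI with a decimal comma) and computes the means. It uses only a three-row excerpt for brevity; `parse_member_rows` and `EXCERPT` are hypothetical names. Applied to all 24 listed rows plus this protein (1033 AA, pI 9,49), the same calculation should agree with the stated averages.

```python
from statistics import mean


def parse_member_rows(text: str) -> list[tuple[str, int, float]]:
    """Parse 'ID length pI' rows; pI values use a decimal comma."""
    rows = []
    for line in text.strip().splitlines():
        protein_id, length, pi = line.split()
        rows.append((protein_id, int(length), float(pi.replace(",", "."))))
    return rows


# Excerpt of the cluster member table, copied from the rows above.
EXCERPT = """
gdKu 1085 5,44247
11FI4 1483 9,97444
16LyO 1112 5,37000
"""

if __name__ == "__main__":
    members = parse_member_rows(EXCERPT)
    print("mean length:", mean(length for _, length, _ in members))
    print("mean pI:", mean(pi for _, _, pi in members))
```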
Similar Clusters (pHMM search)
# Cluster (protein) # Members Identity (%) Alignment Length E-value
1 phalp2_25706 (4EKKm) 16 32,8% 1123 1.319E-297
2 phalp2_21893 (4FldJ) 1 27,0% 1200 1.565E-169
3 phalp2_34757 (3gFYO) 3 29,5% 735 9.200E-124
4 phalp2_8106 (6DOYh) 4 24,1% 924 5.550E-103
5 phalp2_35703 (3OIDf) 36 24,3% 1140 2.904E-94
6 phalp2_6468 (lLS2) 4 26,3% 995 1.345E-81
7 phalp2_22110 (6cZ09) 37 24,1% 935 7.714E-79
8 phalp2_4612 (4jjYp) 24 24,1% 1400 1.027E-76
9 phalp2_18752 (YoZh) 2 23,7% 924 9.386E-63
10 phalp2_25988 (6Hb5t) 7 24,1% 1021 2.057E-56

Domains

No domain annotations available.

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (gdKu) rather than this protein.
PDB ID: gdKu
Method: AlphaFold v2
Resolution: 61.99
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
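
The confidence bands listed above can be expressed as a small lookup. This is a minimal sketch with a hypothetical function name, `plddt_band`. The legend does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    Assumption: boundary values (exactly 90, 70, 50) fall into the
    lower band, since the legend uses strict inequalities.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```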