Protein

Protein accession
7YqIx [EnVhog]
Representative
1dg4y
Source
EnVhog (cluster: phalp2_16789)
Protein name
7YqIx
Lysin probability
97%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MANKTAKGLVEFAKSKIGVHYVYGAKGEILTKTKIYSWARQYPNIYTQAYINKALQWVGEEAVDCSGLISWYTGIIRGSGQFEQTGNSKIPPSKLTDDKLGYAVWKQGHIGIVLDRNHVIEAKGINYGVIQSNLNSTPWKKAFKIKDIIYDNTVTEYKNGFFKVNGSWRYYRNGIIVVNSWVNDNNRWYVADGEGKLITNQWFYEDGKWYYLSGDGGMISNQWLEYKNNWYYFDGAGVCLTNTWYRYHDKWYYLDDTGAMRKGLLEDNGSWYYLDENGAMVSDVNIKFSASDDGSLKFSGLTNKED
Physico-chemical properties
Protein length: 306 AA
Molecular weight: 35187,0 Da
Isoelectric point: 8,19
Hydropathy: -0,56
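The length, molecular weight, and hydropathy figures above can be approximated directly from the protein sequence. The sketch below is illustrative only: the page does not say which method it uses, so the Kyte-Doolittle scale (for hydropathy as a GRAVY score) and average residue masses plus one water molecule (for molecular weight) are assumptions, and the function names are hypothetical.

```python
# Kyte-Doolittle hydropathy values per residue (assumed scale; the page
# does not state which hydropathy scale it uses).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (amino acid minus one water).
AVG_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "Q": 128.1307, "E": 129.1155, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def protein_length(seq: str) -> int:
    """Number of residues in the sequence."""
    return len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues (GRAVY score)."""
    return sum(KD[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da: residue masses plus one water."""
    return sum(AVG_MASS[aa] for aa in seq) + WATER
```

Passing the protein sequence listed above to these functions gives values that can be compared with the reported 306 AA, 35187,0 Da, and -0,56. Small differences would be expected if the page uses a different scale or mass table.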
Representative Protein Details
Accession
1dg4y
Protein name
1dg4y
Sequence length
283 AA
Molecular weight
30692,37710 Da
Isoelectric point
5,46958
Sequence
VKNERGFYMKTKAELVAWCESKLGTPYVYGAKGAVLTQTQINTWAALYPSTFTAAYIAKAKTFIGQACTDCSGLISWLTGTLRGSFNYKDTASQTVVIGKLDESMIGWAVWKSGHIGVYIGNGYCIEAKGINYGTIKSKVSDTAWTHVLKLCDIDYTDSSNVSSTYEIATGAAGLIITASSLYVRDYPKTGDVLDTLMKGTTVYPTGKAFVDGEAWLQIPAGWVSGKYVEGWIQESGGAWWYVMAGYTYPSGTLQEIGGSYYAFDTDGWMLTADRISESGAIV
Other Proteins in cluster: phalp2_16789
Total (incl. this protein): 102; Avg length: 265,7 AA; Avg pI: 6,68

Protein ID Length (AA) pI
1dg4y 283 5,46958
13ypO 217 5,56223
1d7z1 293 5,42326
1dapC 283 5,46953
1k4gs 262 5,84796
1m2jd 262 5,85325
230 257 6,88902
2m0wc 240 6,08884
2mxLi 306 6,13136
2mxUb 257 6,43027
3dnGO 257 6,88197
3kAKK 259 7,48703
3lGfI 257 6,43130
3oQfO 257 6,10436
3pJS2 257 6,42874
3q8hF 257 6,43027
3qAzp 257 6,92853
3qqBv 257 6,43852
3qqqC 257 6,43261
3qrFj 257 5,61020
3qrit 289 8,91309
3rYXK 257 6,42874
3sKmF 302 5,73400
3sMa2 257 5,95385
3sRqT 257 6,87868
3srX9 259 6,52173
3srnV 257 6,21423
3swco 257 6,42897
3uAXh 257 6,10805
3uO5v 257 6,10186
3uOK2 257 5,84966
3waHt 259 6,52639
3xLCg 306 6,13807
3yNWe 257 6,10965
4VBpg 281 8,54633
4VzIX 257 6,10175
5Mfdm 306 6,45057
5OJjl 306 8,65264
5RLOh 257 7,49146
5Ro5i 306 7,61497
5UBZd 306 6,90437
5UmfJ 262 6,13073
5W7Qk 262 5,86234
5Wgrz 262 5,85694
604Gm 269 8,94236
61sOM 257 6,93859
62NBC 257 6,10436
63e4Z 218 6,87925
659H6 262 7,03265
66ru5 262 5,86257
67bZa 262 6,43238
6aLD4 257 6,11090
6aRAd 306 6,95967
6ar5N 262 5,85876
6bJxA 233 6,08185
6bi7Y 262 5,63635
6d24K 257 7,50430
6dr8V 262 6,13727
6hkRa 257 6,43050
6hqBR 262 5,85489
6pg3l 257 6,42988
6puW4 306 8,18279
6qEjM 262 6,22401
6qMVv 262 5,63862
6r3pH 262 6,11419
6rUw6 266 9,17948
6shfG 270 6,89061
6tZ5L 306 7,62901
6vPrb 257 6,42738
6vzhY 257 7,51544
6wHOh 308 6,95598
72zAT 257 6,21645
737Fq 257 6,10680
737K6 257 6,43141
737TH 257 6,87947
73Ong 259 8,26892
75Gd 259 6,52287
7BiwK 274 5,63623
7UZY6 266 9,00058
7UkxC 257 6,87748
7W6Q3 297 6,90522
7Wj2y 257 6,87788
7Xial 306 7,61861
7l0w4 257 6,88197
7mJr1 257 6,52230
7oeZG 259 6,08685
7plPT 255 5,40655
7rBnx 257 6,43209
7s4iD 257 7,02396
7tHRn 257 6,31307
7xy1R 257 7,53534
82A1I 258 8,37562
85kIl 257 7,55944
863Bt 257 6,52042
87K1I 257 5,83437
88VU3 247 9,22783
8fzQS 257 6,87748
8k1mo 306 7,62901
8lk03 256 5,65255
8sdzv 256 5,48510
aNSh 258 8,37562
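The summary line (total, average length, average pI) can be recomputed from the member table. The following is a minimal sketch, assuming each row has exactly three whitespace-separated fields and that pI values use a decimal comma as on this page; `parse_members` and `summarize` are hypothetical helper names.

```python
from statistics import mean


def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '5,46958' to a float."""
    return float(text.replace(",", "."))


def parse_members(lines):
    """Parse 'ProteinID Length pI' rows; lines with another shape are skipped."""
    rows = []
    for line in lines:
        parts = line.split()
        if len(parts) != 3:
            continue  # e.g. the header row 'Protein ID Length (AA) pI'
        protein_id, length, pi = parts
        rows.append((protein_id, int(length), parse_decimal(pi)))
    return rows


def summarize(rows):
    """Return member count, mean length, and mean pI for the parsed rows."""
    return {
        "total": len(rows),
        "avg_length": mean(r[1] for r in rows),
        "avg_pi": mean(r[2] for r in rows),
    }


sample = [
    "Protein ID Length (AA) pI",
    "1dg4y 283 5,46958",
    "13ypO 217 5,56223",
    "1d7z1 293 5,42326",
]
print(summarize(parse_members(sample)))
```

The table lists 101 rows while the stated total of 102 includes this protein, so reproducing the page's averages would presumably require appending the 7YqIx row (306 AA, pI 8,19) before calling `summarize`.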
Similar Clusters (pHMM search)
No. Cluster (protein ID) Members Identity (%) Alignment length E-value
1 phalp2_11216 (6pi59) 34 32,2% 257 8.242E-42
2 phalp2_23391 (5URxG) 99 30,5% 239 5.540E-28
3 phalp2_33418 (6WYXp) 10 26,1% 245 1.327E-18
4 phalp2_31731 (3P98Z) 2 28,8% 222 7.174E-16
5 phalp2_17704 (71inw) 139 28,4% 197 1.045E-14
6 phalp2_16393 (5QiP5) 40 24,7% 242 3.112E-10
7 phalp2_13292 (3o9d4) 478 29,7% 282 7.604E-09
8 phalp2_35102 (5jO3n) 4 23,8% 239 1.812E-08
9 phalp2_40066 (41ciE) 3 25,1% 207 7.586E-06

Domains

Unannotated
Representative sequence (used for alignment): 1dg4y (283 AA)
Member sequence: 7YqIx (306 AA)
Axis: residues 1 to 283 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1dg4y) rather than this protein.
PDB ID
1dg4y
Method AlphaFold v2
Resolution 91.99
Chain position -
Model confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
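The confidence bands in the legend amount to a simple threshold lookup on the pLDDT score. A minimal sketch follows, with a hypothetical function name `plddt_band`. The legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band is an assumption.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band from the legend.

    Boundary values (exactly 90, 70, 50) fall into the lower band here;
    this is an assumption, as the legend leaves them unspecified.
    """
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


for value in (95.0, 80.0, 60.0, 30.0):
    print(value, plddt_band(value))
```

If the 91.99 shown under Resolution is in fact a mean pLDDT for the AlphaFold model (the page does not say so), it would fall in the "Very high" band.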