Protein

Protein accession: 1dYXQ [EnVhog]
Representative: 5tggt
Source: EnVhog (cluster: phalp2_39519)
Protein name: 1dYXQ
Lysin probability: 99%
PhaLP type: endolysin (probability: 98%, predicted by ML model)
Protein sequence
MTKFTFNPLKTCKLRTAGLYSVKGATFGMVRKNNDGTPRAHQGIDLATDESYRLYAVEDSKVIDIDKGLSGYGWTVTLQLNCPHKKELHNKFAFYAHLDRVDVVEGTIINAGHVVGLSGDTGNAKGMSTVSKGGHLHFELRDKAFCGLGLKNRFDPLPFVTLSE
Physico-chemical properties
protein length: 164 AA
molecular weight: 18018,4 Da
isoelectric point: 8,96
hydropathy: -0,29
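As a rough cross-check of the properties above, length, average molecular weight, and mean hydropathy can be recomputed from the protein sequence. The sketch below assumes the Kyte-Doolittle scale for hydropathy (GRAVY) and standard average residue masses. The page does not state which method it used, so small deviations from the listed values are to be expected. The function names `gravy` and `molecular_weight` are illustrative.

```python
# Kyte-Doolittle hydropathy scale (assumed; the page does not name a scale).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (free amino acid minus one water).
MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free termini

# Sequence of 1dYXQ as listed on this page.
SEQ = (
    "MTKFTFNPLKTCKLRTAGLYSVKGATFGMVRKNNDGTPRAHQGIDLATDESYRLYAVEDS"
    "KVIDIDKGLSGYGWTVTLQLNCPHKKELHNKFAFYAHLDRVDVVEGTIINAGHVVGLSGD"
    "TGNAKGMSTVSKGGHLHFELRDKAFCGLGLKNRFDPLPFVTLSE"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da: residue masses plus one water."""
    return sum(MASS[aa] for aa in seq) + WATER


print(f"length: {len(SEQ)} AA")
print(f"molecular weight: {molecular_weight(SEQ):.1f} Da")
print(f"hydropathy (GRAVY): {gravy(SEQ):.2f}")
```

The isoelectric point is left out because it depends on the chosen pKa set, which the page does not specify.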
Representative Protein Details
Accession: 5tggt
Protein name: 5tggt
Sequence length: 151 AA
Molecular weight: 16286,24720 Da
Isoelectric point: 6,85805
Sequence
VTHPLLGRAVKSERELTPVRSGGFTGKVGSFGPTRVHADGSPKMHKGVDLLCPERWPVFAAHGGTVERAGWEHQVYHTQGYGMRVRLVGKEAVTVYAHLSELFVVQDQVVAEGDVIGRAGRTGNIEPSTPTHLHFELHLAGGPVDPEANLV
Other Proteins in cluster: phalp2_39519
Total (incl. this protein): 99; Avg length: 165,2 AA; Avg pI: 9,08

Protein ID Length (AA) pI
5tggt 151 6,85805
1HWqf 169 9,42239
1IKfk 156 9,37894
1KGtx 153 9,17309
1KYDt 164 8,58347
1KpU3 166 7,81558
1KsKM 164 7,81674
1Kz2f 161 9,14382
1MHDh 158 9,43419
1MLVy 161 6,57857
1ci7q 173 9,21448
1d7TI 194 9,54295
1euYk 162 8,94223
1npWg 161 9,40634
2R5mz 171 9,49737
2cpKg 168 8,79827
2fLG7 165 9,57009
2rP3N 182 7,11661
2seS2 155 9,27470
2sieY 154 9,32840
2skzR 154 9,30035
2slp8 154 9,12068
2smoG 154 9,07684
2smuV 155 9,63926
2snBd 156 9,51465
2snKu 155 9,12068
2sp06 154 9,44206
2tznq 170 9,63617
3QAsw 149 7,26137
3V5Ki 131 6,96593
3XFKv 175 9,47564
3XrDQ 175 9,69671
48yR8 171 8,98246
49ZX6 173 9,34000
4A3QC 176 9,72243
4G8IM 173 9,44708
4KSxi 167 9,54856
4KYXK 157 9,41975
4Krd6 190 9,87180
4Ks3D 175 9,66750
4MUZM 164 7,84131
4RAOp 171 7,84111
4Rvh6 147 6,74482
4RwbW 164 9,29797
4RzjT 170 9,34593
4SzhM 168 9,93666
4eXPh 163 8,94255
4kOni 164 9,94858
4pcr7 176 9,52109
4r3xn 152 6,70242
50h7M 173 9,34020
54nGg 162 8,71949
56xt5 173 9,54230
5EgQo 175 9,49776
5aOP3 177 9,91255
5aqVP 162 9,08071
5btK6 175 9,67027
5e9Qu 173 9,31982
5hq8A 175 9,49776
5ntFG 168 7,00486
5o4UD 175 9,56455
5yWKF 154 8,92554
5ze5s 176 9,61251
6AGaa 150 8,84920
6ASmX 162 9,38004
6BKvy 167 8,84269
6I1Ec 158 8,50765
6xUlh 173 9,44708
7I7bH 187 9,49737
7MBYU 161 9,34290
7MYYZ 161 9,51787
7d8DE 169 7,89443
7dFk 170 10,26957
7kAGc 160 8,93069
7oB1Q 168 9,21094
7w9Cl 174 7,09370
7wnsy 168 10,31457
7yGtk 149 10,01731
7z1KC 169 9,62818
80WZH 169 9,77207
8dJMC 159 8,73445
8luoH 158 9,60445
AVaW 190 8,69371
CoFJ 164 9,51787
ECtK 163 9,85091
HLte 163 9,51787
Iqdv 170 9,72494
J4ZQ 173 9,30061
LsaX 176 9,83383
ND3I 163 9,54443
O5a0 164 9,16587
aB52 173 9,36121
fyCD 160 9,43619
jJL4 173 9,42568
jQSU 132 9,42491
kZwz 163 9,09638
oRWz 164 9,16587
x1lP 161 9,43103
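The member table uses decimal commas, so the pI column cannot be passed to `float()` directly. The sketch below shows one way to parse rows of the form `ID length pI` and average them. It uses only the first three rows as sample data, and `parse_row` is a hypothetical helper name. To reproduce the cluster summary, all listed rows plus this protein (1dYXQ, 164 AA, pI 8,96) would need to be included.

```python
from statistics import mean


def parse_row(line: str) -> tuple[str, int, float]:
    """Split 'ID length pI' and convert the decimal comma in pI."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# First three rows of the table above, used here as sample input only.
ROWS = """5tggt 151 6,85805
1HWqf 169 9,42239
1IKfk 156 9,37894"""

members = [parse_row(row) for row in ROWS.splitlines()]
avg_len = mean(m[1] for m in members)
avg_pi = mean(m[2] for m in members)

print(f"rows parsed: {len(members)}")
print(f"avg length: {avg_len:.1f} AA, avg pI: {avg_pi:.2f}")
```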
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_40658 (5cY9d) 663 40,0% 115 2.608E-27
2 phalp2_15625 (2vc0h) 12 39,4% 119 6.706E-27
3 phalp2_18623 (3ebV) 83 44,3% 97 6.069E-26
4 phalp2_35648 (1pm6e) 9 37,1% 121 1.410E-24
5 phalp2_19030 (17y4D) 9 32,5% 132 2.645E-24
6 phalp2_34915 (4pbkX) 159 36,6% 101 5.536E-22
7 phalp2_35838 (4GGll) 4 36,5% 126 5.536E-22
8 phalp2_27075 (2pChT) 5 37,1% 121 1.038E-21
9 phalp2_5594 (4aDQQ) 125 35,1% 108 3.645E-21
10 phalp2_18230 (4Rl2g) 1 35,0% 117 1.576E-19

Domains

Representative sequence (used for alignment): 5tggt (151 AA)
Member sequence: 1dYXQ (164 AA)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01551

Taxonomy

Phage
Name: Unknown from Metagenome [NCBI]
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host: No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 1dYXQ
Method: AlphaFold v2
Resolution: 97.04
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
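The confidence legend above can be expressed as a small lookup. The sketch below is illustrative only: `plddt_band` is a hypothetical name, and because the legend uses strict inequalities, assigning the exact boundary values (90, 70, 50) to the lower band is an assumption. Reading the "Resolution" values on this page (97.04 and 90.24) as mean pLDDT scores is also an assumption, since the page does not say what they measure.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Values taken from the "Resolution" fields on this page (assumed mean pLDDT).
for name, score in [("1dYXQ", 97.04), ("5tggt", 90.24)]:
    print(name, score, plddt_band(score))
```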

The structures below correspond to the cluster representative (5tggt) rather than this protein.
PDB ID: 5tggt
Method: AlphaFold v2
Resolution: 90.24
Chain position: -