Protein

Protein accession
1gAL5 [EnVhog]
Representative
1gBcI
Source
EnVhog (cluster: phalp2_26341)
Protein name
1gAL5
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MALLPISINTYDAFRADALARAAQNLGYDVDGYYGYQCWDLAAELWMNISAFQDGQLWPKTGPNLAAMECWTVSKFQNAGTEFDLIYGIIELKRGDVIVLGPSAISNVGHIAFCDEDYNGATSMNLLGQNQVNPDINYGHIPTVTNLDISQFLGAFRYKAWETTPPIVASYKPQKKRFPWVLYANKLRSKM
Physico‐chemical
properties
protein length:191 AA
molecular weight:21402,1 Da
isoelectric point:5,31
hydropathy:-0,14
Representative Protein Details
Accession
1gBcI
Protein name
1gBcI
Sequence length
186 AA
Molecular weight
21341,32380 Da
Isoelectric point
5,82119
Sequence
MAEFHGHVSVPYSSYTVFKNAVLGNWYDWDGYYGAQCWDGVQLLYGQVGQTLYTGPNSRASECWTVPESRILNGSGHFYIVNGVQNIKRGDVIVFNRNTDWTHSAGHIGYADEDYNGTNYLRILSQNYENPSATYGSAFTLDSVSLTPFLGIFRFDEWQAPQPEPEEKKKRKFPWVIAWNHWKGYT
Other Proteins in cluster: phalp2_26341
Total (incl. this protein): 110 Avg length: 177,9 Avg pI: 7,14

Protein ID Length (AA) pI
1gBcI 186 5,82119
1FXTv 154 7,64765
1gBUE 190 8,73864
1gz8r 204 6,62972
21DkL 185 5,57553
21fAm 154 6,68849
21rX2 192 6,01961
21tw7 185 9,29887
21y5p 192 5,67176
23LH2 154 6,69156
23tVL 186 8,52067
24S91 189 5,74099
24Tyx 185 8,35112
24r7l 186 9,25742
2Vi7D 188 7,77529
2miNE 163 8,31089
2tkb 162 5,50039
3TD9q 190 9,44199
3TJcm 190 8,76868
3TM4M 183 9,09734
3WIMH 186 9,51787
3WM7Q 188 8,40237
3ZZfI 180 7,00162
3bXMG 199 5,29963
3dQQy 192 6,80888
3dQng 198 5,47356
3dRZD 198 5,44236
3dZ3B 191 9,06569
3fPeK 189 8,97395
3fUPP 196 8,78519
3gH5D 186 5,12451
3gOLK 183 5,35039
3gSY7 191 8,33049
3gTRL 198 8,32933
3ikn4 203 5,41502
3ioE3 200 6,03422
3p3MA 162 6,79825
3pHwI 159 6,87265
3qpcp 159 7,64293
3vMFk 159 7,64714
3xWSG 154 7,62242
3yHNZ 162 6,69145
406BI 189 8,82168
408gI 192 8,97395
40gMg 192 8,79125
40idA 208 9,38126
41e5M 191 8,63343
41fdL 186 9,05125
41iF5 183 7,67835
41mm8 204 8,96119
4MdIx 194 5,89110
4MdNe 201 6,16825
4MdWz 176 8,64349
4Mf83 187 9,19553
4Mg71 188 8,88673
4Od6F 198 8,91522
4ka2N 189 8,97395
4ker2 186 5,20369
4khIL 184 8,71801
5EAP 183 5,55768
5K0WN 163 6,23100
5KP2Z 166 4,77598
5M0jZ 166 4,90250
5MBqP 166 4,72141
5P6IA 118 6,21099
5PO7v 159 7,63032
5Wa3p 159 7,63032
5ZAZQ 166 4,72141
616zJ 159 6,79211
63OOT 162 6,02558
64S0h 162 6,02558
6abpu 166 5,38290
6bSPD 166 4,90250
6cEBO 163 5,78538
6cQEY 160 8,31089
6dA3m 163 7,62992
6dKSN 162 6,24032
6eFbs 159 6,68560
6hGnc 159 6,02240
6ixEM 163 6,79109
6kEAh 163 6,68247
6mxZ9 141 4,53140
6o4hO 163 6,68287
6pks8 159 6,68525
6u5Vg 159 6,87265
6u78p 163 7,64674
6uaFq 161 8,62324
6unin 159 7,64293
6vHp3 166 5,38290
7DI3J 193 4,58301
7DV71 186 9,08361
7JGpf 211 9,78683
7UlqQ 159 6,68304
7WWYH 162 5,76048
7WjTn 155 6,79342
7qBCc 167 4,88516
7sfOX 155 5,09484
812M2 184 7,75877
82GvS 186 9,39287
835U4 184 5,26536
84hUh 198 5,66897
85mAF 201 9,16226
878ks 184 8,65715
87Ime 145 6,36980
8bfav 187 5,29759
8bsPT 203 9,28082
8lzOY 201 9,14428
8nazn 194 5,22870
8oPMR 201 9,16213
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8366
ouh8
1 36,2% 182 6.579E-41
2 phalp2_15706
3dX4M
3 29,4% 170 3.536E-38
3 phalp2_10293
1gHkh
4 28,7% 174 1.996E-24
4 phalp2_31615
4y86g
15 34,1% 120 2.158E-22
5 phalp2_24703
6bxM0
24 29,9% 147 3.570E-21
6 phalp2_34078
8lpKm
17 26,1% 176 3.158E-20
7 phalp2_31225
89YgB
257 26,0% 169 5.468E-17
8 phalp2_11577
18WTO
29 27,0% 155 1.016E-16
9 phalp2_21248
1kqFW
1 28,8% 118 3.619E-14
10 phalp2_5181
3w0XN
5 23,4% 149 6.706E-14

Domains

Domains
Unannotated
Disordered region
Representative sequence (used for alignment): 1gBcI (186 AA)
Member sequence: 1gAL5 (191 AA)
1 186 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (1gBcI) rather than this protein.
PDB ID
1gBcI
Method AlphaFoldv2
Resolution 87.18
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50