Protein

Protein accession
5wg3n [EnVhog]
Representative
5wg3n (this protein)
Source
EnVhog (cluster: phalp2_20765)
Protein name
5wg3n
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAKSLNGWTAIEKSSDKDLRVIAIPGTKRTIRMQKDAAPLFAAFYADWQREMPARMNLDPGPTDGWNYRKSRATTGLSNHSSGTAVDVLYTSVLPADGKPHMTKQEKEILDRILGRYVTGDGHRVLANGEWWNPPHCDGMHTELSQSWDRGALRNTNIEDVREVIKRLHIDNDGNRPLGMWDSVVPLYQNVITAEANLTANDAVWRLASRLADLGFFKGTPVRGIQKYPTKAVAAWQESIGAQGTGKYGPIAHEKIFA
Physico‐chemical
properties
protein length:258 AA
molecular weight:28723,2 Da
isoelectric point:9,10
hydropathy:-0,53
Other Proteins in cluster: phalp2_20765
Total (incl. this protein): 29 Avg length: 274,3 Avg pI: 9,62

Protein ID Length (AA) pI
2CZDQ 273 9,83164
2TbxL 255 9,09715
2ZrpR 282 9,23440
2iE7T 300 9,73004
30Voz 242 9,93163
461gR 291 9,87735
463iC 291 9,90513
466R9 227 8,95899
4683c 291 9,97669
46YY0 260 6,79296
46ZX6 310 9,73874
46diJ 278 9,80521
4718f 261 9,77323
473Fn 275 9,59014
4aho6 302 10,09460
4bCXL 291 9,97540
4fLFh 263 9,84331
4rfKt 272 9,84659
4rny3 307 9,93569
4vYLF 266 10,04438
5A83R 260 9,96754
6ETe2 262 9,14170
6Mq8l 258 9,23157
6U5NA 271 10,20033
7FvNl 258 10,40831
RUwL 282 9,61283
U2nH 291 9,95323
U32Q 277 9,39332
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_15931
4J3RB
2491 32,0% 209 4.168E-31
2 phalp2_5098
1IyeS
221 31,6% 177 2.812E-22
3 phalp2_34934
4zLgX
5 31,4% 178 4.403E-21
4 phalp2_14627
5zyHC
3 26,1% 176 3.723E-20
5 phalp2_12264
727mo
153 25,8% 263 4.249E-19
6 phalp2_19831
5lDZw
29 24,7% 226 1.348E-13
7 phalp2_36143
6QfrR
13 25,9% 185 3.302E-13
8 phalp2_34485
4IjU6
4 23,2% 250 3.835E-11
9 phalp2_38848
1Icxu
17 24,3% 181 9.311E-11
10 phalp2_11272
6Q92e
9 21,5% 260 5.919E-08

Domains

Domains
Unannotated
Unannotated
Protein sequence: 5wg3n
1 258
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
5wg3n
Method AlphaFoldv2
Resolution 95.71
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50