Protein

Protein accession
9RE2 [EnVhog]
Representative
4W6lO
Source
EnVhog (cluster: phalp2_16283)
Protein name
9RE2
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKGGIKVAATKGIDCAVPLTAEKAKEMAAAGMRFVCRYLVPVSMAWKRLTRAEAEAITAAGMKIVSVFQRGANDAAGGAPNGTRDGKAAYQEAKAIGQPAGTAIYFAVDFDAQPKDYDAIEAYLRAAAKELPGYNVGVYGSYAVVEEMARRGACAHFWQTYAWSKGRLSAATNIYQYKNGQTLAGHTVDFNESFGDEGWWDTSMKNVDKPVNNSVDKEAAEKVISVLGSLWMASADKKVQDAAHYAANALRDAAGIPRP
Physico‐chemical
properties
protein length:259 AA
molecular weight:27731,1 Da
isoelectric point:8,60
hydropathy:-0,25
Representative Protein Details
Accession
4W6lO
Protein name
4W6lO
Sequence length
244 AA
Molecular weight
27005,50950 Da
Isoelectric point
5,25433
Sequence
VAAQIAAAGFKFVGRYLVPKRYAKRITAQEAQILTDAGLLILSIFEATSNRAAGGEPYGIEDGETAFACAVELKMPKSAAICFAVDFDMKSYDVLEAYLRAAKKKIGAHPVGVYGSYYVVEEMARRGVCDFYMQCIAWSSGNVSSRANVYQYAWDETLAGIGVDLNYLYNGTGLWNYKEDNMTGEDIYNALTDYLREQPCPPWAQAELQEAIKMGITDGTRPCELMPRYQAAIMAKRAAEEARK
Other Proteins in cluster: phalp2_16283
Total (incl. this protein): 111 Avg length: 266,4 Avg pI: 6,34

Protein ID Length (AA) pI
4W6lO 244 5,25433
15GfP 266 4,73244
19jC6 279 4,78462
1GxXm 291 6,03598
1IaBq 326 9,36444
1NNPD 261 5,20568
1fPga 258 8,52254
1zT9v 284 8,62273
23iO 281 6,13437
2AXRd 273 7,67522
2AYMM 291 6,80109
2G2bp 248 7,09654
2G5B7 252 8,38181
2G69O 330 6,90408
2G6lI 301 7,74615
2mrkz 323 6,21088
3N3RW 252 8,77771
3N6DJ 298 7,74030
3Ncon 249 8,65644
3V45V 252 6,32518
3WCKi 263 5,36659
3ZIWy 268 4,87601
3ZMCw 279 5,01157
3bTJB 281 4,79740
3fLdK 249 5,81511
3gHg6 269 4,94280
3gJDg 291 4,87039
3gN7c 269 4,95252
3iqSy 257 5,02919
406cd 269 4,95246
407Mn 277 4,35253
407O8 263 4,59665
40eAj 234 6,91409
40ew1 264 4,79007
40g8R 297 5,19545
40mqn 275 5,15180
41mWy 262 5,35590
4Hy1G 234 5,58872
4KwtV 253 4,65059
4Kyqj 249 4,41403
4LNAR 251 4,73568
4MKpK 246 4,44455
4MPyO 279 4,65491
4Ohy0 244 5,52114
4T0Jg 245 4,73170
4k9Jy 281 4,72971
4kdWs 263 4,77097
4klEC 254 5,72439
4kpbN 264 5,20761
4lCGi 278 9,03713
4lFW7 287 8,21077
4lG7N 249 4,77859
5EEed 267 5,07938
5ip4M 266 5,12457
5nMG5 204 5,00998
5uc15 269 5,07171
6FrYx 269 8,98311
6PuLJ 235 8,44402
6xYd 255 5,10615
71QQY 238 9,35515
71XrZ 251 6,84156
72956 252 9,37069
72bBK 295 9,26167
72bDA 296 8,84160
74iNO 248 8,40901
752yq 250 8,87132
76rsi 266 4,70220
7BOO1 283 9,21016
7DG1z 276 5,20284
7DIQR 272 5,45929
7F9Nn 246 8,62846
7JnGJ 298 6,12903
7Ut9F 299 9,67930
7bgwr 266 6,63285
7cUVM 229 5,97431
7cb0D 257 8,70318
7ckCU 251 6,91761
7dKkz 225 7,73501
7epJO 251 7,62202
7nTHm 268 8,74618
7ofze 249 5,12963
7s6DO 254 5,64794
7sBMr 251 6,83986
7sBPt 252 8,27486
7skvM 250 5,74014
7teUz 273 5,37216
7vEgK 256 5,34709
7wEFz 252 6,96524
7x2Db 265 4,61791
7x2EG 252 5,24223
7xRpx 248 5,63493
7y5li 266 4,52941
7yjO2 246 5,60543
7yjPw 250 4,98537
8J6yE 247 8,57940
8fAkW 265 4,75341
8m9JQ 330 6,94438
8m9vL 262 4,69936
8ma3P 328 6,07640
8ma9Y 320 5,12025
8maIj 261 4,40800
9S7Z 287 9,10837
DodU 293 5,65851
aJT2 246 5,25280
fnzD 247 8,46304
foRK 298 6,92057
gY4N 273 8,52151
giRd 289 6,15904
o5b3 263 5,02215
okwP 258 5,22108
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_6406
boMu
55 39,8% 178 4.262E-51
2 phalp2_8820
2pMjJ
148 39,2% 176 1.339E-49
3 phalp2_14477
4KyKr
1 32,8% 195 5.741E-48
4 phalp2_4951
6F0j3
21 34,8% 201 3.362E-46
5 phalp2_30729
7s2VN
6 31,5% 228 1.436E-44
6 phalp2_2850
giZX
26 36,1% 188 3.276E-43
7 phalp2_29092
6Wdu0
51 32,7% 180 9.819E-39
8 phalp2_7041
8m8Yy
2 27,3% 183 1.833E-38
9 phalp2_31044
1EaUP
5 33,5% 176 3.035E-37
10 phalp2_39756
fRRp
6 35,3% 198 1.056E-36

Domains

Domains
Rv2525c
Unannotated
Representative sequence (used for alignment): 4W6lO (244 AA)
Member sequence: 9RE2 (259 AA)
1 244 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF08924

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
9RE2
Method AlphaFoldv2
Resolution 92.53
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4W6lO) rather than this protein.
PDB ID
4W6lO
Method AlphaFoldv2
Resolution 95.72
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50