Protein

Protein accession
8dAhb [EnVhog]
Representative
4gSoB
Source
EnVhog (cluster: phalp2_30257)
Protein name
8dAhb
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MAAPLAPQVLLDALRGEGLTVVEYPGWRDRCRCHSGSHERGEGTRKPFEPLAVAVHITAGDLGSRSMATYIRDILANPNNGNTPLGCNFAVAPDGVVWLVAAGRAEHVLYMGSRAMAALKAGSMSIDSWQDLSGREHNGSRYLYGIENVNSGPANAAQQRASERIIAAICRAYGWSGRDAAGHGEVAADRGYADPGVNVGLIRRAAMNLVGAAAPAGAPSTGDGELDATQNDWLRRACAAAEVAVASTANITNGMLFLIERTVRIEARLDDPDRILTPDEVKAAVQDVVDKIYNQNPVEGEAGQ
Physico‐chemical
properties
protein length:304 AA
molecular weight:32139,7 Da
isoelectric point:5,44
hydropathy:-0,21
Representative Protein Details
Accession
4gSoB
Protein name
4gSoB
Sequence length
307 AA
Molecular weight
32554,08410 Da
Isoelectric point
5,20528
Sequence
MATPLTAGQFMAALRAEGLTVVELDGWAAHSRNHKGPWGPVHGVLIHHTGSDTKDPAAYAKSVLWAGYAGLPGPLCQVGIAPDGVVYLTGYGRCNHAGGGDPAVLDAIMADEMPYDDELTPHRGNSDGIDGNARLYGAEVMYSGGHPMTPAQYDATVRFAAAVCRAHEWTAGSVAGHREWSDDKVDPGHAPLDKLRRDVRARLAQSPEEDDMPYTPKQLAEAAWMTDGVLPAPTDAPDAKTNTFWRPVSYLTGLLKEVRALRTQVTALSEAVRALSAGQSEAVTQAVTAALATGVVRVDIDVQGVDQ
Other Proteins in cluster: phalp2_30257
Total (incl. this protein): 83 Avg length: 297,6 Avg pI: 6,51

Protein ID Length (AA) pI
4gSoB 307 5,20528
11y3C 322 6,00734
1IdPx 287 6,09043
1eoH6 282 7,67403
1fPfE 327 5,90298
1fVtj 327 7,72359
1jnsH 379 5,36176
1rdss 279 5,77827
28yFF 306 6,24083
2S92r 248 6,75573
2Y8Cc 248 6,50343
2YGZa 315 8,79357
2YpHp 252 6,75983
2gbPl 280 9,46623
3NHVI 307 5,41161
3OLhR 283 5,88098
3OQU4 297 5,64595
3OR2v 296 5,75224
3PsPp 336 5,15526
3X4CQ 249 6,36627
3X89o 328 5,45122
3eeJ3 330 5,83023
3eemW 247 7,79212
3gm1V 328 6,15966
3gm9e 329 5,94561
3gmJw 277 5,69154
4F5g9 271 5,64351
4F7aA 345 5,24581
4FdRq 308 5,10797
4FeRI 343 5,99574
4HGaD 303 5,91043
4Jjx1 258 6,21537
4JlCg 324 5,50670
4JlHp 323 6,60267
4JlRZ 323 5,71331
4Jlup 255 7,78638
4KHqw 320 8,64858
4Qrap 249 6,76062
4XBkz 252 8,37117
4XW9B 280 5,88536
4XuTG 255 6,31114
4Xv1y 267 5,03408
5BjJF 254 8,68275
5F0VH 303 9,09373
5F1ic 304 8,88370
5F3aZ 319 8,91019
5kp8j 382 5,38387
5oGFR 282 9,06279
5tG1z 282 8,17944
5txeM 282 7,62884
6CKx1 286 6,66286
6D62A 255 6,50297
6Di9O 262 6,20587
6KH5h 218 5,61878
6KIIK 299 5,51466
6KIqh 286 5,65300
6QhCV 291 5,64857
6SIYx 319 5,46282
6SNNN 347 5,87018
6SyYB 248 6,75767
6T7qd 323 6,21838
6T9GL 366 5,80868
6T9Gw 319 5,66607
6T9Yp 306 7,13099
6TaJw 249 8,71891
6Tb1X 317 5,81045
6TbVE 278 5,67005
6Tc32 328 5,81653
6TeBH 283 5,99671
6Tezb 330 5,73030
6ThBn 319 6,13772
6VbtB 361 7,18703
6W6RA 338 6,07782
72L0d 352 6,80240
7G9ro 282 5,48186
7o7IN 323 6,14682
7pyEL 323 6,44210
7uP7b 306 7,12399
7yGaz 302 6,56021
7zhwP 250 7,71785
bhW0 235 8,41192
gfTt 248 6,50218
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7405
4FizF
295 53,3% 223 1.960E-91
2 phalp2_21923
4LFwQ
4 39,8% 236 4.069E-83
3 phalp2_19071
1jII3
225 50,4% 224 1.154E-80
4 phalp2_36765
6EBV3
40 39,0% 323 2.387E-78
5 phalp2_36854
7dhe6
1 40,8% 252 2.559E-70
6 phalp2_11664
1IlHB
1 37,4% 219 1.267E-53
7 phalp2_36860
7gDKs
246 34,4% 209 1.945E-44
8 phalp2_12497
QM59
22 35,2% 204 5.359E-40
9 phalp2_24159
2eCaD
21 32,6% 208 1.938E-35
10 phalp2_12024
7Qd9Y
81 28,8% 215 2.637E-35

Domains

Domains
Ami2
Unannotated
Representative sequence (used for alignment): 4gSoB (307 AA)
Member sequence: 8dAhb (304 AA)
1 307 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
8dAhb
Method AlphaFoldv2
Resolution 81.34
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4gSoB) rather than this protein.
PDB ID
4gSoB
Method AlphaFoldv2
Resolution 90.87
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50