Protein

Protein accession
5Yfvb [EnVhog]
Representative
72d4D
Source
EnVhog (cluster: phalp2_38172)
Protein name
5Yfvb
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTIKVVDVYSGSPRSYATDPNAQGVIVKATQGTGYVNPFCDTDYQAAKNAGKLLGVYHYASGGDPVAEAKYFLANVKNYIGEAILGLDWEAGQNASWGNTNWCRTFVDYVHSQTGVYPIIYVQFSAVWQVANCSNSCGLWGAGYPTYANSWAVPAFLSSYKFAPWSNLTGWQYTGNTMDRSEFYLDADGWHKLAKGDGTVQPTTQPQPQQDKKAAAEPTSAGTTWRDSLGVTWHAENGKFTVGSGYVLHLRWGATPSSSVIAVLYGGSVVKYDAWARTNGFVYLRQPRANGQYGYVACRDARTNEAYGLFE
Physico‐chemical
properties
protein length:311 AA
molecular weight:34040,3 Da
isoelectric point:6,94
hydropathy:-0,34
Representative Protein Details
Accession
72d4D
Protein name
72d4D
Sequence length
311 AA
Molecular weight
34803,25840 Da
Isoelectric point
5,09734
Sequence
MVMYTVDVYSGSDDSIIRDPHAQGVIVKATQGTGYVNPKCNHQWDLAGELGKKRGLYHYAGGGNPVSEAQYFINNIKNYVGQGMLVIDWESYQNSAWGDSNWVRQFVDEVHRLTGVWCVIYVQESALNQVANCAKDCAVWVAKYASMNWNSWTVPDMSVSSDAFGSIAGWQYTGGDMDRSIWYLDANAWDKFAKPSDEKTQIETPKPVPAPSQNDVKYDSWTDDLGVKWFKESGKFTITVDEGIVLRWGATTSSSEIAVLPKGSVVEYDAFAHSGGYVWIRQPRGNGQFGYLPTGEDRNGTRLNYWGKFTE
Other Proteins in cluster: phalp2_38172
Total (incl. this protein): 103 Avg length: 300,5 Avg pI: 7,66

Protein ID Length (AA) pI
72d4D 311 5,09734
13aAO 310 6,21332
1FWC5 300 8,73381
1QQ64 246 9,71147
1jG22 288 9,31099
1oo3m 302 4,97650
291id 311 6,94461
2dgqB 295 8,39902
3ihW7 308 8,78828
3mvBZ 309 8,62685
3oR6O 311 5,29884
3p8eq 311 5,19442
3yyYi 283 9,67324
4xut6 306 8,12742
5K2Tt 221 5,26525
5L0lo 307 7,59474
5LVq1 309 6,94705
5LsDD 307 6,52076
5NutZ 305 6,22361
5OoOs 307 6,95621
5P6iF 302 8,20742
5QFPi 307 8,41720
5QZg4 300 8,35370
5VLne 309 7,71881
5WsUd 308 5,29884
5XdRt 307 6,52076
5osyn 302 8,20716
5tDOH 297 8,58089
5tDQO 300 8,72072
5tGbb 303 6,52406
5tTT4 310 6,95120
5tTXn 308 8,55807
5tTic 309 7,63168
5tUrq 308 6,99855
5tZfB 283 9,63288
600e6 306 8,52461
61Za1 307 8,61338
61bkj 303 8,29374
62m9i 306 8,22399
65S1S 307 6,94802
66ayH 310 8,67946
66gmu 311 5,35750
67uwS 302 6,95780
6ZCQX 280 9,38100
6ZRbN 300 8,58695
6ju8w 311 6,51491
6lYlK 302 7,65413
6m2MT 309 6,52076
6nqyn 300 8,69538
6odjo 308 6,15472
6pJaC 306 7,64691
6uX7e 307 6,10805
6vxhu 312 9,37836
71HUI 309 5,58110
71u9t 315 9,20507
72NGn 309 7,59474
72d5z 309 6,22241
73T3K 307 5,82716
79GPv 302 6,51360
7C1Oz 308 6,48950
7C4dM 284 9,36282
7POtm 246 6,72487
7XtLf 311 5,81840
7bhOr 309 6,16592
7cSvQ 259 8,33404
7etEy 304 7,71467
7exuW 300 8,71833
7fXIR 320 9,79637
7foX4 306 8,99529
7py40 307 7,58678
7qTb1 282 9,64365
7qTgA 282 9,67969
7tfP9 300 8,96235
7wQWa 309 8,67946
7wR5K 306 6,65655
7wRfV 309 8,18125
7wrCx 280 9,45173
7wyNw 309 8,16977
7wz1A 308 7,61980
7x8Xv 310 6,94802
7x91t 309 8,42313
7x951 309 8,13773
7x9cL 309 8,41134
7xRnD 282 9,51813
7y6f6 302 8,82915
7zrYt 310 5,96226
7zroC 309 7,03555
7zrqH 309 7,03533
81Xc1 305 6,59590
83DVB 303 8,23108
8ieUG 308 6,52264
8pp5r 307 8,48257
KO0s 279 9,43593
NPru 252 5,84290
Onoc 284 9,35328
ncrt 300 8,10692
ugHI 309 6,94853
uh8X 280 9,61502
uh9f 307 7,57132
yjL5 281 9,42272
A0A0A1ERA5 297 5,44247
A0A8S5M5E1 304 7,65413
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22096
5ZSom
2292 37,9% 232 1.038E-58
2 phalp2_37341
8v6XY
562 35,8% 237 2.090E-56
3 phalp2_12277
79IyY
11 36,8% 209 2.533E-55
4 phalp2_4019
13DUc
70 35,4% 237 6.117E-52
5 phalp2_12276
79roT
46 35,9% 217 6.515E-50
6 phalp2_30879
zkiN
42 37,1% 210 1.189E-44
7 phalp2_17635
6JmE3
482 34,5% 301 1.042E-43
8 phalp2_32821
2L8bh
41 29,5% 240 4.288E-42
9 phalp2_36067
6rv4w
74 33,0% 233 1.085E-41
10 phalp2_27394
4HDMm
9 32,9% 231 1.757E-40

Domains

Domains
Representative sequence (used for alignment): 72d4D (311 AA)
Member sequence: 5Yfvb (311 AA)
1 311 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01183, PF08460

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (72d4D) rather than this protein.
PDB ID
72d4D
Method AlphaFoldv2
Resolution 90.03
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50