Protein

Protein accession
5mBsS [EnVhog]
Representative
4cukV
Source
EnVhog (cluster: phalp2_5598)
Protein name
5mBsS
Lysin probability
99%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MCQADKFIATAVGEIGYIEGPADNETKYQKANQPWCGAFVNWCAKQVGLKIPDCTYTPAGAKAFAEAKRWQLVADATPLPGDLAFFDFPAD
Physico-chemical properties
protein length: 91 AA
molecular weight: 9875,1 Da
isoelectric point: 4,73
hydropathy: -0,16
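The length, hydropathy, and molecular weight above can be recomputed from the protein sequence. The sketch below assumes that "hydropathy" is the GRAVY score (mean Kyte-Doolittle value per residue) and that the molecular weight is the sum of average residue masses plus one water; the page does not state which method it uses, so both are assumptions. The function names and the mass table are illustrative and not part of the database.

```python
# Sketch: recompute length, GRAVY and molecular weight for 5mBsS.
# Assumptions: hydropathy = mean Kyte-Doolittle score; molecular weight =
# sum of average residue masses + one water. The database may use other tables.

# Kyte-Doolittle hydropathy scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in Da (amino acid minus water).
AVG_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "Q": 128.1307, "E": 129.1155, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.0153  # one water added for the free N- and C-termini

# Protein sequence of 5mBsS as listed in this record.
SEQ = (
    "MCQADKFIATAVGEIGYIEGPADNETKYQKANQPWCGAFVNWCAKQVGLK"
    "IPDCTYTPAGAKAFAEAKRWQLVADATPLPGDLAFFDFPAD"
)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy per residue."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da of the unmodified chain."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


if __name__ == "__main__":
    print("length:", len(SEQ), "AA")
    print("hydropathy (GRAVY):", round(gravy(SEQ), 2))
    print("molecular weight:", round(molecular_weight(SEQ), 1), "Da")
```

The isoelectric point is left out because it depends on the chosen pKa set, which this record does not specify.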
Representative Protein Details
Accession
4cukV
Protein name
4cukV
Sequence length
91 AA
Molecular weight
9622,57690 Da
Isoelectric point
4,58051
Sequence
MADQGTAARLIEVAKAELGTIEGPKDNETKYGAFTKANFQPWCGSFVNWCGNESAVKIPNTVYTPSGAQAFKKAGSWIDGDVADPEPGDIA
Other Proteins in cluster: phalp2_5598
Total (incl. this protein): 91
Avg length: 86,1 AA
Avg pI: 6,30

Protein ID Length (AA) pI
4cukV 91 4,58051
15oYs 118 4,66383
19IFK 63 6,14983
1AIYY 79 6,18069
1JLQo 71 7,64436
1JPtW 78 9,21783
1JZFx 105 9,46907
1Kj7F 70 6,70026
1L5aw 69 6,02092
1LN9G 88 8,63614
1Mi7c 84 9,51478
1WJtw 77 6,70651
1mPkX 96 4,61660
1tL7o 80 6,53099
1y8nC 108 8,98156
20y2T 103 5,15964
2X1aO 104 5,47578
2Xliq 81 8,61731
2YAI9 62 5,56348
2j76E 84 6,54566
30Hy0 54 4,65582
32eZk 71 8,70937
349NW 68 6,06543
3Tera 72 7,71330
3XAqP 65 7,78689
3bkkE 86 6,55322
468Kf 67 6,06565
46gIX 64 5,58258
4ClLK 123 4,39442
4DfKp 111 4,55953
4Ghhw 82 5,21193
4YnMF 86 8,59771
4a35A 79 5,66806
4a5Ze 94 6,05002
4aqCM 98 4,96252
4b3wS 93 5,71354
4b4U8 75 5,51335
4bBcA 87 8,66154
4blM9 80 9,39235
4h2xA 107 5,91628
4l8GT 104 8,63788
4lKr8 82 6,89050
4ljYJ 51 4,65582
4m4Nj 107 5,49937
4nn6G 107 5,91628
4qMMf 70 5,49709
4rhtW 96 5,54427
4wiou 88 4,79218
51sZS 47 6,08771
53p1B 130 4,35184
561jk 186 9,03823
56R5c 73 5,57320
56Yjw 77 6,16353
59KkW 85 4,65150
5a0Cp 100 5,50374
5a9Gm 102 4,31501
5ePeo 88 5,69921
5eVAV 73 6,88664
5f4Yo 77 6,53980
5gxJ1 117 6,81826
5lPGD 66 6,70066
5lWIS 76 6,70839
5m8L2 104 4,83588
5nmrL 106 4,67242
5o2ho 78 9,09902
5uFox 72 5,68870
5uHAa 48 6,11419
5vDhO 56 4,44068
5x01o 56 4,65582
5xCRU 106 5,20841
5zaKp 96 5,52864
5zhkU 96 5,49232
6A9hg 107 5,44372
6IK9A 87 6,17467
6LZcx 66 5,25882
6SW3x 141 6,07321
6y6qh 72 7,79947
Fsp7 97 9,46958
Gf8h 68 8,66160
H9sq 66 5,22085
ICO4 66 4,81502
PhGp 65 8,85249
QBwy 84 9,75776
QvyX 51 4,35883
She5 116 6,03411
T0aU 111 5,04164
TLlh 53 5,05375
aGyT 111 4,55953
cs70 120 5,08927
iTKN 70 7,73808
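The member table uses decimal commas for pI, so it needs a small conversion before any numeric work. The sketch below parses rows of the form "Protein ID, Length (AA), pI" and averages them; `parse_row` and `SAMPLE_ROWS` are illustrative names, and only the first three rows are included, so the printed averages will not match the cluster-wide figures reported above.

```python
from statistics import mean


def parse_row(line: str) -> tuple[str, int, float]:
    """Split one 'ID length pI' row and convert the decimal comma in pI."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# First three rows of the cluster table, copied verbatim from this record.
SAMPLE_ROWS = """\
4cukV 91 4,58051
15oYs 118 4,66383
19IFK 63 6,14983"""

records = [parse_row(line) for line in SAMPLE_ROWS.splitlines()]
mean_length = mean(length for _, length, _ in records)
mean_pi = mean(pi for _, _, pi in records)

if __name__ == "__main__":
    print(f"rows parsed: {len(records)}")
    print(f"mean length: {mean_length:.1f} AA")
    print(f"mean pI: {mean_pi:.2f}")
```

Feeding the full table through the same function would give values comparable to the reported averages, provided the queried protein itself is also counted.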
Similar Clusters (pHMM search)
# Cluster Protein ID Members Identity (%) Alignment Length E-value
1 phalp2_12059 59yJo 3039 74,4% 90 1.031E-50
2 phalp2_6238 6zvaU 2 56,9% 65 2.364E-23
3 phalp2_38264 e08l 2 54,2% 59 1.232E-19
4 phalp2_21101 nxGw 15 38,3% 86 1.810E-16
5 phalp2_23906 1KSIX 44 33,7% 89 1.118E-14
6 phalp2_22593 1IoI4 1 32,1% 87 1.200E-11
7 phalp2_40106 8bJB6 1 30,0% 60 3.068E-07
8 phalp2_33888 1NlRy 1 30,5% 85 5.784E-07
9 phalp2_30145 3jVMo 14 30,0% 113 3.874E-06
10 phalp2_5034 1m22G 1 27,6% 105 6.703E-05
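The similar-cluster hits can be filtered by identity and E-value to separate close relatives from marginal matches. The sketch below holds the ten hits listed above as records and keeps those passing two thresholds. The thresholds (50% identity, E-value 1e-10) are illustrative assumptions rather than cut-offs used by the database, and the `protein_id` field assumes that the identifier listed with each cluster is a protein ID.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    protein_id: str  # assumed meaning of the identifier listed with the cluster
    members: int
    identity: float  # percent identity
    aln_len: int
    evalue: float


# The ten pHMM hits from this record, with decimal commas converted.
HITS = [
    Hit("phalp2_12059", "59yJo", 3039, 74.4, 90, 1.031e-50),
    Hit("phalp2_6238", "6zvaU", 2, 56.9, 65, 2.364e-23),
    Hit("phalp2_38264", "e08l", 2, 54.2, 59, 1.232e-19),
    Hit("phalp2_21101", "nxGw", 15, 38.3, 86, 1.810e-16),
    Hit("phalp2_23906", "1KSIX", 44, 33.7, 89, 1.118e-14),
    Hit("phalp2_22593", "1IoI4", 1, 32.1, 87, 1.200e-11),
    Hit("phalp2_40106", "8bJB6", 1, 30.0, 60, 3.068e-07),
    Hit("phalp2_33888", "1NlRy", 1, 30.5, 85, 5.784e-07),
    Hit("phalp2_30145", "3jVMo", 14, 30.0, 113, 3.874e-06),
    Hit("phalp2_5034", "1m22G", 1, 27.6, 105, 6.703e-05),
]


def strong_hits(hits, min_identity=50.0, max_evalue=1e-10):
    """Keep hits at or above min_identity and at or below max_evalue."""
    return [h for h in hits if h.identity >= min_identity and h.evalue <= max_evalue]


if __name__ == "__main__":
    for hit in strong_hits(HITS):
        print(hit.cluster, hit.protein_id, f"{hit.identity}%", hit.evalue)
```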

Domains

Unannotated
Representative sequence (used for alignment): 4cukV (91 AA)
Member sequence: 5mBsS (91 AA)
Axis: residues 1 to 91 (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4cukV) rather than this protein.
PDB ID: 4cukV
Method: AlphaFoldv2
Resolution: 96.74
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
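The confidence bands above map a pLDDT score to a label. A minimal sketch of that mapping follows. The legend does not say which band the boundary values 90, 70, and 50 belong to, so assigning them to the lower band is an assumption, and `plddt_band` is an illustrative name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence label used in the legend.

    Assumption: a score exactly on a boundary falls into the lower band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```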