Protein

Protein accession
3bSqz [EnVhog]
Representative
4ykqu
Source
EnVhog (cluster: phalp2_13420)
Protein name
3bSqz
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNKKPVYYMQTDGRWSGKRYPCIGGTMSIGGGGCGETSAAMLIATLIGRDVFPTETMEWACANNYVVAEQGTSYQPRDYFVEQFKQYGLKCERLTNEVCMNRNSPVREKVIRKLEEGNYIIALMKPKSVDPFIRGTWTGGGHFIVVWWADNKIRINDPASTSDRRTNGDPDTFFSEAKYFWVIDAKSHNKGDELDMTKKEFLASLTPEEKAEIVNGAQEYFATQDLPKWAEAEMEEAKRLGITDGTRPMQLIPRYQAAIMAKRAMMDGKKLNLG
Physico‐chemical
properties
protein length:274 AA
molecular weight:31005,1 Da
isoelectric point:7,56
hydropathy:-0,55
Representative Protein Details
Accession
4ykqu
Protein name
4ykqu
Sequence length
243 AA
Molecular weight
27256,60920 Da
Isoelectric point
8,57173
Sequence
MNKKPVLYLQTDSRWANKPYQTTGETTTIGKSGCGPTCAAMLLETLTGKTITPVDTCAWSVNHGYKAANQGTYYSYFKPQFAAYNIECKQLNGATIYGFPLSPVHTQAFELLKQGYYLIACMGKGTWTSSGHFVVVWWEDGKVRINDPASTKDSRVNGDLATFKSQVKYYFAVDARKYNNPTEEKEDDNMVYYEKLTDVPSYYKDAIQKLVNDGTLKGDGNGKINVSEDMCRIMTILNRKGLL
Other Proteins in cluster: phalp2_13420
Total (incl. this protein): 106 Avg length: 257,1 Avg pI: 7,84

Protein ID Length (AA) pI
4ykqu 243 8,57173
137L2 270 6,44176
139i5 292 8,80911
13BUI 293 9,22454
13atM 268 8,69977
13cbc 291 9,08007
13riQ 266 6,52878
13vGB 231 7,53648
1FI6Q 240 8,34016
1Ggn1 259 6,32853
1aeNz 234 8,84953
1cjTb 237 8,45982
1jHpP 217 7,00190
1nu6X 264 6,18212
21PCc 271 8,35299
21nmd 259 9,04564
23Amu 261 8,96376
23BQI 261 8,92650
23LyA 261 8,26583
23Qz3 277 9,20501
24HP6 281 6,71612
24mUA 305 7,58047
2mptD 281 8,28762
2ov9Y 241 6,93739
38Irp 262 8,93791
38JJt 261 8,21322
38LRv 268 5,25530
38MBN 265 8,98755
3GHkZ 240 10,15114
3PEi5 240 8,96441
3PX5T 249 9,14976
3TQxs 269 8,61699
3fQms 259 5,20170
3gWD9 254 7,72893
3lBnk 264 9,24962
3lGH8 266 6,95893
3pgqQ 264 6,00296
3tlE2 264 6,18564
3wks0 263 9,14595
3wlXj 264 6,18700
40JFL 262 8,80782
4Me4P 265 8,61641
4UgoI 199 5,42889
4UhCU 199 5,42889
4kYDv 262 6,10010
4y7yP 274 9,13293
4ycjL 243 5,47549
4ycvS 265 5,95459
4yfdE 273 5,98255
4yiyX 208 5,59906
4ylCQ 251 5,29867
4z3o0 264 6,00483
5CHU 240 7,65055
5JUMv 263 9,01405
5MoHj 263 9,36456
5NlvT 263 8,95164
5PA8a 266 5,44378
5QC6i 240 7,66828
5ROHB 263 8,83812
5RqCY 255 5,36932
5XiGY 240 7,04203
5XkSt 240 10,11201
5ZXjN 226 9,07214
65LFk 264 9,35315
67PG6 263 9,06504
6cJiP 242 9,14266
6dAFe 266 8,19659
6ezl8 233 9,08742
6gpKg 240 7,04198
6h7bB 256 7,65487
6kcqQ 242 8,17667
6oQCt 263 9,15640
6qCk3 263 9,14595
6qG6j 261 9,15549
6rXfQ 268 6,00410
6tKFB 226 8,80324
6uazU 263 9,02340
6ugIa 255 5,36932
6xWw 257 5,83318
71aQX 255 5,80175
7E00e 301 9,29094
7q1fC 245 8,70247
7q1fU 248 9,11591
7qKC8 237 8,80491
7qKCS 237 8,65696
84Qd4 263 8,98633
85i32 265 9,18953
85uuH 261 8,48696
87pUE 241 9,06047
8b709 246 6,94728
8beFU 263 8,88402
8cwhI 265 9,18953
8fwB7 202 6,83713
8nesY 259 5,73474
8oX35 247 9,53238
8pIR5 264 6,00762
8qtJo 269 8,48896
8rGQA 252 8,44054
8tKdL 263 8,96099
8tpMo 263 6,33507
R9ls 237 8,45337
nivf 336 9,22647
onNF 287 8,81620
wKJw 264 5,84012
z7yp 264 6,00296
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1956
4xpMM
15 60,8% 156 1.033E-74
2 phalp2_8229
7unbc
17 52,0% 173 1.939E-74
3 phalp2_9514
139E1
4 41,9% 229 3.637E-74
4 phalp2_35879
4ObCY
19 40,2% 174 7.677E-51
5 phalp2_18628
6wOj
1 34,0% 182 2.955E-48
6 phalp2_32246
7skyw
22 32,1% 205 7.472E-37
7 phalp2_6228
6rZ6J
17 31,8% 223 2.456E-33
8 phalp2_6559
13HvX
2 26,4% 174 2.518E-18
9 phalp2_28626
8lgNw
11 31,7% 173 4.992E-15
10 phalp2_14857
7BuAn
6 32,8% 152 7.541E-14

Domains

Domains
PET_C39
Unannotated
Representative sequence (used for alignment): 4ykqu (243 AA)
Member sequence: 3bSqz (274 AA)
1 243 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13529

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4ykqu) rather than this protein.
PDB ID
4ykqu
Method AlphaFoldv2
Resolution 94.93
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50