Protein

Protein accession: 5W5xD [EnVhog]
Representative: 4Legl
Source: EnVhog (cluster: phalp2_7443)
Protein name: 5W5xD
Lysin probability: 99%
PhaLP type: endolysin (probability: 98%, predicted by ML model)
Protein sequence
MNVMNEHALKAIESAKSQLGNPYVFGTWGRECTPSLRRQYAGYNPSHKRAIFKACPVLSGKQPSCKGCKWEGKLAFDCRGFTYWCLLNAYGMKLKGGGCTSQWGYNANWLKKGDIKDMPNVVCCVFQYYKGKYQHTGLHIGDGKIIHCSNGVQWGDISEKGWTHYAIPAGLYTAEEVKMAGKPETDKAENVPPTLRKGSRGDDVKKLQKTLNAMGYDCGTADGIFGAKTEIAVRSFQQANGLAADGIAGKNTLALLYSKEGKEEGTYKVVIGGLAYEAAAELASKYKGTLEKEG
Physico-chemical properties
protein length: 294 AA
molecular weight: 32084,3 Da
isoelectric point: 9,07
hydropathy: -0,48
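The hydropathy value above is plausibly a grand average of hydropathy (GRAVY) on the Kyte-Doolittle scale, although the page does not name the method, so that is an assumption. A minimal sketch of such a calculation follows, together with a helper for the decimal-comma numbers used throughout this record. The function names `gravy` and `parse_decimal_comma` are hypothetical and not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy index per residue (standard published scale).
# Assumption: the page's "hydropathy" field is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def parse_decimal_comma(text: str) -> float:
    """Convert a page value such as '32084,3' or '-0,48' to a float."""
    return float(text.replace(",", "."))
```

Passing the full protein sequence to `gravy` and comparing the result with `parse_decimal_comma("-0,48")` would be one way to check whether the page really reports a Kyte-Doolittle GRAVY score.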
Representative Protein Details
Accession: 4Legl
Protein name: 4Legl
Sequence length: 323 AA
Molecular weight: 35778,13610 Da
Isoelectric point: 5,54182
Sequence
MMNDKAAQIVRIAEEHIGCPYVYGTWGQLCTVALRKRYAGYNPDQKDITYQRCQRLRSSNQQKTCDGCPYQGMLAFDCRGFTHYCALNGAGIDIYGGYVQLQYATKSNWDELGAIDEMPDLVCCVFVYKNGKWKHTGLHIGGGRIIHCSGEVKRDTVGGKNSWTHYAIPKGLYTPEEIANAHKNGGVSNMLLRKGSKGAAVSALQELLNAWYEEYRHPLDYMPLTVDGVFGSITKSAVEAFQYASGLEVDGIAGEQTQMMLAAYNAKPTGPALIDDTATTPELPEDDDEIDEPIQMVQLTRAEVERIRAGLREIESILTRAIS
Other Proteins in cluster: phalp2_7443
Total (incl. this protein): 91; Avg length: 294,4; Avg pI: 7,48

Protein ID Length (AA) pI
4Legl 323 5,54182
1377D 290 9,01379
1cMQ9 292 6,92949
21D2j 310 7,93781
21hQE 281 7,01435
21iZz 299 6,38458
21pEj 323 6,66235
21sbS 197 9,01360
23DSa 288 9,03539
23KI3 301 6,26754
23LU1 282 5,76344
23fbf 285 6,32410
24APp 286 9,32762
24GMk 307 7,24034
24H3Z 283 7,89243
24HAR 312 5,33436
3Js7V 291 8,04309
3KTwb 291 8,04702
3TDzW 290 6,31410
3TJn0 283 7,93098
3TOft 299 7,94097
3VE4B 281 7,48532
3WEsM 322 6,28102
3WM2v 321 8,36492
3WNFG 302 5,42457
3ZPMO 298 8,78570
3ZSXs 284 5,99387
3ZqwL 333 7,47924
3Zwh2 296 6,06963
3dPhn 285 6,18712
3dW2V 281 7,01435
3gKb2 304 7,45360
3gOJY 287 9,62295
3gPtl 303 7,46042
3gQn0 300 7,97437
3inH5 327 6,47501
3irqs 296 8,26963
3iuAS 277 8,87660
3uBS5 291 8,63852
3uaOP 290 8,85153
3vG62 290 8,60352
3vLMB 295 8,35170
3vOan 296 8,59997
3vsYQ 291 8,31760
3w6AB 225 6,51332
3ygqg 291 6,64939
3yxBx 291 8,01228
40eEL 318 5,93327
40exZ 308 5,89377
40gg6 310 6,09714
414EX 293 5,99597
4154A 291 7,01231
41cHb 281 7,44280
41dj1 299 6,59960
41ekt 283 7,47714
41kUP 282 6,97957
4DrGf 232 9,10688
4HxW8 280 5,97710
4L0P2 294 7,01941
4L0yz 292 6,92898
4L4Fv 303 5,05864
4LbXO 215 6,30671
4UgI9 430 5,56104
4k4uU 288 9,62637
4k7rK 303 8,45608
4kdPl 291 6,44897
5R6RK 291 9,00335
5VGsR 293 8,85088
639pa 295 8,72420
64AdT 300 8,19923
64sy2 293 8,94668
65kMT 300 8,38922
6PoJO 271 6,47785
6Pt8T 327 5,62129
6ULXf 305 9,17335
6k847 302 8,84192
6mlIR 293 9,01734
6vHds 298 8,92773
7DETd 274 9,36566
7WWBR 282 5,70075
7YEZr 310 6,51781
7YEZw 310 7,45565
8aZm1 291 8,29252
8ePYK 297 8,80724
8n5Ho 291 5,47083
9uTv 294 5,38876
BYZi 291 8,48483
Zzlq 305 7,04311
a4OG 288 9,03539
n6P1 334 8,00531
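The member rows above share a simple layout: protein ID, length in AA, and pI written with a decimal comma. A minimal sketch of parsing such rows and averaging them is shown here, assuming three whitespace-separated fields per row. The helper name `parse_member_row` is hypothetical, and the three sample rows are copied from the table, so their averages will not match the cluster-wide figures reported for all 91 proteins.

```python
from statistics import mean


def parse_member_row(line: str) -> tuple[str, int, float]:
    """Split 'ID length pI' and convert the decimal-comma pI to a float."""
    protein_id, length, pi = line.split()
    return protein_id, int(length), float(pi.replace(",", "."))


# Three rows copied from the cluster table, for illustration only.
rows = """4Legl 323 5,54182
1377D 290 9,01379
1cMQ9 292 6,92949"""

members = [parse_member_row(row) for row in rows.splitlines()]
avg_length = mean(m[1] for m in members)  # mean length of the sample rows
avg_pi = mean(m[2] for m in members)      # mean pI of the sample rows
```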
Similar Clusters (pHMM search)
# Cluster Protein ID Members Identity (%) Alignment Length E-value
1 phalp2_9688 23BCv 75 42,9% 289 4.555E-111
2 phalp2_32888 3ixlE 80 32,9% 270 2.744E-88
3 phalp2_31501 3TJjZ 324 31,1% 273 2.804E-50
4 phalp2_31883 4GV8X 12 29,6% 270 2.814E-43
5 phalp2_32958 3ZDZv 229 24,7% 275 3.430E-40
6 phalp2_15413 23dr0 42 28,7% 261 1.910E-34
7 phalp2_5194 3NO3B 1 23,9% 284 4.693E-25
8 phalp2_20702 4ZCb1 19 25,4% 267 6.347E-25
9 phalp2_9643 1KPZ0 19 24,1% 265 1.932E-22
10 phalp2_28575 8deQW 51 27,3% 267 5.205E-21
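Each similar-cluster hit above consists of seven whitespace-separated fields, so the records can be recovered from the token stream regardless of how the lines are wrapped. The sketch below assumes that layout. The class name `ClusterHit` and the field name `protein_id` are hypothetical: the page header does not label the protein ID column, which may be the representative of the hit cluster.

```python
from typing import NamedTuple


class ClusterHit(NamedTuple):
    rank: int
    cluster: str
    protein_id: str        # hypothetical label; unnamed in the page header
    members: int
    identity_pct: float
    alignment_length: int
    e_value: float


def parse_hits(text: str) -> list[ClusterHit]:
    """Group whitespace-separated tokens into seven-field hit records."""
    tokens = text.split()
    if len(tokens) % 7 != 0:
        raise ValueError("expected seven fields per hit")
    hits = []
    for i in range(0, len(tokens), 7):
        rank, cluster, pid, members, identity, aln, evalue = tokens[i:i + 7]
        hits.append(ClusterHit(
            int(rank), cluster, pid, int(members),
            float(identity.rstrip("%").replace(",", ".")),  # '42,9%' -> 42.9
            int(aln), float(evalue),                        # '4.555E-111'
        ))
    return hits


# The first hit is wrapped over three lines, the second is on one line.
sample = """1 phalp2_9688
23BCv
75 42,9% 289 4.555E-111
2 phalp2_32888 3ixlE 80 32,9% 270 2.744E-88"""
hits = parse_hits(sample)
```

Note that identities use a decimal comma while E-values use a decimal point, so the two columns need different conversions.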

Domains

Domain architecture (figure): annotated domain PG_1 (Pfam accession: PF01471); remaining regions unannotated.
Representative sequence (used for alignment): 4Legl (323 AA)
Member sequence: 5W5xD (294 AA)
Axis: positions 1 to 323 AA (representative). Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG [NCBI]
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4Legl) rather than this protein.
PDB ID: 4Legl
Method: AlphaFold v2
Resolution: 82.12
Chain position: -
Model confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
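The confidence bands above can be expressed as a small classifier. This is a sketch only: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of scores exactly at 90, 70, or 50 is an assumption (here they fall into the lower band).

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the page's confidence label."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"      # assumption: exactly 90 is treated as "High"
    if plddt > 50:
        return "Low"       # assumption: exactly 70 is treated as "Low"
    return "Very low"      # assumption: exactly 50 is treated as "Very low"
```

If the 82.12 listed under Resolution is in fact a mean pLDDT for the AlphaFold model (the page does not say so), `plddt_band(82.12)` would place it in the "High" band.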