Protein

Protein accession
3483v [EnVhog]
Representative
4C1UL
Source
EnVhog (cluster: phalp2_37797)
Protein name
3483v
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VNLPFIKLVVPTALKQYKNGQLAESVLAPIKTGGKMYAPVAAEFNKLYDAAIAAGIKLKNVGDYRSFQGQLTMFMDRYVTTDTGTGVTRQYEGKTWWLKKGKAPSAAPDPTGLKGSNHGWGLAIDLGYDQGGKTASFGVNVPAFQWMCANAPKYGFYLQGNNPASKEFEAWHWQYCLGDASPDGSVQAAPAAPAAAPAGGGMKFDYPGTPVGLGSKGAAASLVQAIIGAKADGDFGPKSVASLKAWQTANGLTADGSVGPVTWKKMFG
Physico‐chemical
properties
protein length:268 AA
molecular weight:28063,6 Da
isoelectric point:9,30
hydropathy:-0,19
Representative Protein Details
Accession
4C1UL
Protein name
4C1UL
Sequence length
289 AA
Molecular weight
31606,62900 Da
Isoelectric point
9,60761
Sequence
MARKLPIAKLVMPSTLARHENGKLPDHLLVKIGVGNARMEMTAARSFVAMFAEASRTLGVQLRHVGDYRPYDAQLRLFLDRYEPVSLAVYGVTSPSNRKKWDAGKTSHGSAWWRKKKRADGSYPATAATPGSSNHGWGLAIDIAEEYDNDPQPDPIRDMFVGWLVGNAHRYGISAELQSEPWHWRYVAGDAIPQATRDFERNGGVVPPTVVTPPNVEPGPALVFAYPGEPIKRGSKHTTAVKLIQAVVGATPDGDFGAVTERRVKVWQASRGLVADGVVATVTWKAMFG
Other Proteins in cluster: phalp2_37797
Total (incl. this protein): 98 Avg length: 275,2 Avg pI: 8,96

Protein ID Length (AA) pI
4C1UL 289 9,60761
11nlL 307 6,75403
16299 236 9,50324
17SdK 269 9,14647
18hwZ 274 8,45640
1B6lF 354 9,29075
1IsCu 325 5,29776
1NM9T 271 9,07207
1O6nT 285 9,16774
1OD9p 253 10,13464
1Ob14 267 9,27399
1ObvO 270 9,10811
1QFyl 271 9,25826
1QHm3 256 9,26096
1WZHz 270 9,17400
1X0f0 269 9,28939
1nBFP 262 9,19972
1nzhH 267 9,13609
230ee 353 9,39093
26yKq 348 9,12042
2GzlL 272 9,14176
2ZgX0 279 8,58218
2Zyan 269 9,14647
2j8G6 261 9,43806
2qoGb 270 9,17400
2qsDG 272 9,08555
33yuN 270 9,17387
341UH 270 9,33433
38mpJ 266 9,16626
3dnnO 341 5,60492
3gs7K 267 9,06807
46LZT 290 9,40234
46cQB 267 9,28591
49Idt 267 9,15388
49lGf 279 9,35464
49lXC 267 9,39583
4A4kf 273 9,91371
4Av5d 266 9,38764
4BWi5 283 9,89785
4HLZm 297 5,87382
4NGRe 314 5,30071
4W7Ip 270 9,99771
4WHCn 269 10,13954
4bzDX 293 8,81342
4fI0d 257 6,23134
4n6Ac 263 9,63920
4nDeW 269 9,20584
4oV2i 252 9,93453
50n51 264 9,21635
51wdV 270 9,07394
53M4h 272 9,26116
53bIj 270 9,28720
53kUW 269 9,28720
54jZH 269 9,20584
56koH 253 9,04886
57Nt8 267 9,22054
58gWI 267 9,40943
59gLp 268 9,37307
5IVY5 300 9,81804
5J71A 277 4,33735
5bESP 270 9,35780
5beIv 266 9,03952
5cfmL 223 8,31347
5fRR5 270 9,19714
5gDSv 265 9,28939
5haYG 270 9,17400
5iBDY 261 9,36631
5kMkJ 262 9,28791
5kN3y 270 9,13003
5lewD 274 9,39486
5mtY1 283 9,43954
5y196 266 9,40176
5yy7I 267 9,35780
5zmJv 263 9,06214
5zp3b 261 9,74248
6FmCD 279 6,59261
6UACW 270 9,26116
6VNjD 250 9,42059
6xoqu 283 9,35386
6zLtm 279 9,82899
84CEL 264 8,97073
8mXul 267 9,04899
8mt5m 272 9,09741
8sXFT 337 6,55350
DA1j 267 9,34239
DAzf 271 9,23266
Ghu7 271 9,08838
SE6r 268 8,97775
aGeh 270 9,27399
dQTo 282 10,07410
hUTU 270 9,44992
jKGp 281 9,02127
jKWx 285 9,20372
jUyJ 280 9,19604
lHHq 296 8,70512
lJDM 254 7,15264
tPOI 281 9,96928
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_34867
49Aoq
347 31,2% 282 6.342E-59
2 phalp2_26267
LAL7
1473 35,7% 193 2.412E-56
3 phalp2_29962
8rfJR
9 40,6% 187 4.478E-48
4 phalp2_18218
4MWch
4 36,2% 204 2.568E-46
5 phalp2_37143
1r1gj
3 25,3% 276 9.505E-23
6 phalp2_23075
4kW5M
8 23,4% 226 3.603E-21
7 phalp2_15650
2SEqZ
21 26,6% 195 2.014E-18
8 phalp2_38340
uXmy
379 24,6% 264 1.641E-17

Domains

Domains
Representative sequence (used for alignment): 4C1UL (289 AA)
Member sequence: 3483v (268 AA)
1 289 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01471, PF02557

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4C1UL) rather than this protein.
PDB ID
4C1UL
Method AlphaFoldv2
Resolution 87.29
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50