Protein

Protein accession
809gt [EnVhog]
Representative
6lIKR
Source
EnVhog (cluster: phalp2_20828)
Protein name
809gt
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTEKEFVEKIGNLAVEDMKTSGILASVTTAQACLESGYGSTELAKNANNLFGMKTTLSNNTWASVWDGKSKYTKKTNEQTKDGKVYVITADFRKYANILMSIKDHSCYLNGAMNGKVKRYAGLSGCKDYKTAAQLIKNGGYATDIKYVDKLCSLIERWNLTRFDNFGKENDNMNIIDVTSASKSYVPQWGNQKQYIVVHYLGVAGQNNKINSDGCGAHYYIYWDGTIYKAADHNAILWQVGTAGYYTQKHPYARNSNCIGIEMCPKCDGSGKYAEDPTWYFTEATQNACVQLVKYLMGQLGVGADHVLRHYDVVNKYCPAPYVTNNKYKTSWTWNEFKAKLGATYTPPVVQSQPTSTKTYKTGMYKVNCDLHIRSDATVNSKVVNTIRERGTYTVTEIKNNCWGKLKSGAGWINVSDEYCTYVGVVATASKPVSKPTAKPATPVYKVGKYKVNCDALTIRSDASSKASATGSIRDKGTYNITEIKNTYWGKLKSGAGWICIDKDFCTYVGALDKHGTVVNKEFQIAVKENGIRVRASAGLSARIAIGSCPIGTYTITETKAADGYTWGKLKSGAGWIAIECCVRL
Physico‐chemical
properties
protein length:585 AA
molecular weight:64592,7 Da
isoelectric point:9,17
hydropathy:-0,41
Representative Protein Details
Accession
6lIKR
Protein name
6lIKR
Sequence length
596 AA
Molecular weight
63406,89280 Da
Isoelectric point
8,86004
Sequence
MEANMTEKEFVEKIGPLAAQDMKTSGVLASITAAQACLESGYGSTELAVNANNLFGMKCSLSGNTWDSVWDGVSKYTKKTNEQKPDGTVYTITADFRKYPDILTSIKDHSCYLNGAMNGSKRRYEGLSGEKDYRRAAELIKAGGYATDIAYVDKLCSLIERWNLTKYDKEDEGMSNSSLVNCTVKSPNHSGARTHAIDRITPHCVVGQLSAESIGGCFTSSSRQASCNYGIGSDGRVVLCVDEGNRSWCSSSNANDQRAVTIECASDTTDPYAMTDAVYEKLIALCVDICRRNGKTKLLWFGDKNTALNYSTKSNEMVLTVHRWFANKSCPGDWLYSRLSDVANRVTAQLSGSSGGGTTGGGSTGGSSGNYKTGMYKVNVADLNIRKGPGTNYGINGVITDKGTYTITEIQNGSWGKLKSGAGWINVSTAYCSYAGAASGGSSSSGGSTSSGASYKTGTYKVNVAELNIRKGPGTNYGTNGSIKDKGVYTITEIQNGSWGKLKSGAGWINVDKAYCTYRGAASSGGSTAASSGSFQVQVSISDLYIRKGPGTNYGNNGFCPKGVYTIVETQSVGGYTWGRLKSGAGWIALEHTKRL
Other Proteins in cluster: phalp2_20828
Total (incl. this protein): 83 Avg length: 511,0 Avg pI: 8,73

Protein ID Length (AA) pI
6lIKR 596 8,86004
1gId3 418 9,19965
1gxFb 434 6,91221
1m1SB 592 8,83315
1nu56 591 9,16723
1r5wm 528 9,28669
21fBf 585 9,15672
23JwC 471 9,24285
23cpJ 500 8,75173
24HCK 500 8,82084
2lDWv 474 9,06743
2muZU 466 8,92528
2mxJC 520 8,53776
2mxfC 587 8,62189
2pi7Q 493 8,94836
38GhA 377 9,17664
39WYE 419 8,86790
3A4rW 506 9,16433
3A4u1 474 9,08722
3jLy0 593 8,93643
3jVdw 509 9,16213
3k6ZW 592 8,98285
3k9n5 504 8,60822
3lAxX 534 9,23621
3nRST 433 7,96180
3nWnK 434 9,36676
3o439 472 9,27173
3pqL0 585 9,20172
3pvvZ 588 8,74780
3qgWk 416 5,54001
3qw9R 592 8,93991
3sEpa 597 9,15607
3sIGU 588 8,62763
3tMGL 592 8,98291
3tgUn 416 5,80226
3vZRe 591 9,05544
3wSNV 508 7,77787
3wzWh 410 6,96365
3yO9O 585 9,14247
4knPj 437 9,11204
4ylqD 422 5,59014
5KGxU 585 9,14950
5LYTu 596 8,98291
5LspK 530 9,06240
5MWEn 591 9,09367
5MZl4 487 8,35228
5Smoe 496 6,13897
5TAfa 530 9,02134
5ToV8 488 9,37327
5Uo1O 416 5,81124
5VmT0 585 9,14602
60948 473 9,20462
62QIX 429 9,27586
63Vdp 501 9,35696
64bNr 467 7,97643
67qxw 585 9,19811
68yKd 496 5,74787
6kDzX 545 9,02611
6ox22 501 9,38081
6qKOX 501 9,40924
6rUtx 585 9,13022
6tGh3 529 9,02565
6tT7y 573 9,49943
6wFee 470 9,01289
6wXj 472 8,72736
7UZfL 502 9,33414
7W1IR 532 9,12990
7nGeA 421 9,38919
7pjV6 592 8,97917
7tCBG 584 8,97750
7xASR 592 8,90149
82AgB 534 9,33833
84RSI 431 9,32756
8daFR 585 9,11340
8dngn 467 8,65193
8k2fC 469 9,08600
8kPOV 534 9,27128
8ptyr 423 9,16942
8sjI8 510 9,46140
8sob5 431 9,35967
o7nQ 520 9,07742
A0A8S5QXI1 339 8,17396
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_30764
5v6U
28 33,0% 526 8.937E-121
2 phalp2_5251
813lL
211 39,6% 618 9.346E-114
3 phalp2_35371
18KG
17 32,4% 517 5.479E-72
4 phalp2_37594
45uQn
106 36,1% 459 4.023E-65
5 phalp2_38910
23s0X
1 24,7% 574 3.510E-48
6 phalp2_31675
4L12w
1 20,9% 768 7.613E-36

Domains

Domains
GLUCO
Ami2
Unannotated
Unannotated
Unannotated
Representative sequence (used for alignment): 6lIKR (596 AA)
Member sequence: 809gt (585 AA)
1 596 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510, PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6lIKR) rather than this protein.
PDB ID
6lIKR
Method AlphaFoldv2
Resolution 86.54
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50