Protein

Protein accession
4kf7R [EnVhog]
Representative
4kf7R (this protein)
Source
EnVhog (cluster: phalp2_33020)
Protein name
4kf7R
Lysin probability
99%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
LNRNTVLEAARSWLGRKEADGSFKVIIDTYNSIVPLPCGYRMTYYDPWCAAFVSAVAQKVGATNIIYPECSCDRMIQLYQQHGRWMENDAYTPQPGDVIFYDWQDSGSGDNVGSSDHVGLVYSVVGTSMTIIEGNCSDSVCFTTRTVNQRYIRGYGLPDYENSVVTVPSTPAITVPQTNMAVGSKVKVTGSSWYTGALIPQWVKDDTWIILCINGDRVVLDKNVSGTRSIMSPINIKDIVLADSSTTTNVINTTPTVSVITSSNMTEKEMWDYLMSVYNNKYGVAGIMGNLYAESGLISTNLENQYESKLGYNDTTYTQSVDNGTYTNFVNDSAGYGLAQWTYWSLKKGLYNYAKSNKKSIGDARMQLEFLVSEFASLPAIVSAIKNATNIRTPSDIILTQYERPANQSEAVKVARANFGQNYLNKYGNSTTVTPSSNETNITPSTSKEYSVGDVVNFTGSKHYISSYLDIGFSCKPGKATITIINKKGKHPYHLIRTAAGGSTVYGWVDSGTFN
Physico‐chemical
properties
protein length:515 AA
molecular weight:56673,9 Da
isoelectric point:6,28
hydropathy:-0,28
Other Proteins in cluster: phalp2_33020
Total (incl. this protein): 17 Avg length: 457,6 Avg pI: 6,10

Protein ID Length (AA) pI
24Mng 515 6,51906
2IwN 513 6,36815
603Io 427 5,61162
63EFq 427 5,99602
6cwBZ 427 5,87729
6ku8n 427 5,66852
6m90U 512 5,93686
6nJlA 427 5,87729
6pQMr 427 5,88218
6pxRF 428 6,14159
6rtda 427 6,13130
6rzdA 427 6,12789
6tvOo 427 5,50317
6vsfC 427 6,12789
7DGKh 514 7,02703
8lb1H 513 6,61199
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_8749
8pVFA
2 39,7% 428 4.912E-94
2 phalp2_9561
1gCbl
21 38,9% 477 6.701E-94
3 phalp2_26927
4yq9J
22 45,3% 375 4.316E-93
4 phalp2_14240
38OZD
1 29,1% 532 4.725E-48
5 phalp2_37261
40hNM
4 28,5% 428 1.039E-32

Domains

Domains
CHAP
Unannotated
Phage_lys2
Unannotated
Protein sequence: 4kf7R
1 515
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF05257, PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4kf7R
Method AlphaFoldv2
Resolution 79.30
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50