Protein

Protein accession
4jkn5 [EnVhog]
Representative
4jkn5 (this protein)
Source
EnVhog (cluster: phalp2_25646)
Protein name
4jkn5
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VEFAYGYGKVRLTVAEMEQRTNWNRLHPEFRRRLVALFTTAQTEGTDVGLGGGYRSTERQKAMFLSRYVVDPNGRTRWDGKRWTKKRGVASAAPPGRSYHEPTCTEGWCYAADLVGDLKWANANAHRFDLLHFANVNSEPWHFQPIELPKSRFRYSGQTLEVWNPPAPPNTFTPDPIKHITKDIDMRLVNPARLFDSRSKGGPFKAGETREVRIADVSAAFVNVTVVPYEPGYVTVWGGGTMPNVSNVNYVDAPIANTSLVPVENGHVKVYSSGKADIIIDLQATA
Physico‐chemical
properties
protein length:286 AA
molecular weight:31980,7 Da
isoelectric point:9,26
hydropathy:-0,47
Other Proteins in cluster: phalp2_25646
Total (incl. this protein): 33 Avg length: 265,8 Avg pI: 8,82

Protein ID Length (AA) pI
1LeEF 276 9,61361
1idqO 268 9,28082
1wVkw 273 8,97369
25NFo 249 9,03036
2PfZW 256 9,91712
2fMR1 288 9,27953
30Og9 248 9,14196
46aJk 250 8,52602
49rDI 269 9,27560
4IjeU 292 5,37153
4NIAc 265 9,92112
4bK3U 265 9,12087
4glL2 296 6,28482
55YOC 249 9,74435
5g6JK 248 9,78464
5l2hC 287 9,33207
5mg8s 265 8,99097
5zwH8 227 8,49547
5zxtj 268 9,15562
6LRLz 252 8,97930
6PFD2 279 6,30955
6zLpO 262 9,78922
7VbBd 239 9,53856
8pPqX 277 9,61393
8retA 280 9,84253
8tiGD 290 5,69722
LidV 249 9,78464
SOJJ 250 8,93031
Tlq4 265 9,84627
acEm 251 6,70736
tX5B 286 8,56529
A0A6J5PEQ6 267 8,98826
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12435
lKKf
19 46,7% 244 1.262E-83
2 phalp2_911
8pNJ8
7 44,7% 199 4.638E-60
3 phalp2_37737
4i40I
27 38,7% 191 1.307E-49
4 phalp2_10879
4vCvH
3 32,6% 208 5.619E-36
5 phalp2_15650
2SEqZ
21 28,6% 227 2.385E-30
6 phalp2_13168
2jukH
10 27,5% 298 9.523E-29
7 phalp2_27543
5nBeO
48 26,5% 200 8.152E-15
8 phalp2_34828
3T2Xl
16 23,4% 196 1.866E-08

Domains

Domains
PET_M15
Unannotated
Protein sequence: 4jkn5
1 286
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF02557

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4jkn5
Method AlphaFoldv2
Resolution 92.50
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50