Protein

Protein accession
1XF5l [EnVhog]
Representative
4NKG1
Source
EnVhog (cluster: phalp2_21)
Protein name
1XF5l
Lysin probability
99%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MKKLVQRAYESAKEDLKKDWSEVKGPGSNPLITECYKAVDGLGNPEMLDDSTTAWCSCYMNKKIQDAGGKGTRSPAARSWLRWGREVTEPSEGDIVIFKRGNSSWQAHVTFFISKDEEYVQCLGGNQGNDVKISRYPIDHIIGFRTSKD
Physico‐chemical
properties
protein length:149 AA
molecular weight:16753,7 Da
isoelectric point:7,63
hydropathy:-0,73
Representative Protein Details
Accession
4NKG1
Protein name
4NKG1
Sequence length
148 AA
Molecular weight
16658,07530 Da
Isoelectric point
9,23086
Sequence
MRLIEKMWALIQAEAKKDWKEKPGALMNENIKKAFEEVKIDGLDLSDMDDGAIATCSILMNWICQKCGGTGTRSGLARSWSNWGRASDGKVGDIVILRRGTSSWQGHVTMLYKKNLLTVECLGFNQKNDLRISTYPRAQVIAYRTSKD
Other Proteins in cluster: phalp2_21
Total (incl. this protein): 6 Avg length: 150,7 Avg pI: 7,95

Protein ID Length (AA) pI
4NKG1 148 9,23086
1aJI7 149 6,82872
3TYjd 151 5,75810
5kDqJ 159 8,76372
5mwFR 148 9,48357
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_19771
7LzfF
669 34,0% 141 3.776E-42
2 phalp2_31967
4Vkzp
33 37,4% 131 1.818E-31
3 phalp2_35078
59R52
32 34,8% 129 2.262E-30
4 phalp2_32166
6PqA1
486 28,7% 153 5.821E-30
5 phalp2_37918
4UqBP
157 31,5% 146 5.821E-30
6 phalp2_28323
16UXy
9 29,1% 134 3.934E-26
7 phalp2_38506
1rtno
9 31,0% 129 7.488E-23
8 phalp2_8732
8njr6
7 34,3% 102 1.268E-21
9 phalp2_1584
87k6T
5 37,6% 101 3.256E-21
10 phalp2_5484
3e9Dj
4 34,4% 122 2.145E-20

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4NKG1 (148 AA)
Member sequence: 1XF5l (149 AA)
1 148 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4NKG1) rather than this protein.
PDB ID
4NKG1
Method AlphaFoldv2
Resolution 90.57
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50