Protein
- Protein accession
- Rkne [EnVhog]
- Representative
- 4gf7o
- Source
- EnVhog (cluster: phalp2_30252)
- Protein name
- Rkne
- Lysin probability
- 89%
- PhaLP type
- endolysin (probability: 99%, predicted by ML model)
- Protein sequence
-
MKKNLCKNCLGKLSNWAYKIQKVHREQNGNVLARVITWRTKKTIYLCKHCWDNYIDFSKVVVFLVVLTLSSLTSAHAIPENKAVRAIIGEAADQGYRGMLAVACAIRNRGHLKGVYGLKAKHVDSEPEWIWNIAKIAWQESASCDIVNGATHWENLTYGTPYWAKSMEIACKIGAHTFFKEN
- Physico-chemical properties
- protein length: 182 AA; molecular weight: 20640,8 Da; isoelectric point: 9,40; hydropathy: -0,18
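
The length and hydropathy figures listed above can be sanity-checked from the sequence itself. The sketch below is a minimal illustration, assuming the hydropathy value is a GRAVY score (mean Kyte-Doolittle hydropathy per residue). The page does not state which scale it uses, so the choice of scale and the names `KYTE_DOOLITTLE`, `SEQUENCE`, and `gravy` are assumptions made for this example.

```python
# Kyte-Doolittle hydropathy index per residue.
# ASSUMPTION: the page does not say which scale produced its "hydropathy" value.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Member protein sequence, copied from the record above.
SEQUENCE = (
    "MKKNLCKNCLGKLSNWAYKIQKVHREQNGNVLARVITWRTKKTIYLCKHCWDNYIDFSKV"
    "VVFLVVLTLSSLTSAHAIPENKAVRAIIGEAADQGYRGMLAVACAIRNRGHLKGVYGLKA"
    "KHVDSEPEWIWNIAKIAWQESASCDIVNGATHWENLTYGTPYWAKSMEIACKIGAHTFFK"
    "EN"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"length: {len(SEQUENCE)} AA")
    print(f"GRAVY:  {gravy(SEQUENCE):.2f}")
```

If the computed score differs from the listed hydropathy, the most likely explanation is that the database uses a different scale or averaging scheme.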
Representative Protein Details
- Accession
- 4gf7o
- Protein name
- 4gf7o
- Sequence length
- 168 AA
- Molecular weight
- 18921,80280 Da
- Isoelectric point
- 9,90326
- Sequence
-
MSLNYKQSRLLARSWTESEKEESRRMPFIWAIMVGIAILLVMLIVDMASASEASTAKYSDNMAILAIIGEAESEPYAGMVAVGRTIIKRGSLKGVYGLTARRVVMRKYSSSTYKRARQALEEAKRTMHGWKAIGWGNESDLAIFNRSAWFTRCTIVAHIGNHYFYGVK
Other Proteins in cluster: phalp2_30252
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_3237 (2edZS) | 52 | 39,0% | 128 | 2.031E-43 |
| 2 | phalp2_16587 (oHc) | 1 | 34,1% | 117 | 3.322E-26 |
| 3 | phalp2_40530 (4HipJ) | 3 | 36,9% | 111 | 6.223E-26 |
| 4 | phalp2_11072 (4Y5WB) | 2 | 25,1% | 151 | 1.595E-25 |
| 5 | phalp2_37890 (7FH0Q) | 12 | 33,3% | 111 | 1.305E-15 |
| 6 | phalp2_11411 (d7uy) | 20 | 22,5% | 133 | 1.208E-12 |
| 7 | phalp2_13797 (6Xdga) | 10 | 26,9% | 130 | 2.660E-11 |
| 8 | phalp2_26034 (6XPUy) | 1 | 23,7% | 118 | 1.693E-10 |
| 9 | phalp2_26781 (3naKf) | 1 | 21,9% | 132 | 3.134E-10 |
| 10 | phalp2_20064 (1DZhH) | 3 | 18,4% | 130 | 2.306E-08 |
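
The identity column in the table above uses a decimal comma, while the E-values use a decimal point in scientific notation, so the two need different handling when the hits are loaded programmatically. The following is a minimal sketch. The names `Hit` and `parse_identity` are hypothetical, and the E-value cutoff is an arbitrary illustration, not a threshold taken from this page.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    cluster: str
    members: int
    identity: float  # percent identity
    aln_len: int     # alignment length
    evalue: float


def parse_identity(text: str) -> float:
    """Convert a comma-decimal percentage such as '39,0%' to 39.0."""
    return float(text.strip().rstrip("%").replace(",", "."))


# First three rows of the table, copied as printed on the page.
ROWS = [
    ("phalp2_3237", "52", "39,0%", "128", "2.031E-43"),
    ("phalp2_16587", "1", "34,1%", "117", "3.322E-26"),
    ("phalp2_40530", "3", "36,9%", "111", "6.223E-26"),
]

hits = [
    Hit(cluster, int(members), parse_identity(identity), int(aln_len), float(evalue))
    for cluster, members, identity, aln_len, evalue in ROWS
]

# ASSUMPTION: 1e-30 is an illustrative cutoff only.
strong = [h.cluster for h in hits if h.evalue < 1e-30]
```

Python's `float()` already accepts the `2.031E-43` notation, so only the percentage column needs the comma replaced.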
Domains
Domain map (figure not reproduced): axis from 1 to 168 AA (representative).
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Unknown from Metagenome [NCBI] | UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
No tertiary structures available for this protein.
The structures below correspond to the cluster representative (4gf7o) rather than this protein.
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
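
The confidence bands above map directly onto a small lookup. The following is a minimal sketch with a hypothetical function name, `plddt_band`. The legend uses strict inequalities and does not say which band the exact boundary values 50, 70, and 90 fall into, so assigning them to the higher band here is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its confidence band."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT out of range: {plddt}")
    # ASSUMPTION: boundary values (90, 70, 50) go to the higher band.
    if plddt >= 90:
        return "Very high"
    if plddt >= 70:
        return "High"
    if plddt >= 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 41.7):
        print(score, plddt_band(score))
```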