Protein
- Protein accession: 4f6EK [EnVhog]
- Representative: 4f6EK (this protein)
- Source: EnVhog (cluster: phalp2_39293)
- Protein name: 4f6EK
- Lysin probability: 95%
- PhaLP type: endolysin (probability: 99%, predicted by ML model)
- Protein sequence:
VTIKSGRGWRLAPSLVALEGEANRVAPRRSQASDGSIGDSAHRHRVSDHNPSHGVVHAIDLTHDPRGGFDAHAHGRAIAARRDPRVKYLISQRRIWEPATGWKSYGGDNPHDKHLHVSIKDTPAAENDRSVWLPWAGAPAPAPTPIPAPAPISPPVSVPSNVPTPVPTFEEDDVIVRNTEDGSIWAVSSTHMHHLTPDQWNQRSGVEHPPIVDMAAISVWSLALAGRQVV
- Physico-chemical properties:
  - protein length: 230 AA
  - molecular weight: 24926.5 Da
  - isoelectric point: 7.02
  - hydropathy: -0.49
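The length and hydropathy values above can be recomputed from the sequence. The following is a minimal sketch, assuming that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy per residue); the page does not name the scale it uses, so the result may not match the listed value exactly. The names `KYTE_DOOLITTLE`, `SEQUENCE`, and `gravy` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy index per residue.
# ASSUMPTION: the record does not state which scale it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4f6EK, copied from the record and split for readability.
SEQUENCE = (
    "VTIKSGRGWRLAPSLVALEGEANRVAPRRSQASDGSIGDSAHRHRVSDHNPSHGVVHAID"
    "LTHDPRGGFDAHAHGRAIAARRDPRVKYLISQRRIWEPATGWKSYGGDNPHDKHLHVSIK"
    "DTPAAENDRSVWLPWAGAPAPAPTPIPAPAPISPPVSVPSNVPTPVPTFEEDDVIVRNTE"
    "DGSIWAVSSTHMHHLTPDQWNQRSGVEHPPIVDMAAISVWSLALAGRQVV"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X, B, Z).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"protein length: {len(SEQUENCE)} AA")
    print(f"hydropathy (GRAVY, assumed scale): {gravy(SEQUENCE):.2f}")
```

Molecular weight and isoelectric point depend on the residue mass table and pKa set chosen, so they are left out of this sketch.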
Other Proteins in cluster: phalp2_39293
Total (incl. this protein): 17; avg length: 232.9 AA; avg pI: 7.77

| Protein ID | Length (AA) | pI |
|---|---|---|
| 15ve9 | 241 | 9.35683 |
| 1LlEl | 245 | 6.59329 |
| 1LnDW | 241 | 9.18322 |
| 1Lp5Y | 225 | 8.70299 |
| 36C6b | 236 | 10.09448 |
| 4fOSu | 242 | 6.35184 |
| 4fdoP | 241 | 9.35683 |
| 4fdoQ | 245 | 6.40481 |
| 8eJMh | 243 | 9.35683 |
| 8jgW8 | 210 | 6.62199 |
| 8oYQ3 | 230 | 6.14568 |
| 8pOm7 | 219 | 6.49121 |
| 8pPWR | 219 | 7.18197 |
| 8rfcN | 233 | 7.15253 |
| 8sRQZ | 231 | 6.14989 |
| 8sXet | 228 | 9.91364 |
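The cluster summary (total, average length, average pI) can be checked against the rows of the table. Below is a minimal sketch that takes the 16 listed members plus 4f6EK itself (230 AA, pI 7.02 as given in the physico-chemical properties) and recomputes the averages. The names `CLUSTER` and `cluster_summary` are illustrative only.

```python
from statistics import mean

# (protein ID, length in AA, pI) for every member of phalp2_39293.
# The first row is 4f6EK itself; the rest are copied from the table.
CLUSTER = [
    ("4f6EK", 230, 7.02),
    ("15ve9", 241, 9.35683),
    ("1LlEl", 245, 6.59329),
    ("1LnDW", 241, 9.18322),
    ("1Lp5Y", 225, 8.70299),
    ("36C6b", 236, 10.09448),
    ("4fOSu", 242, 6.35184),
    ("4fdoP", 241, 9.35683),
    ("4fdoQ", 245, 6.40481),
    ("8eJMh", 243, 9.35683),
    ("8jgW8", 210, 6.62199),
    ("8oYQ3", 230, 6.14568),
    ("8pOm7", 219, 6.49121),
    ("8pPWR", 219, 7.18197),
    ("8rfcN", 233, 7.15253),
    ("8sRQZ", 231, 6.14989),
    ("8sXet", 228, 9.91364),
]


def cluster_summary(rows):
    """Return (member count, mean length, mean pI) for the given rows."""
    lengths = [length for _, length, _ in rows]
    pis = [pi for _, _, pi in rows]
    return len(rows), mean(lengths), mean(pis)


if __name__ == "__main__":
    total, avg_len, avg_pi = cluster_summary(CLUSTER)
    print(f"Total: {total} | Avg length: {avg_len:.1f} | Avg pI: {avg_pi:.2f}")
```

With these inputs the script reproduces the listed summary: 17 members, average length 232.9 AA, average pI 7.77.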
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_398 (7qxjv) | 88 | 50.3 | 161 | 1.115E-38 |
| 2 | phalp2_339 (6W7QO) | 1 | 36.5 | 219 | 2.010E-35 |
| 3 | phalp2_23302 (5hZqv) | 9 | 38.0 | 176 | 2.440E-34 |
| 4 | phalp2_39312 (4kyeu) | 9 | 34.0 | 229 | 2.620E-32 |
| 5 | phalp2_32428 (UbvX) | 6 | 44.0 | 150 | 1.244E-31 |
| 6 | phalp2_11273 (6QMUc) | 1 | 38.3 | 159 | 3.817E-30 |
| 7 | phalp2_19235 (42vyU) | 4 | 34.8 | 215 | 2.171E-28 |
| 8 | phalp2_5823 (5e2qa) | 2 | 37.8 | 156 | 1.319E-22 |
Domains
[Domain architecture diagram spanning residues 1 to 230. Legend: EAD, CBD, Linker, Disordered, Unannotated]
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Unknown from Metagenome [NCBI] | UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
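The confidence bands above can be applied to per-residue pLDDT scores with a small helper. This is a minimal sketch: the function name `plddt_band` is hypothetical, and because the legend uses strict inequalities, the handling of the exact boundary values 90, 70, and 50 is an assumption (here they fall into the lower band).

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt > 90:
        return "Very high"
    # ASSUMPTION: boundary values (90, 70, 50) go to the lower band,
    # since the legend does not say where they belong.
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.2, 81.0, 63.5, 42.0):
        print(score, plddt_band(score))
```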