Protein
- Protein accession
- 7N5g9 [EnVhog]
- Representative
- 7N5g9 (this protein)
- Source
- EnVhog (cluster: phalp2_5784)
- Protein name
- 7N5g9
- Lysin probability
- 98%
- PhaLP type
-
endolysin
Probability: 96% (predicted by ML model) - Protein sequence
-
QWYDSHEVEFVVKNRSVLAESLNNNTQSFINTISPSAVNIASAYDLFPSVMVAQAILESHSGQSGLASDYHNLFGIKGAYKGGSVSLKTWEDDGSGNAYTIYDTFRVYSTWYESLEDYANVLQQAHFNGVHRSVAGNVYNATSALVGVYATDTAYAEKLRYIIQTYGLESLDGGGTTLTGNEDGTVWNKYRQQNTTQAILNEDIAWAKRIGTE
- Physico‐chemical
properties -
protein length: 213 AA molecular weight: 23484,4 Da isoelectric point: 4,85 hydropathy: -0,36
Other Proteins in cluster: phalp2_5784
| Total (incl. this protein): 18 | Avg length: 221,8 | Avg pI: 7,26 |
|
|
||
| Protein ID | Length (AA) | pI |
|---|---|---|
| 39WVM | 167 | 5,81238 |
| 3xCJa | 192 | 5,50653 |
| 4W5RL | 174 | 7,63310 |
| 4jLZw | 189 | 5,25877 |
| 5Hy9X | 186 | 7,66237 |
| 5Hycn | 173 | 9,15072 |
| 6ah09 | 279 | 9,49943 |
| 6uwBB | 247 | 7,99996 |
| 70o72 | 245 | 6,84986 |
| 7MLYM | 253 | 8,95996 |
| 7MUcm | 156 | 4,98031 |
| 7MjKZ | 253 | 8,97318 |
| 7NQY4 | 246 | 6,84838 |
| 7P4A7 | 251 | 6,85304 |
| CF1O | 279 | 9,42007 |
| ELsw | 211 | 4,80951 |
| zgi3 | 279 | 9,53605 |
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 |
phalp2_5346
2cQBB
|
648 | 37,4% | 155 | 6.936E-32 |
| 2 |
phalp2_31187
89iBG
|
9 | 36,8% | 144 | 5.895E-28 |
| 3 |
phalp2_28108
7xE0g
|
17 | 38,0% | 147 | 8.050E-28 |
| 4 |
phalp2_9991
3fZEe
|
50 | 38,5% | 148 | 7.121E-27 |
| 5 |
phalp2_37638
4fbgK
|
5 | 37,8% | 161 | 9.721E-27 |
| 6 |
phalp2_20883
6PO0L
|
1 | 42,9% | 156 | 3.375E-26 |
| 7 |
phalp2_11358
7wMn3
|
16 | 36,3% | 168 | 3.375E-26 |
| 8 |
phalp2_2026
2muJS
|
52 | 35,9% | 153 | 6.288E-26 |
| 9 |
phalp2_2210
4f9X6
|
94 | 36,4% | 151 | 4.062E-25 |
| 10 |
phalp2_19230
41fA9
|
30 | 33,1% | 154 | 7.563E-25 |
Domains
Taxonomy
| Name | Taxonomy ID | Lineage | |
|---|---|---|---|
| Phage |
Unknown from Metagenome [NCBI] |
UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | ||
Coding sequence (CDS)
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
Model Confidence
Very high
pLDDT > 90
pLDDT > 90
High
90 > pLDDT > 70
90 > pLDDT > 70
Low
70 > pLDDT > 50
70 > pLDDT > 50
Very low
pLDDT < 50
pLDDT < 50