Protein
- Protein accession: 4y86g [EnVhog]
- Representative: 4y86g (this protein)
- Source: EnVhog (cluster: phalp2_31615)
- Protein name: 4y86g
- Lysin probability: 99%
- PhaLP type: endolysin (probability: 98%, predicted by ML model)
- Protein sequence: MASYNAFKNLVLGKAFDLDGAFGAQCWDGYAKYCKYLGYSYANCQASGYVKDIYTQRKSNGMLNNFNEVSILQAGDVVVFKEDPTWTPSSHIAIFDSDIDGVYGYFLGQNQGGANGAFNL
- Physico-chemical properties:
  - Protein length: 120 AA
  - Molecular weight: 13176.5 Da
  - Isoelectric point: 4.79
  - Hydropathy: -0.20
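The length and hydropathy figures above can be recomputed from the sequence. The following is a minimal sketch, assuming that "hydropathy" means the GRAVY score (mean Kyte-Doolittle hydropathy per residue); the page does not name the scale it uses, so the result may not match the listed value exactly. The names `KD`, `SEQ`, and `gravy` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy values per residue.
# Assumption: the page's "hydropathy" is a GRAVY score on this scale.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 4y86g as listed in this record.
SEQ = (
    "MASYNAFKNLVLGKAFDLDGAFGAQCWDGYAKYCKYLGYSYANCQASGYVKDIYTQRKSN"
    "GMLNNFNEVSILQAGDVVVFKEDPTWTPSSHIAIFDSDIDGVYGYFLGQNQGGANGAFNL"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KD[aa] for aa in seq) / len(seq)


print(len(SEQ))              # protein length in residues
print(round(gravy(SEQ), 2))  # compare against the listed hydropathy
```

Molecular weight and isoelectric point need residue masses and pKa tables, which differ between tools, so they are left out of this sketch.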
Other Proteins in cluster: phalp2_31615
Total (incl. this protein): 15 | Avg length: 330.7 AA | Avg pI: 5.29

| Protein ID | Length (AA) | pI |
|---|---|---|
| 3WK2H | 125 | 5.43042 |
| 41bxe | 163 | 7.65754 |
| 699LN | 152 | 5.09496 |
| 8nFhO | 160 | 6.49206 |
| A0A4D6AEL3 | 443 | 4.82946 |
| A0A4D6B3T7 | 448 | 4.61279 |
| A0A4D6B635 | 447 | 4.83242 |
| J7KH48 | 443 | 4.87584 |
| A0A3B8DM02 | 443 | 4.82946 |
| A0A4D6AP36 | 443 | 4.82946 |
| Q8HA43 | 443 | 4.82946 |
| A0A3B8DYV1 | 443 | 4.78700 |
| A0A3B8E250 | 447 | 4.86374 |
| A0A8S5T0Q0 | 241 | 6.59448 |
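
The cluster summary can be cross-checked against the member table. This is a minimal sketch, assuming the averages include this protein (120 AA, pI 4.79), as the "incl. this protein" label suggests. Values are written with decimal points, and the name `members` is illustrative.

```python
# (protein ID, length in AA, pI) for every member of cluster phalp2_31615.
# The first entry is this protein; the rest come from the table above.
members = [
    ("4y86g", 120, 4.79),
    ("3WK2H", 125, 5.43042),
    ("41bxe", 163, 7.65754),
    ("699LN", 152, 5.09496),
    ("8nFhO", 160, 6.49206),
    ("A0A4D6AEL3", 443, 4.82946),
    ("A0A4D6B3T7", 448, 4.61279),
    ("A0A4D6B635", 447, 4.83242),
    ("J7KH48", 443, 4.87584),
    ("A0A3B8DM02", 443, 4.82946),
    ("A0A4D6AP36", 443, 4.82946),
    ("Q8HA43", 443, 4.82946),
    ("A0A3B8DYV1", 443, 4.78700),
    ("A0A3B8E250", 447, 4.86374),
    ("A0A8S5T0Q0", 241, 6.59448),
]

total = len(members)
avg_length = sum(length for _, length, _ in members) / total
avg_pi = sum(pi for _, _, pi in members) / total

print(total)                 # 15
print(round(avg_length, 1))  # 330.7
print(round(avg_pi, 2))      # 5.29
```

The mean length is pulled up by the ten members of 440 AA or more; this protein and the other short members are far below it.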
Similar Clusters (pHMM search)
| # | Cluster | # Members | Identity (%) | Alignment Length | E-value |
|---|---|---|---|---|---|
| 1 | phalp2_24703 (6bxM0) | 24 | 62.6 | 123 | 3.984E-60 |
| 2 | phalp2_26341 (1gBcI) | 110 | 34.6 | 124 | 2.510E-20 |
| 3 | phalp2_32947 (3WJ00) | 2 | 32.5 | 89 | 1.261E-16 |
| 4 | phalp2_29001 (6tZ05) | 5 | 33.3 | 96 | 5.682E-12 |
| 5 | phalp2_18914 (23zaq) | 3 | 29.1 | 96 | 5.141E-11 |
| 6 | phalp2_3543 (4zM0o) | 163 | 31.3 | 99 | 9.642E-11 |
| 7 | phalp2_24452 (4BMZo) | 66 | 31.0 | 103 | 3.390E-10 |
| 8 | phalp2_13673 (5Vryf) | 1 | 26.7 | 86 | 8.702E-10 |
| 9 | phalp2_22612 (1LHfD) | 16 | 27.7 | 119 | 1.177E-06 |
| 10 | phalp2_9151 (6w5gn) | 2 | 23.9 | 92 | 3.005E-06 |
Domains
[Domain architecture diagram spanning residues 1 to 120. Legend: EAD, CBD, Linker, Disordered, Unannotated]
Taxonomy
| | Name | Taxonomy ID | Lineage |
|---|---|---|---|
| Phage | Unknown from Metagenome [NCBI] | UNKNOWN_ENVHOG | No lineage information |
| Host | No host information | | |
Coding sequence (CDS)
No CDS data available.
Gene Ontology
No Gene Ontology terms available.
Enzymatic activity
No enzymatic activity data available.
Tertiary structure
Model Confidence
- Very high: pLDDT > 90
- High: 90 > pLDDT > 70
- Low: 70 > pLDDT > 50
- Very low: pLDDT < 50
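
The confidence legend above maps a per-residue pLDDT score to one of four bands. It can be sketched as a small lookup function. The legend uses strict inequalities, so it does not say where the exact values 90, 70, and 50 belong; this sketch assumes they fall into the lower band. The function name `confidence_band` is hypothetical.

```python
def confidence_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Assumption: exact boundary values (90, 70, 50) go to the lower band,
    because the legend only defines strict inequalities.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


# Example: label a few hypothetical per-residue scores.
for score in (95.2, 81.0, 63.5, 42.1):
    print(score, confidence_band(score))
```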