fix: fix default number of histogram bins being extremely high #167

mbelak-dtml · 2023-10-05T09:40:40Z

The high number of bins caused extreme loss of performance for some specific data.
For example, for the column `mass (g)` of the `edvart.example_datasets.dataset_meteorite_landings()`, the number of inferred bins is over 5M, even though the dataset contains under 50k rows.
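To illustrate how a bin count can exceed the row count by orders of magnitude, here is a minimal sketch. It assumes the bin width comes from a Freedman-Diaconis-style rule, which this PR does not confirm. The data is synthetic (not the meteorite dataset), and `freedman_diaconis_bin_count`, `capped_bin_count`, and the `max_bins` default are hypothetical names and values chosen for illustration, not edvart's API.

```python
import math

import numpy as np


def freedman_diaconis_bin_count(values) -> int:
    """Bin count implied by the Freedman-Diaconis rule (assumed rule, see lead-in)."""
    values = np.asarray(values, dtype=float)
    q75, q25 = np.percentile(values, [75, 25])
    iqr = q75 - q25
    if values.size < 2 or iqr == 0:
        # Degenerate data: fall back to a single bin.
        return 1
    # Bin width shrinks with the IQR, regardless of how far outliers stretch the range.
    bin_width = 2 * iqr / values.size ** (1 / 3)
    return max(1, math.ceil((values.max() - values.min()) / bin_width))


def capped_bin_count(values, max_bins: int = 100) -> int:
    """Hypothetical mitigation: never request more than `max_bins` bins."""
    return min(freedman_diaconis_bin_count(values), max_bins)


# Synthetic stand-in: just under 50k small values plus one extreme outlier.
masses = np.append(np.linspace(1.0, 100.0, 49_999), 6e7)

print(freedman_diaconis_bin_count(masses))  # in the millions for this data
print(capped_bin_count(masses))             # bounded by max_bins
```

A single extreme value stretches the data range while the interquartile range stays small, so the range divided by the bin width explodes. Capping the count is only one possible mitigation and is not necessarily the one this PR implements.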


lukany

`examples/report-example.ipynb` contains some exceptions in the cell outputs. See the diff. Otherwise LGTM. I tried running the report-example and it ran fine.


mbelak-dtml requested a review from lukany October 5, 2023 09:40

mbelak-dtml self-assigned this Oct 5, 2023

fix: fix default number of histogram bins being extremely high in som…

c904a1c

…e situations

mbelak-dtml force-pushed the fix/histogram-performance-bins branch from 8fa6b08 to c904a1c Compare October 5, 2023 12:38

format with black

921c1b5

lukany requested changes Oct 9, 2023


mbelak-dtml added 4 commits October 10, 2023 09:55

rerun example notebooks

f53b41b

Merge remote-tracking branch 'origin/main' into fix/histogram-perform…

f173b38

…ance-bins

rerun time series example

ebf9aef

remove accidentally added exported notebook

4273bba

mbelak-dtml requested a review from lukany October 10, 2023 08:07

lukany approved these changes Oct 10, 2023


lukany added this pull request to the merge queue Oct 10, 2023

Merged via the queue into main with commit 8ae5c39 Oct 10, 2023
6 checks passed

lukany deleted the fix/histogram-performance-bins branch October 10, 2023 09:04
