fix: fix default number of histogram bins being extremely high #167

Merged: 6 commits, Oct 10, 2023

Conversation

mbelak-dtml
Collaborator

The default number of histogram bins could be extremely high, which caused a severe loss of performance for some data.
For example, for the column `mass (g)` of `edvart.example_datasets.dataset_meteorite_landings()`, the number of inferred bins is over 5M, even though the dataset contains under 50k rows.
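
To illustrate how a data-driven bin rule can produce far more bins than rows, here is a minimal sketch. It assumes the Freedman-Diaconis rule and synthetic heavy-tailed data; the PR does not state which rule edvart used, and this is not the meteorite dataset. The names `freedman_diaconis_bins` and `capped_bins` and the `max_bins=50` limit are hypothetical and are not part of edvart's API.

```python
import numpy as np


def freedman_diaconis_bins(values) -> int:
    """Bin count suggested by the Freedman-Diaconis rule (illustrative only)."""
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])
    iqr = q3 - q1
    if iqr == 0:
        # The rule is undefined for zero IQR; fall back to a single bin.
        return 1
    # Bin width shrinks with the IQR, regardless of how wide the full range is.
    width = 2 * iqr / np.cbrt(values.size)
    return int(np.ceil((values.max() - values.min()) / width))


def capped_bins(values, max_bins: int = 50) -> int:
    """Same rule, but clamped to a hypothetical upper limit."""
    return max(1, min(freedman_diaconis_bins(values), max_bins))


# Synthetic data: 45,000 small values plus one extreme outlier.
rng = np.random.default_rng(0)
data = np.concatenate([rng.uniform(0, 100, size=45_000), [60_000_000.0]])

uncapped = freedman_diaconis_bins(data)  # in the millions, far above the row count
capped = capped_bins(data)               # limited to max_bins
print(uncapped, capped)
```

A single outlier stretches the value range while the interquartile range stays small, so the range divided by the bin width grows into the millions. Clamping the result is one simple way to keep plotting cost bounded.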

@mbelak-dtml mbelak-dtml requested a review from lukany October 5, 2023 09:40
@mbelak-dtml mbelak-dtml self-assigned this Oct 5, 2023
@mbelak-dtml mbelak-dtml force-pushed the fix/histogram-performance-bins branch from 8fa6b08 to c904a1c Compare October 5, 2023 12:38
Contributor

@lukany lukany left a comment

`examples/report-example.ipynb` contains some exceptions in the cell outputs. See the diff. Otherwise LGTM. I tried running the report-example and it ran fine.

@mbelak-dtml mbelak-dtml requested a review from lukany October 10, 2023 08:07
@lukany lukany added this pull request to the merge queue Oct 10, 2023
Merged via the queue into main with commit 8ae5c39 Oct 10, 2023
6 checks passed
@lukany lukany deleted the fix/histogram-performance-bins branch October 10, 2023 09:04
mbelak-dtml added a commit that referenced this pull request Oct 10, 2023
The high number of bins caused extreme loss of performance for some
specific data.
For example, for the column `mass (g)` of the
`edvart.example_datasets.dataset_meteorite_landings()`, the number of
inferred bins is over 5M, even though the dataset contains under 50k
rows.