Improve shuffle-benchmark #1074
Codecov Report
Base: 87.17% // Head: 87.17% // No change to project coverage 👍

Additional details and impacted files

| Coverage Diff | branch-23.02 | #1074 | +/- |
|---|---|---|---|
| Coverage | 87.17% | 87.17% | |
| Files | 18 | 18 | |
| Lines | 2253 | 2253 | |
| Hits | 1964 | 1964 | |
| Misses | 289 | 289 | |
Thanks @madsbk, left a few suggestions/questions.
Minor quibbles (mostly around wording), but otherwise looks good.
Co-authored-by: Peter Andreas Entschev <[email protected]>
Co-authored-by: Lawrence Mitchell <[email protected]>
Thanks for the reviews guys, anything else?
Thanks!
/merge
Adds `--ignore-index` and balances the partition distribution between workers. This should make the runs more consistent and improve data creation significantly.
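
To illustrate what balancing partitions between workers can look like, here is a minimal sketch. It is not the code from this PR: `assign_partitions` and the worker names are hypothetical, and round-robin assignment is only one possible way to even out the distribution.

```python
from collections import Counter


def assign_partitions(n_partitions, workers):
    """Map each partition index to a worker in round-robin order.

    Hypothetical helper for illustration only; with this scheme the
    per-worker partition counts differ by at most one.
    """
    if not workers:
        raise ValueError("need at least one worker")
    return {i: workers[i % len(workers)] for i in range(n_partitions)}


# Hypothetical worker names; a real cluster would supply its own addresses.
workers = ["worker-0", "worker-1", "worker-2"]
assignment = assign_partitions(8, workers)

# Count how many partitions each worker received.
counts = Counter(assignment.values())
print(dict(counts))
```

When every worker holds roughly the same number of partitions, no single worker becomes the bottleneck, which is one plausible reason a balanced layout would make benchmark runs more consistent.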