Add an example to demonstrate multithreaded read_parquet pipelines #16828
Thanks @mhaseeb123 for putting this together. After a little hacking I was able to generate a nice pipelined profile. I removed the concats and the writes and only kept the first read. I also had to add an initialization for From this command: Based on this command, you can see why I would like a "repetitions" CLI argument 😆
Thanks @mhaseeb123 for the continued development and discussions around this example.
OMG that command line is such a monstrosity. All done in the update though. Still working on the
the final batch of nits :)
…m/mhaseeb123/cudf into fea-parquet-multithreaded-example
@lamarrr Would you please share your review?
CI changes look fine to me.
/merge
Description
Closes #16717. This PR adds a new example to read multiple parquet files using multiple threads.
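As a rough illustration of the idea in the description (several threads each reading a share of the input files), here is a minimal Python sketch. It is not the code added by this PR: `read_file_stub` and `read_files_multithreaded` are hypothetical names, and the stub only returns a placeholder instead of reading a real parquet file. The sketch assumes files are split round-robin across threads, which is just one possible policy.

```python
from concurrent.futures import ThreadPoolExecutor


def read_file_stub(path):
    # Hypothetical stand-in for a real parquet read; returns a placeholder "table".
    return {"path": path}


def read_files_multithreaded(paths, num_threads, reader=read_file_stub):
    """Read `paths` with up to `num_threads` workers; results keep input order."""
    # Never start more workers than there are files (and always at least one).
    num_threads = max(1, min(num_threads, len(paths) or 1))

    # Round-robin split: worker t gets files t, t + n, t + 2n, ...
    chunks = [paths[t::num_threads] for t in range(num_threads)]

    def worker(chunk):
        # Each worker reads its own files one after another.
        return [reader(p) for p in chunk]

    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        per_thread = list(pool.map(worker, chunks))

    # Undo the round-robin split so the output order matches `paths`.
    results = [None] * len(paths)
    for t, tables in enumerate(per_thread):
        for j, table in enumerate(tables):
            results[t + j * num_threads] = table
    return results


if __name__ == "__main__":
    files = [f"file_{i}.parquet" for i in range(7)]  # made-up file names
    for table in read_files_multithreaded(files, num_threads=3):
        print(table["path"])
```

To try this against real data, a real parquet reader can be passed as the `reader` argument in place of the stub.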
Checklist