-
Notifications
You must be signed in to change notification settings - Fork 915
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimize JSON writer #13144
Optimize JSON writer #13144
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
include cleanup use string_view, & naming add comments
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor suggestions. Approving.
cpp/src/io/json/write_json.cu
Outdated
string_view const row_prefix; //{ | ||
string_view const row_suffix; //} or }\n for json-lines | ||
string_view const value_separator; //, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
string_view const row_prefix; //{ | |
string_view const row_suffix; //} or }\n for json-lines | |
string_view const value_separator; //, | |
string_view const row_prefix; // { | |
string_view const row_suffix; // } or }\n for json-lines | |
string_view const value_separator; // , |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How is this not caught in clang-format?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems like it should. I noticed the new clang-format doing some odd things myself.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
cuSpatial and cugraph-ops noticed a bug in clang-format 16.0.1 relating to spacing around *
(it made some multiplications look like pointers). We can revert to another version if you think there are issues with the version we've chosen. For those two repos, we switched to 15.0.7. You can alter .pre-commit-config.yaml
and run pre-commit run --all-files clang-format
if you'd like to test it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've not seen anything concerning that is worth reverting to an older version.
/merge |
Description
Reduce benchmark runtime by skipping unrequired combinations
Optimize List and Struct conversion functions.
make_strings_column
to create the joined strings, and update offsets to create concatenated row string.Update default rows_per_chunk in Cython.
512MB dataframe writing runtime reduced from 84s to 2.7s.
JSON_WRITER benchmark runtime reduced from ~15m to ~5m.
Checklist