-
Notifications
You must be signed in to change notification settings - Fork 922
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adds the end-to-end JSON parser implementation #11388
Adds the end-to-end JSON parser implementation #11388
Conversation
…ite-state-transducer-trimmed
…ite-state-transducer-trimmed
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
High level review.
Good stuff, mostly docs/naming need to be ironed out.
//-------------------------------------------------------------------------------- | ||
// INITIALIZE JSON ROOT NODE | ||
//-------------------------------------------------------------------------------- |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Consider creating a new function instead of this kind of sections.
//-------------------------------------------------------------------------------- | |
// INITIALIZE JSON ROOT NODE | |
//-------------------------------------------------------------------------------- |
auto const& token = tokens_gpu[offset]; | ||
|
||
#ifdef NJP_DEBUG_PRINT | ||
std::cout << "[" << token_to_string(token) << "]\n"; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dev: we need to add logging
(fictional) manager: we have logging at home
logging at home:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approving CMake changes - and requesting a couple small C++ include changes.
@gpucibot merge |
Adds nvbench for nested json parser. Depends on #11388 Authors: - Karthikeyan (https://github.com/karthikeyann) - Elias Stehle (https://github.com/elstehle) Approvers: - David Wendt (https://github.com/davidwendt) - Nghia Truong (https://github.com/ttnghia) URL: #11466
Description
This PR builds on the JSON tokenizer algorithm to implement an end-to-end JSON parser that parses to a
table_with_metadata
.Chained PR depending on:
⛓️ #11264
Checklist
Thanks to @karthikeyann who provided the initial implementation for converting internal columnar data to cudf columns.