-
Notifications
You must be signed in to change notification settings - Fork 920
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] get_json_object parsing values. #10196
Comments
Hello @nvdbaranec would you please comment on this request and the scope of adding behavioral flags to |
This issue has been labeled |
Still relevant. |
Would you please clarify the use case? As I understand, the use case reads a text file containing JSON records, calls
Would this issue be resolved by "Support nested types in JSON reader" (#8827)? |
Hello @GregoryKimball , If we can parse the column with json string, that would make other jobs for us easy too. Our Usecase: If we are able to parse the string as an json object, I blv this should solve our use-case as long it returns Null when a field/value is not present. If you have any further questions or wanting to discuss this further, please let me know. |
Thanks @mlahir1 for your message. I'd like to propose a different JsonPath approach. I recommend iterating through your objects with JsonPath
Now the contents of
|
This issue has been labeled |
Still relevant. |
Hi @GregoryKimball , it is an interesting solution about |
The processing of the json strings is serial within each thread, but it is parallelized at the row level. So we have 1 thread per row in the input. |
Addresses: #10196 Previously, `get_json_object()` ignored fields in a JsonPath expression that are missing in the json string. This PR adds the option to return these missing fields as null instead. Authors: - Srikar Vanavasam (https://github.com/SrikarVanavasam) Approvers: - Nghia Truong (https://github.com/ttnghia) - MithunR (https://github.com/mythrocks) - David Wendt (https://github.com/davidwendt) URL: #10970
This PR exposes `get_json_object_options` to the Python API. Addresses #10196 Authors: - Srikar Vanavasam (https://github.com/SrikarVanavasam) Approvers: - Ashwin Srinath (https://github.com/shwina) - Paul Taylor (https://github.com/trxcllnt) - Vyas Ramasubramani (https://github.com/vyasr) URL: #11180
when the field is not present in the nested json field, it doesn't return empty list/NA, rather it just ignores the field.
This way it becomes impossible to map the values to right rows after mapping.
The text was updated successfully, but these errors were encountered: