Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JSON tree traversal #11610

Merged
merged 85 commits into from
Sep 24, 2022
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
85 commits
Select commit Hold shift + click to select a range
cdc98d6
pull changes from PR #11291
karthikeyann Aug 11, 2022
d074579
fix minor bug, cleanup unit test
karthikeyann Aug 11, 2022
f5287a5
add struct member begin, end tokens
karthikeyann Aug 11, 2022
d694f21
add get_tree_representation_gpu initial version
karthikeyann Aug 11, 2022
96eca61
Merge branch 'branch-22.10' of https://github.com/rapidsai/cudf into …
karthikeyann Aug 14, 2022
34ba420
delete repeated nested_json_test.cu
karthikeyann Aug 14, 2022
a083709
add print_tree debug print
karthikeyann Aug 23, 2022
1768fe7
fix valueEnd, structMemberEnd, add relevant unit test
karthikeyann Aug 24, 2022
4964826
add unit test for get_tree_representation with valueend space
karthikeyann Aug 24, 2022
4007f6a
cleanup, organize code
karthikeyann Aug 24, 2022
0c092ea
Merge branch 'branch-22.10' of https://github.com/rapidsai/cudf into …
karthikeyann Aug 26, 2022
8ff75a9
cleanup
karthikeyann Aug 26, 2022
f73a187
doc
karthikeyann Aug 26, 2022
93719e4
col_id generation for record orient
karthikeyann Aug 26, 2022
018a779
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 1, 2022
e531cba
merge fix, review comments
karthikeyann Sep 1, 2022
24d20e6
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 1, 2022
94bdd1f
cleanup tests
karthikeyann Sep 5, 2022
edb78ad
add StructMemberEnd whitespace, newline test cases
karthikeyann Sep 5, 2022
ab1db5b
include_quote_chars change in tree generation
karthikeyann Sep 5, 2022
0ba90db
Merge branch 'branch-22.10' into fea-json-tree-gpu
karthikeyann Sep 5, 2022
d124241
Merge branch 'fea-json-tree-gpu' of github.com:karthikeyann/cudf into…
karthikeyann Sep 6, 2022
15570d4
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 7, 2022
522052e
row offset cpu version reference implementation
karthikeyann Sep 7, 2022
e88d08f
comment out debug prints in level processing for col_id
karthikeyann Sep 7, 2022
b505e76
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 8, 2022
ab4c7a1
temporary workaround until PR #11574 merge
karthikeyann Sep 8, 2022
a146936
add nvtx func ranges
karthikeyann Sep 8, 2022
6591fa5
Merge branch 'fea-json-tree-gpu' of github.com:karthikeyann/cudf into…
karthikeyann Sep 8, 2022
b85c01d
initial code for row_offsets generation
karthikeyann Sep 8, 2022
6eb9bbe
address review comments
karthikeyann Sep 9, 2022
5693ba6
cleanup debug print functions, some functions to stable for debugging
karthikeyann Sep 9, 2022
73c613b
fix bug: carry over col_id from previous stage due to sort reorder.
karthikeyann Sep 9, 2022
38b99cf
address review comments
karthikeyann Sep 12, 2022
285298a
use scatter instead of scatter_by_key, resuse token_levels memory
karthikeyann Sep 12, 2022
bffd961
debug print json string in token order
karthikeyann Sep 12, 2022
f452f8a
Merge branch 'fea-json-tree-gpu' of github.com:karthikeyann/cudf into…
karthikeyann Sep 12, 2022
90cd2dc
review comments (davidwendt)
karthikeyann Sep 13, 2022
af399b3
skip restore parent_node_ids order by making a copy
karthikeyann Sep 13, 2022
03e6826
use NodeIndexT for col_ids type
karthikeyann Sep 15, 2022
439cfa0
create column_order from struct field order
karthikeyann Sep 15, 2022
222cd60
documentation and cleanup
karthikeyann Sep 17, 2022
89188cd
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 17, 2022
a158220
Merge branch 'fea-json-tree-gpu' of github.com:karthikeyann/cudf into…
karthikeyann Sep 17, 2022
07caae7
copied pytest change from PR #11574 to pass
karthikeyann Sep 18, 2022
900bdc1
fix memory error off-by-one write out of bounds
karthikeyann Sep 19, 2022
3b75751
fix bug: call missing get_token call for node_range_end
karthikeyann Sep 19, 2022
2c2f43f
wrap debug prints within debug macro flag
karthikeyann Sep 19, 2022
efbff72
cleanup PRINT_LEVEL_DATA
karthikeyann Sep 19, 2022
202f7aa
rename is_field_name_node lambda
karthikeyann Sep 19, 2022
2bdf767
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 19, 2022
7661669
is_node return type to bool (addressed review comment)
karthikeyann Sep 19, 2022
e69038b
add json_lines test input
karthikeyann Sep 19, 2022
9f39a0e
Merge branch 'fea-json-tree-gpu' of github.com:karthikeyann/cudf into…
karthikeyann Sep 19, 2022
b8367a7
fix missing get_token_index in range_end
karthikeyann Sep 19, 2022
7c4a490
Revert "fix missing get_token_index in range_end"
karthikeyann Sep 19, 2022
d3725a8
Revert "Revert "fix missing get_token_index in range_end""
karthikeyann Sep 19, 2022
0a318a1
Merge branch 'fea-json-tree-gpu' of github.com:karthikeyann/cudf into…
karthikeyann Sep 19, 2022
2f34d3a
remove unnecessary __host__ __device__
karthikeyann Sep 19, 2022
7fff746
Merge branch 'fea-json-tree-gpu' of github.com:karthikeyann/cudf into…
karthikeyann Sep 19, 2022
ee080ea
add json-lines support in json tree traversal
karthikeyann Sep 19, 2022
cffaece
address review comments (removed unused 0 condition)
karthikeyann Sep 19, 2022
1bc51b6
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 20, 2022
a87f08d
address review comments, reduce level_boundaries memory usage
karthikeyann Sep 20, 2022
68f2968
cleanup, optimizations - use scatter, skip needless restore order by …
karthikeyann Sep 21, 2022
353dcc8
use size_type for NodeIndexT, -1 as parent_node_sentinel
karthikeyann Sep 21, 2022
a729729
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 21, 2022
8ef4a4b
move debug print to anonymous namespace, cleanup comments
karthikeyann Sep 21, 2022
d3e48c7
cleanup compare_vector, debug print with env,
karthikeyann Sep 21, 2022
113ea64
address review comments
karthikeyann Sep 22, 2022
173065d
replace merge sort stable with 2 radix stable sort
karthikeyann Sep 22, 2022
619bb72
reorganise code into functions
karthikeyann Sep 22, 2022
898bd25
cleanup debug prints verbose typing
karthikeyann Sep 22, 2022
2791684
move parent_indices translation to a function
karthikeyann Sep 22, 2022
24a65a9
Update cpp/tests/io/json_tree.cpp
karthikeyann Sep 22, 2022
66e1c7f
address review comments
karthikeyann Sep 22, 2022
e11da8d
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 22, 2022
1239514
address review comments
karthikeyann Sep 23, 2022
ecb64be
remove NVTX (helps code review)
karthikeyann Sep 23, 2022
3eb8326
removed debug prints
karthikeyann Sep 23, 2022
f9e2837
performance changes, replace sort with scatter
karthikeyann Sep 23, 2022
7126c07
remove default arg to stream
karthikeyann Sep 23, 2022
d898aba
address review comments (davidwendt)
karthikeyann Sep 23, 2022
aafab03
move sequence to scope of for-loop
karthikeyann Sep 23, 2022
888a83f
Merge branch 'branch-22.10' of github.com:rapidsai/cudf into fea-json…
karthikeyann Sep 24, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading