Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use cuco::static_map to build string dictionaries in ORC writer #13580

Merged
merged 59 commits into from
Jul 14, 2023
Merged
Show file tree
Hide file tree
Changes from 48 commits
Commits
Show all changes
59 commits
Select commit Hold shift + click to select a range
06130bd
remove unused members
vuule Jun 4, 2023
8e75032
simplify char count
vuule Jun 4, 2023
9db5f13
replace chunk dict char count
vuule Jun 6, 2023
f04871c
remove string_char_count from DictionaryChunk
vuule Jun 6, 2023
23b3a16
allocate dict storage
vuule Jun 6, 2023
3f59f73
add char_count
vuule Jun 6, 2023
25342c4
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jun 7, 2023
29f25a2
init hash maps
vuule Jun 8, 2023
1191626
build dictionaries; decide encode
vuule Jun 8, 2023
1b18ac7
remove debug prints
vuule Jun 8, 2023
ff621f3
fix indexing
vuule Jun 9, 2023
3e564d2
take dict index entry size into account
vuule Jun 9, 2023
1916898
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jun 9, 2023
e36308a
dict_data alloc
vuule Jun 13, 2023
3b00770
collect_map_entries
vuule Jun 13, 2023
1562e4b
get_dictionary_indices
vuule Jun 13, 2023
5a152cc
replace stripe dict counters
vuule Jun 13, 2023
8f7ef1d
fix lifetime
vuule Jun 13, 2023
3a6d95b
full replacement (?)
vuule Jun 14, 2023
19650e6
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jun 14, 2023
b793c02
Merge branch 'branch-23.08' into fea-write-orc-dict
vuule Jun 14, 2023
7bedaa3
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jun 22, 2023
2be21b2
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jun 23, 2023
4935fb5
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jun 26, 2023
0bd10d7
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jun 26, 2023
217b1c9
clean up mem after use
vuule Jun 26, 2023
84b45c2
separate dict build
vuule Jun 26, 2023
8bd1bc0
proper is_enabled
vuule Jun 26, 2023
08d2ab7
Merge branch 'fea-write-orc-dict' of https://github.com/vuule/cudf in…
vuule Jun 26, 2023
6f96e22
remove dict size limit; docs
vuule Jun 27, 2023
bed6883
docs
vuule Jun 27, 2023
22843f5
impl docs
vuule Jun 27, 2023
9500780
comments
vuule Jun 27, 2023
e1b8e94
constttttttttttttt
vuule Jun 28, 2023
588f0f1
namespace
vuule Jun 28, 2023
b978776
header clean up
vuule Jun 28, 2023
5c51670
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jun 28, 2023
54abee1
Merge branch 'branch-23.08' into fea-write-orc-dict
vuule Jun 30, 2023
99cf647
Merge branch 'branch-23.08' into fea-write-orc-dict
vuule Jul 6, 2023
611ae72
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jul 7, 2023
c8c6aab
Apply suggestions from code review
vuule Jul 7, 2023
75203d1
simplify populate_dictionary_hash_maps_kernel
vuule Jul 7, 2023
6ce882e
Merge branch 'fea-write-orc-dict' of https://github.com/vuule/cudf in…
vuule Jul 7, 2023
56bee95
Update cpp/src/io/orc/dict_enc.cu
vuule Jul 7, 2023
80535f6
Merge branch 'fea-write-orc-dict' of https://github.com/vuule/cudf in…
vuule Jul 7, 2023
4b00841
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jul 10, 2023
908186c
populate_dictionary_hash_maps_kernel optimization/clean up
vuule Jul 10, 2023
2ba5c56
Merge branch 'branch-23.08' into fea-write-orc-dict
vuule Jul 11, 2023
6e37d08
group headers
vuule Jul 11, 2023
d9732c9
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jul 11, 2023
5c8112f
Merge branch 'fea-write-orc-dict' of https://github.com/vuule/cudf in…
vuule Jul 11, 2023
a7081f2
Merge branch 'branch-23.08' into fea-write-orc-dict
vuule Jul 12, 2023
b5ef2ff
Merge branch 'branch-23.08' into fea-write-orc-dict
vuule Jul 13, 2023
43145a9
Merge branch 'branch-23.08' of https://github.com/rapidsai/cudf into …
vuule Jul 13, 2023
7d0cd4d
fix sync issue
vuule Jul 13, 2023
2f2c1ce
col ref
vuule Jul 13, 2023
f8b8f6d
Merge branch 'fea-write-orc-dict' of https://github.com/vuule/cudf in…
vuule Jul 13, 2023
d3ac7f9
code review missed
vuule Jul 13, 2023
2704328
bail if no string cols
vuule Jul 14, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading