Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[REVIEW] Fix cudf::strings:split logic for many columns #4922

Merged
Merged
Show file tree
Hide file tree
Changes from 19 commits
Commits
Show all changes
31 commits
Select commit Hold shift + click to select a range
0d045e5
move split-record fn to separate source file
davidwendt Apr 14, 2020
e6f0d35
recoded split/rsplit for perf
davidwendt Apr 16, 2020
c18eb2c
fix test::print for strings columns with no nullmask
davidwendt Apr 16, 2020
7b6ead3
add more tests using maxsplits parm
davidwendt Apr 16, 2020
6dcfc91
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 16, 2020
fa3135a
update comments and remove prints
davidwendt Apr 16, 2020
cc96b68
fix all-null special case
davidwendt Apr 16, 2020
43fde57
hates compile warnings
davidwendt Apr 17, 2020
36659c1
workaround compile segfault
davidwendt Apr 17, 2020
33dea6c
update changelog
davidwendt Apr 17, 2020
e225d7c
fix changelog entry
davidwendt Apr 17, 2020
a709fa7
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 17, 2020
0cb9b11
add base class for split/rsplit fns
davidwendt Apr 17, 2020
41c0eff
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 17, 2020
9340e8a
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 17, 2020
fb114f1
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 20, 2020
9832fef
handle empty column
davidwendt Apr 20, 2020
fa59935
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 20, 2020
bb8cb7b
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 21, 2020
772237c
fix merge conflicts from clang format
davidwendt Apr 23, 2020
3f0706b
fix merge conflict from clang format again
davidwendt Apr 23, 2020
0eb2392
clang-format new file
davidwendt Apr 23, 2020
f2a8142
add tests for overlapped delimiter
davidwendt Apr 23, 2020
789a37a
update parameter name in declarations
davidwendt Apr 23, 2020
520e29c
fix comment
davidwendt Apr 23, 2020
26fb2f1
one more test string
davidwendt Apr 23, 2020
80188e0
forgot clang format
davidwendt Apr 23, 2020
a258cf3
add factory with iterator parms
davidwendt Apr 24, 2020
b1bc0dd
place tokens in column order; use new iterator factory
davidwendt Apr 24, 2020
90f98a9
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 24, 2020
5bc4ede
Merge branch 'branch-0.14' into perf-strings-split-to-many-columns
davidwendt Apr 24, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -180,6 +180,7 @@
- PR #4915 Fix to `reset_index` inplace in MultiIndex and other places
- Pr #4899 Fix series inplace handling
- PR #4889 Fix multi-index merging
- PR #4922 Fix cudf::strings:split logic for many columns
- PR #4949 Fix scatter, gather benchmark constructor call


Expand Down
1 change: 1 addition & 0 deletions cpp/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -630,6 +630,7 @@ add_library(cudf
src/strings/sorting/sorting.cu
src/strings/split/partition.cu
src/strings/split/split.cu
src/strings/split/split_record.cu
src/strings/strings_column_factories.cu
src/strings/strings_column_view.cu
src/strings/strings_scalar_factories.cpp
Expand Down
2 changes: 1 addition & 1 deletion cpp/src/rolling/rolling.cu
Original file line number Diff line number Diff line change
Expand Up @@ -368,7 +368,7 @@ struct rolling_window_launcher
// The rows that represent null elements will be having negative values in gather map,
// and that's why nullify_out_of_bounds/ignore_out_of_bounds is true.
auto output_table = detail::gather(table_view{{input}}, output->view(), false, true, false, mr, stream);
return std::make_unique<cudf::column>(std::move(output_table->get_column(0)));;
output = std::make_unique<cudf::column>(std::move(output_table->get_column(0)));;
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This simply removes a compile warning.

}

return output;
Expand Down
1,439 changes: 650 additions & 789 deletions cpp/src/strings/split/split.cu

Large diffs are not rendered by default.

Loading