Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RELEASE] cudf v23.12 #14468

Merged
merged 154 commits into from
Dec 6, 2023
Merged
Changes from 1 commit
Commits
Show all changes
154 commits
Select commit Hold shift + click to select a range
f42231f
v23.12 Updates [skip ci]
raydouglass Sep 22, 2023
e591d00
Merge pull request #14174 from rapidsai/branch-23.10
GPUtester Sep 22, 2023
e889bae
Merge pull request #14176 from rapidsai/branch-23.10
GPUtester Sep 22, 2023
6ebae76
Merge pull request #14177 from rapidsai/branch-23.10
GPUtester Sep 22, 2023
6fea450
Merge pull request #14178 from rapidsai/branch-23.10
GPUtester Sep 22, 2023
007f43c
Merge pull request #14179 from rapidsai/branch-23.10
GPUtester Sep 23, 2023
b438d56
Merge pull request #14180 from rapidsai/branch-23.10
GPUtester Sep 23, 2023
e1a0ea3
Merge pull request #14181 from rapidsai/branch-23.10
GPUtester Sep 25, 2023
294b0b3
Merge pull request #14182 from rapidsai/branch-23.10
GPUtester Sep 25, 2023
5c8a4a0
Merge pull request #14184 from rapidsai/branch-23.10
GPUtester Sep 25, 2023
a5e30de
Merge pull request #14185 from rapidsai/branch-23.10
GPUtester Sep 25, 2023
75b08ee
Merge pull request #14186 from rapidsai/branch-23.10
GPUtester Sep 25, 2023
5f5b55c
Merge pull request #14188 from rapidsai/branch-23.10
GPUtester Sep 25, 2023
77d45e3
Merge pull request #14189 from rapidsai/branch-23.10
GPUtester Sep 25, 2023
4137a97
Merge pull request #14194 from rapidsai/branch-23.10
GPUtester Sep 26, 2023
77f05d4
Merge pull request #14199 from rapidsai/branch-23.10
GPUtester Sep 26, 2023
30093c3
Merge pull request #14203 from rapidsai/branch-23.10
GPUtester Sep 26, 2023
62a49a2
Merge pull request #14204 from rapidsai/branch-23.10
GPUtester Sep 26, 2023
b763b8b
Merge pull request #14207 from rapidsai/branch-23.10
GPUtester Sep 27, 2023
cdc03a7
Marginally reduce memory footprint of joins (#14197)
wence- Sep 27, 2023
b838e64
Merge pull request #14211 from rapidsai/branch-23.10
GPUtester Sep 27, 2023
90bedcb
Merge pull request #14213 from rapidsai/branch-23.10
GPUtester Sep 27, 2023
8342caa
Merge pull request #14220 from rapidsai/branch-23.10
GPUtester Sep 27, 2023
c6fea7d
Merge pull request #14221 from rapidsai/branch-23.10
GPUtester Sep 27, 2023
d01b69e
Merge pull request #14222 from rapidsai/branch-23.10
GPUtester Sep 28, 2023
dea0df0
Merge pull request #14223 from rapidsai/branch-23.10
GPUtester Sep 28, 2023
23d24d4
Add `bytes_per_second` to distinct_count of stream_compaction nvbench…
Blonck Sep 28, 2023
801c9cf
Merge branch-23.10 into branch-23.12
galipremsagar Sep 28, 2023
7825790
Merge pull request #14231 from galipremsagar/branch-23.12-merge-23.10
raydouglass Sep 28, 2023
59b09fd
cuDF: Build CUDA 12.0 ARM conda packages. (#14112)
bdice Sep 29, 2023
5f17c83
Merge pull request #14244 from rapidsai/branch-23.10
GPUtester Oct 3, 2023
7bd435d
Merge pull request #14245 from rapidsai/branch-23.10
GPUtester Oct 3, 2023
29556a2
Remove the use of volatile in ORC (#14175)
vuule Oct 3, 2023
d87e181
Expose streams in binaryop APIs (#14187)
vyasr Oct 4, 2023
b120f7e
Improve `contains_column` by invoking `contains_table` (#14238)
PointKernel Oct 4, 2023
5d311ea
Fix strings replace for adjacent, identical multi-byte UTF-8 characte…
davidwendt Oct 5, 2023
04e2cd6
cudf::detail::pinned_allocator doesn't throw from `deallocate` (#14251)
robertmaynard Oct 6, 2023
fc36947
Fixing parquet list of struct interpretation (#13715)
hyperbolic2346 Oct 6, 2023
96664ec
Add pylibcudf.Scalar that interoperates with Arrow scalars (#14133)
vyasr Oct 6, 2023
e28017c
Cleanup of namespaces in parquet code. (#14259)
nvdbaranec Oct 9, 2023
e345620
Add stream parameter to List Manipulation and Operations APIs (#14248)
SurajAralihalli Oct 9, 2023
b4fd77b
Centralize chunked reading code in the parquet reader to reader_impl_…
nvdbaranec Oct 10, 2023
053da82
Make parquet schema index type consistent (#14256)
hyperbolic2346 Oct 10, 2023
5039d04
Expose stream parameter in public strings APIs (#14260)
davidwendt Oct 10, 2023
c0c7ed8
Add `bytes_per_second` to transpose benchmark (#14170)
Blonck Oct 10, 2023
0ed7725
Add `bytes_per_second` to shift benchmark (#13950)
Blonck Oct 10, 2023
aa8b0f8
Handle empty string correctly in Parquet statistics (#14257)
etseidl Oct 10, 2023
b17904d
Add in java bindings for DataSource (#14254)
revans2 Oct 11, 2023
15baa00
Fixes behaviour for incomplete lines when `recover_with_nulls` is ena…
elstehle Oct 11, 2023
301dce1
Merge pull request #14269 from rapidsai/branch-23.10
GPUtester Oct 11, 2023
aa598bc
Expose stream parameter in public strings split/partition APIs (#14247)
davidwendt Oct 11, 2023
737b759
Use branch-23.12 workflows. (#14271)
bdice Oct 12, 2023
fa4e8ab
Expose stream parameter in public strings replace APIs (#14261)
davidwendt Oct 12, 2023
6e00ad0
Return error if BOOL8 column-type is used with integers-to-hex (#14208)
davidwendt Oct 13, 2023
d590e0b
Expose stream parameter in public strings convert APIs (#14255)
davidwendt Oct 16, 2023
655f3a4
Update rapids-cmake functions to non-deprecated signatures (#14265)
robertmaynard Oct 16, 2023
ef92310
Fix memset error in nvtext::edit_distance_matrix (#14283)
davidwendt Oct 16, 2023
c47546e
Add stream parameter to List Sort and Filter APIs (#14272)
SurajAralihalli Oct 16, 2023
5f05c18
Enable indexalator for device code (#14206)
davidwendt Oct 17, 2023
29b2537
update repo refs (#14289)
AyodeAwe Oct 18, 2023
7aa7579
Add nvtx annotations to spilling-based data movement (#14196)
wence- Oct 18, 2023
d36904b
Downgrade to Arrow 12.0.0 for aws-sdk-cpp and fix cudf_kafka builds f…
vyasr Oct 19, 2023
50e2211
Changes JSON reader's recovery option's behaviour to ignore all chara…
elstehle Oct 20, 2023
daa861a
Merge branch-23.10 into branch-23.12
bdice Oct 20, 2023
17eb4cf
Merge pull request #14307 from bdice/branch-23.12-merge-23.10
AyodeAwe Oct 20, 2023
f7ad66f
Add DELTA_BINARY_PACKED encoder for Parquet writer (#14100)
etseidl Oct 20, 2023
0341bb7
Expose streams in public null mask APIs (#14263)
vyasr Oct 20, 2023
e7c6365
Detect and report errors in Parquet header parsing (#14237)
etseidl Oct 20, 2023
253f6a6
Refactor LogicalType for Parquet (#14264)
etseidl Oct 20, 2023
8ae3aab
Extract `debug_utilities.hpp/cu` from `column_utilities.hpp/cu` (#13720)
ttnghia Oct 23, 2023
e8cf0eb
Expose stream parameter in public strings combine APIs (#14281)
davidwendt Oct 23, 2023
630982a
test is_valid before reading column data (#14318)
etseidl Oct 23, 2023
b390bca
Remove aws-sdk-pinning and revert to arrow 12.0.1 (#14319)
vyasr Oct 24, 2023
19d791c
Avoid `pyarrow.fs` import for local storage (#14321)
rjzamora Oct 24, 2023
bc4d38d
fixing thread index overflow issue (#14290)
hyperbolic2346 Oct 24, 2023
91aeec8
Drop `pyorc` dependency and use `pandas`/`pyarrow` instead (#14323)
galipremsagar Oct 25, 2023
865c21e
Expose stream parameter in public strings contains APIs (#14280)
davidwendt Oct 25, 2023
76bdb82
Run IO tests for Dask-cuDF (#14327)
rjzamora Oct 25, 2023
ce93a00
Minor improvements in `source_info` (#14308)
vuule Oct 25, 2023
203f1df
Temporarily avoid the current build of pydata-sphinx-theme (#14332)
vyasr Oct 26, 2023
2e85a3f
Unpin dask and distributed for 23.12 development (#14320)
galipremsagar Oct 26, 2023
a2abdb1
Fix gtest validity setting for TextTokenizeTest.Vocabulary (#14312)
davidwendt Oct 26, 2023
d8f0790
Move and rename byte-pair-encoding source files (#14284)
davidwendt Oct 26, 2023
f6099ca
Add stream parameter to Set Operations (Public List APIs) (#14305)
SurajAralihalli Oct 27, 2023
7d6c377
Upgrade `arrow` to `13` (#14330)
galipremsagar Oct 27, 2023
52f7d5c
Expose stream parameter in public strings filter APIs (#14293)
davidwendt Oct 27, 2023
83746a4
Expose stream parameter in public nvtext tokenize APIs (#14317)
davidwendt Oct 27, 2023
87d9ba4
Optimize ORC writer for decimal columns (#14190)
vuule Oct 27, 2023
9354fb5
Fix host buffer access from device function in the Parquet reader (#1…
vuule Oct 27, 2023
751370e
Enable `dask_cudf/io` pytests in CI (#14338)
galipremsagar Oct 27, 2023
2bc454a
Reimplement `cudf::merge` for nested types without using comparators …
divyegala Oct 28, 2023
2a923df
Fix the precision when converting a decimal128 column to an arrow arr…
jihoonson Oct 28, 2023
2548509
Add cramjam (#14344)
vyasr Oct 30, 2023
abc0d41
Added streams to JSON reader and writer api (#14313)
shrshi Oct 30, 2023
2abf9a6
Fixes stack context for json lines format that recovers from invalid …
elstehle Oct 31, 2023
b4746d8
Upgrade wheels to use arrow 13 (#14339)
vyasr Oct 31, 2023
f4c95aa
Fix logical type issues in the Parquet writer (#14322)
vuule Oct 31, 2023
7358ecd
Update versioning strategy (#14285)
vyasr Oct 31, 2023
cb06c20
Sort dictionary data alphabetically in the ORC writer (#14295)
vuule Oct 31, 2023
ec080eb
Fix overflow check in `cudf::merge` (#14345)
divyegala Oct 31, 2023
f07d9cc
Add the new manylinux builds to the build job (#14351)
vyasr Nov 1, 2023
56fe5db
Expose stream parameter to get_json_object API (#14297)
davidwendt Nov 1, 2023
f97e74f
Improve performance of nvtext::tokenize_with_vocabulary for long stri…
davidwendt Nov 3, 2023
c8c3e5c
Rework nvtext::byte_pair_encoding API (#14337)
davidwendt Nov 6, 2023
70c4283
Register ``partd`` encode dispatch in ``dask_cudf`` (#14287)
rjzamora Nov 6, 2023
f102ba8
Expose streams in public unary APIs (#14342)
vyasr Nov 6, 2023
16051a7
Expose stream parameter in public nvtext replace APIs (#14329)
davidwendt Nov 7, 2023
2463b3a
Fix a pytest typo in `test_kurt_skew_error` (#14368)
galipremsagar Nov 7, 2023
efdba40
Merge branch-23.10 into branch-23.12
bdice Nov 7, 2023
723f0e4
Upgrade to arrow 14 (#14371)
galipremsagar Nov 8, 2023
cda9b12
Merge remote-tracking branch 'upstream/branch-23.12' into branch-23.1…
bdice Nov 8, 2023
0438ac7
Use 23.12 workflows.
bdice Nov 8, 2023
ec4c47d
Add manylinux tag.
bdice Nov 8, 2023
14449b6
Update wheel names and references to outdated branch.
bdice Nov 8, 2023
d3dcc75
Update README (#14374)
shwina Nov 8, 2023
3b9cb4c
Update test_no_cuinit.py
galipremsagar Nov 8, 2023
96fd963
Merge remote-tracking branch 'upstream/branch-23.10' into branch-23.1…
bdice Nov 8, 2023
8cdedd8
Add cwd="/" to test_s3.py.
bdice Nov 8, 2023
a35f90c
Merge pull request #14372 from bdice/branch-23.12-merge-23.10
raydouglass Nov 8, 2023
c4e6c09
Add python tests for Parquet DELTA_BINARY_PACKED encoder (#14316)
etseidl Nov 8, 2023
7da0336
Split up scan_inclusive.cu to improve its compile time (#14358)
davidwendt Nov 9, 2023
1c6f6b4
Update to Arrow 14.0.1. (#14387)
bdice Nov 9, 2023
f5d3fc1
cudf jit parser now supports .pragma instructions with quotes (#14348)
robertmaynard Nov 9, 2023
9be4de5
Upgrade to nvCOMP 3.0.4 (#13815)
vuule Nov 10, 2023
87d2a36
Remove Cython libcpp wrappers (#14382)
vyasr Nov 10, 2023
04d13d8
Normalizing offsets iterator (#14234)
davidwendt Nov 13, 2023
4313cfa
Use new rapids-dask-dependency metapackage for managing dask versions…
vyasr Nov 13, 2023
5d09d38
Always build nvbench statically so we don't need to package it (#14399)
robertmaynard Nov 14, 2023
e982d37
cudf.pandas: cuDF subpath checking in module `__getattr__` (#14388)
shwina Nov 14, 2023
7f3fba1
Refactor cudf_kafka to use skbuild (#14292)
jdye64 Nov 14, 2023
b0c1b7b
Add BytePairEncoder class to cuDF (#13891)
davidwendt Nov 14, 2023
b446a6f
Fix token-count logic in nvtext::tokenize_with_vocabulary (#14393)
davidwendt Nov 14, 2023
8106a0c
Cleanup remaining usages of dask dependencies (#14407)
galipremsagar Nov 14, 2023
27b052d
Added streams to CSV reader and writer api (#14340)
shrshi Nov 14, 2023
330d389
Ensure nvbench initializes nvml context when built statically (#14411)
robertmaynard Nov 14, 2023
8a0a08f
Fix as_column(pd.Timestamp/Timedelta, length=) not respecting length …
mroeschke Nov 15, 2023
ab2248e
Fix and disable encoding for nanosecond statistics in ORC writer (#14…
vuule Nov 15, 2023
8deb3dd
Raise error in `reindex` when `index` is not unique (#14400)
galipremsagar Nov 15, 2023
9e7f8a5
Fix dask dependency in custreamz (#14420)
vyasr Nov 15, 2023
f9c586d
Support java AST String literal with desired encoding (#14402)
winningsix Nov 16, 2023
afd7d18
Example code for blog on new row comparators (#13795)
divyegala Nov 16, 2023
53127de
Remove needs: wheel-build-cudf. (#14427)
bdice Nov 16, 2023
8e1ef05
Change `nullable()` to `has_nulls()` in `cudf::detail::gather` (#14363)
divyegala Nov 16, 2023
bf63d10
Add decoder for DELTA_BYTE_ARRAY to Parquet reader (#14101)
etseidl Nov 16, 2023
ba5ec40
Enable build concurrency for nightly and merge triggers. (#14441)
bdice Nov 17, 2023
6c2e972
Implement user_datasource_wrapper is_empty() and is_device_read_prefe…
tpn Nov 17, 2023
723c565
Fix intermediate type checking in expression parsing (#14445)
vyasr Nov 18, 2023
3ef13d0
Fix io reference in docs. (#14452)
bdice Nov 20, 2023
823d321
Use `pynvjitlink` for CUDA 12+ MVC (#13650)
brandon-b-miller Nov 20, 2023
fc8c81f
Fix function name typo in `cudf.pandas` profiler (#14514)
galipremsagar Nov 28, 2023
68cb1d9
Merge pull request #14435 from rapidsai/branch-23.10
raydouglass Nov 28, 2023
0a56305
Pin actions/labeler to v4 [skip ci] (#14562)
raydouglass Dec 4, 2023
31aedf2
fix for skip_rows on with page-spanning rows (#14557)
etseidl Dec 5, 2023
8eacf8f
Update Changelog [skip ci]
raydouglass Dec 6, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Ensure nvbench initializes nvml context when built statically (#14411)
Port NVIDIA/nvbench#148 to cudf so that nvbench benchmarks work now that we always use a static version of nvbench.

Authors:
  - Robert Maynard (https://github.com/robertmaynard)

Approvers:
  - Bradley Dice (https://github.com/bdice)

URL: #14411
  • Loading branch information
robertmaynard authored Nov 14, 2023

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
commit 330d389b26a05676d9f079503a3d96b571762337
5 changes: 5 additions & 0 deletions cpp/cmake/thirdparty/patches/nvbench_override.json
Original file line number Diff line number Diff line change
@@ -7,6 +7,11 @@
"file" : "${current_json_dir}/nvbench_global_setup.diff",
"issue" : "Fix add support for global setup to initialize RMM in nvbench [https://github.com/NVIDIA/nvbench/pull/123]",
"fixed_in" : ""
},
{
"file" : "nvbench/nvml_with_static_builds.diff",
"issue" : "Add support for nvml with static nvbench [https://github.com/NVIDIA/nvbench/pull/148]",
"fixed_in" : ""
}
]
}