[RELEASE] cudf v23.04 #13038

Merged: 226 commits, Apr 12, 2023
Showing changes from 215 commits

Commits
1fcd1be
DOC
raydouglass Jan 23, 2023
136b24c
Merge pull request #12591 from rapidsai/branch-23.02
GPUtester Jan 23, 2023
2b854ed
Merge pull request #12594 from rapidsai/branch-23.02
GPUtester Jan 23, 2023
d4351bd
Merge pull request #12596 from rapidsai/branch-23.02
GPUtester Jan 23, 2023
8ef24fe
Merge pull request #12597 from rapidsai/branch-23.02
GPUtester Jan 24, 2023
acb3434
Merge pull request #12600 from rapidsai/branch-23.02
GPUtester Jan 24, 2023
6a318d9
Merge pull request #12601 from rapidsai/branch-23.02
GPUtester Jan 24, 2023
e3d3207
Merge pull request #12612 from rapidsai/branch-23.02
GPUtester Jan 25, 2023
45b62d0
Merge pull request #12614 from rapidsai/branch-23.02
GPUtester Jan 25, 2023
cdca285
Merge pull request #12615 from rapidsai/branch-23.02
GPUtester Jan 25, 2023
e781300
Merge pull request #12617 from rapidsai/branch-23.02
GPUtester Jan 26, 2023
38ec89e
Merge pull request #12618 from rapidsai/branch-23.02
GPUtester Jan 26, 2023
5994619
Merge pull request #12631 from rapidsai/branch-23.02
GPUtester Jan 27, 2023
ae89e0d
Merge pull request #12635 from rapidsai/branch-23.02
GPUtester Jan 27, 2023
50ca2ac
Merge pull request #12636 from rapidsai/branch-23.02
GPUtester Jan 28, 2023
7013347
Init JNI version 23.04.0-SNAPSHOT (#12599)
pxLi Jan 30, 2023
c115cff
Merge pull request #12649 from rapidsai/branch-23.02
GPUtester Jan 30, 2023
9053240
Merge pull request #12651 from rapidsai/branch-23.02
GPUtester Jan 30, 2023
09d3061
Add comment about CUB patch for SegmentedSortInt.Bool gtest (#12611)
davidwendt Jan 30, 2023
1204a2e
Merge pull request #12653 from rapidsai/branch-23.02
GPUtester Jan 31, 2023
6b0eb00
Merge pull request #12657 from rapidsai/branch-23.02
GPUtester Jan 31, 2023
d715ac8
Pay off some JNI RMM API tech debt (#12632)
revans2 Jan 31, 2023
6d8a01e
Pin cuda-nvrtc. (#12606)
bdice Jan 31, 2023
ab03c50
Merge pull request #12664 from rapidsai/branch-23.02
GPUtester Jan 31, 2023
2b270da
Remove cudf::test::print calls from libcudf gtests (#12604)
davidwendt Jan 31, 2023
6a67e8f
Fix AllocateLikeTest gtests reading uninitialized null-mask (#12643)
davidwendt Jan 31, 2023
5b278b1
Merge pull request #12672 from rapidsai/branch-23.02
GPUtester Feb 1, 2023
3fa081a
Fix dask-cudf read_parquet bug for multi-file aggregation (#12663)
rjzamora Feb 2, 2023
3aab6b8
Handle bool types in `round` API (#12670)
galipremsagar Feb 2, 2023
32f5efa
Fix `from_arrow` to load a sliced arrow table (#12665)
galipremsagar Feb 2, 2023
21ef256
Remove DataFrame.quantiles from docs. (#12684)
bdice Feb 2, 2023
182ee2c
Add compound aggregations to cudf::segmented_reduce (#12573)
davidwendt Feb 3, 2023
e380331
Remove child fom newCudaAsyncMemoryResource (#12681)
abellina Feb 3, 2023
17554ad
Add `regex_program` searching APIs and related java classes (#12666)
cindyyuanjiang Feb 3, 2023
97a746f
Merge pull request #12697 from rapidsai/branch-23.02
GPUtester Feb 3, 2023
8286001
Move date to build string in `conda` recipe (#12661)
ajschmidt8 Feb 6, 2023
c7db81a
Add `regex_program` strings extract java APIs and tests (#12699)
cindyyuanjiang Feb 6, 2023
0574979
Merge branch-23.02 into branch-23.04
galipremsagar Feb 6, 2023
a1451e3
Merge pull request #12709 from galipremsagar/branch-23.04-merge-23.02
raydouglass Feb 6, 2023
d1cebdb
Refactor reduction logic for fixed-point types (#12652)
davidwendt Feb 7, 2023
291c751
Add build dependencies to Java tests. (#12723)
bdice Feb 7, 2023
37fe468
Enable doctests for GroupBy methods (#12658)
brandon-b-miller Feb 8, 2023
b87b64f
Convert `rank` to use to experimental row comparators (#12481)
divyegala Feb 8, 2023
fea6288
Add `regex_program` strings splitting java APIs and tests (#12713)
cindyyuanjiang Feb 8, 2023
b8ae0e4
Support conversion to/from cudf in dask.dataframe.core.to_backend (#1…
rjzamora Feb 8, 2023
8ad4166
Remove cudf::strings::repeat_strings_output_sizes and optional parame…
davidwendt Feb 8, 2023
476d5bb
Handle parquet list data corner case (#12698)
nvdbaranec Feb 8, 2023
89ec635
Update shared workflow branches (#12733)
ajschmidt8 Feb 8, 2023
d3f9daf
Fix faulty conditional logic in JIT `GroupBy.apply` (#12706)
brandon-b-miller Feb 8, 2023
0161ba8
Add `regex_program` strings replacing java APIs and tests (#12701)
cindyyuanjiang Feb 8, 2023
c20c8b4
Bump pinned rapids wheel deps to 23.4 (#12735)
sevagh Feb 8, 2023
3e4ff2a
Reduce error handling verbosity in CI tests scripts (#12738)
AjayThorve Feb 9, 2023
74efb79
Allow setting the seed argument for hash partition (#12715)
firestarman Feb 9, 2023
b0335f0
`partition_by_hash()`: use `_split()` (#12704)
madsbk Feb 9, 2023
8d17379
Fix memcheck read error in compound segmented reduce (#12722)
davidwendt Feb 9, 2023
f0a17ca
Merge pull request #12744 from rapidsai/branch-23.02
GPUtester Feb 9, 2023
80a29ce
Merge pull request #12747 from rapidsai/branch-23.02
GPUtester Feb 9, 2023
0cab19a
Reduce the number of test cases in multibyte_split benchmark (#12737)
PointKernel Feb 9, 2023
c931d5a
Update default data source in cuio reader benchmarks (#12740)
PointKernel Feb 10, 2023
048f936
Unpin `dask` and `distributed` for development (#12710)
galipremsagar Feb 10, 2023
c4a1389
Fix `Series` comparison vs scalars (#12519)
brandon-b-miller Feb 10, 2023
2d7e79a
Add nvbench environment class for initializing RMM in benchmarks (#12…
davidwendt Feb 10, 2023
1c0224f
Reenable stream identification library in CI (#12714)
vyasr Feb 10, 2023
8630e7c
Remove gpuCI scripts. (#12712)
bdice Feb 11, 2023
bad94b9
JNI switches to nested JSON reader (#12732)
res-life Feb 13, 2023
53183cd
Fix missing trailing comma in json writer (#12688)
karthikeyann Feb 13, 2023
12410d9
Stop force pulling fmt in nvbench. (#12768)
vyasr Feb 15, 2023
d787ff2
Add `GroupBy.dtypes` (#12783)
galipremsagar Feb 16, 2023
e4ffcbb
Fast path for `experimental::row::equality` (#12676)
divyegala Feb 16, 2023
506a479
Merge `copy-on-write` feature branch into `branch-23.04` (#12619)
galipremsagar Feb 16, 2023
7190e33
Update `is_sorted` to use `experimental::row::lexicographic` (#12752)
divyegala Feb 16, 2023
a00295a
Fix bug in all-null list due to join_list_elements special handling (…
karthikeyann Feb 16, 2023
cf82fb8
Tell cudf_kafka to use header-only fmt (#12796)
vyasr Feb 16, 2023
4e32bfe
Fix groupby gtests coded in namespace cudf::test (#12784)
davidwendt Feb 16, 2023
79a924a
Update `hash_partition` to use `experimental::row::row_hasher` (#12761)
divyegala Feb 17, 2023
2969b24
Produce useful guidance on overflow error in `to_csv` (#12705)
wence- Feb 17, 2023
b1aa5d2
Fix a bug with `num_keys` in `_scatter_by_slice` (#12749)
thomcom Feb 17, 2023
ec8704a
Fix a leak in a test and clarify some test names (#12781)
revans2 Feb 17, 2023
94bbc82
Add build metrics report as artifact to cpp-build workflow (#12750)
davidwendt Feb 19, 2023
c2f0161
Changing `cudf::io::source_info` to use `cudf::host_span<std::byte>` …
hyperbolic2346 Feb 21, 2023
7da233b
Rework logic in cudf::strings::split_record to improve performance (#…
davidwendt Feb 21, 2023
a308b24
Remove now redundant cuda initialization (#12758)
vyasr Feb 21, 2023
904b8c7
Fixing parquet coalescing of reads (#12808)
hyperbolic2346 Feb 22, 2023
f90ae52
Move `strings_udf` code into cuDF (#12669)
brandon-b-miller Feb 22, 2023
d441f51
Enable groupby std and variance aggregation types in libcudf Debug bu…
davidwendt Feb 22, 2023
d077c9b
Variable fragment sizes for Parquet writer (#12685)
etseidl Feb 22, 2023
a96b150
Stop using versioneer to manage versions (#12741)
vyasr Feb 23, 2023
f076905
Add segmented reduction support for fixed-point types (#12680)
davidwendt Feb 23, 2023
fffdc0c
Add compute-sanitizer github workflow action to nightly tests (#12800)
davidwendt Feb 23, 2023
5719463
Add docs build job (#12592)
AyodeAwe Feb 23, 2023
430d91e
Shuffling read into a sub function in parquet read (#12809)
hyperbolic2346 Feb 23, 2023
e64e26e
Expose seed argument to hash_values (#12795)
ayushdg Feb 24, 2023
2e80eba
Fix parquet `RangeIndex` bug (#12838)
rjzamora Feb 24, 2023
0e4e6dd
Add `always_nullable` flag to Dremel encoding (#12727)
divyegala Feb 24, 2023
8a7fb2f
Deprecate `inplace` parameters in categorical methods (#12824)
galipremsagar Feb 24, 2023
54ee14e
Update datasets download URL (#12840)
jjacobelli Feb 24, 2023
12e4501
Remove KAFKA_HOST_TEST from compute-sanitizer check (#12831)
davidwendt Feb 24, 2023
77c2e03
Consolidate linter configs into pyproject.toml (#12834)
vyasr Feb 24, 2023
4f2f379
Enable nbqa pre-commit hooks for isort and black. (#12848)
bdice Feb 24, 2023
d14d980
Add dfg as a pre-commit hook (#12819)
vyasr Feb 24, 2023
eb4da93
CI: Remove specification of manual stage for check_style.sh script. (…
csadorf Feb 25, 2023
173459e
Replace message parsing with throwing more specific exceptions (#12426)
vyasr Feb 25, 2023
ac1cac6
Add JNI methods for detecting and purging non-empty nulls from LIST a…
razajafri Feb 27, 2023
2025783
Improve performance for cudf::strings::count_characters for long stri…
davidwendt Feb 27, 2023
9a91270
Remove tokenizers pre-install pinning. (#12854)
vyasr Feb 28, 2023
8a45ae2
Make docs builds less verbose (#12836)
AyodeAwe Feb 28, 2023
a7e5092
Update RMM allocators (#12861)
pentschev Feb 28, 2023
afdb51b
Adds JSON reader, writer io benchmark (#12753)
karthikeyann Mar 1, 2023
195e2f7
Fix docs build to be `pydata-sphinx-theme=0.13.0` compatible (#12874)
galipremsagar Mar 1, 2023
40e56c9
Parquet writer column_size() should return a size_t (#12870)
etseidl Mar 1, 2023
8747daa
Use python -m pytest for nightly wheel tests (#12871)
bdice Mar 2, 2023
00c887b
Deallocate encoded data in ORC writer immediately after compression (…
vuule Mar 4, 2023
2689bb6
Throw an exception if an unsupported page encoding is detected in Par…
etseidl Mar 4, 2023
45a9e82
Use test paths relative to package directory. (#12751)
bdice Mar 4, 2023
77b5014
Expect cupy to now support bool arrays for dlpack. (#12883)
vyasr Mar 6, 2023
b6d9fd1
Split C++ and Python build dependencies into separate lists. (#12724)
bdice Mar 6, 2023
39398e4
Deprecate `datetime_is_numeric` from `describe` (#12818)
galipremsagar Mar 6, 2023
7dade51
Migrate as much as possible to pyproject.toml (#12850)
vyasr Mar 6, 2023
618194d
Deprecate `names` & `dtype` in `Index.copy` (#12825)
galipremsagar Mar 6, 2023
a7ede21
min_rows and num_rows are swapped in ComputePageSizes declaration in …
etseidl Mar 6, 2023
f00baad
Update to GCC 11 (#12868)
bdice Mar 6, 2023
98b92a5
Update to protobuf>=4.21.6,<4.22. (#12864)
bdice Mar 6, 2023
0ee82c9
Update JNI build ENV default to gcc 11 (#12881)
pxLi Mar 7, 2023
a78fdd5
Fix cudf::hash_partition kernel launch error with decimal128 types (#…
davidwendt Mar 7, 2023
6d1f8e3
Deprecate `na_sentinel` in `factorize` (#12817)
galipremsagar Mar 7, 2023
97d8d12
Update `distinct/unique_count` to `experimental::row` hasher/comparat…
divyegala Mar 7, 2023
5b62562
Fix `moto` env vars & pass `AWS_SESSION_TOKEN` to conda builds (#12902)
ajschmidt8 Mar 8, 2023
50f7e68
Deprecate `Index.is_*` methods (#12820)
galipremsagar Mar 8, 2023
553162c
Add performance benchmarks to user facing docs (#12595)
galipremsagar Mar 8, 2023
ef9780f
Add try/except for expected null-schema error in read_parquet (#12756)
rjzamora Mar 8, 2023
4497d62
Deprecate `line_terminator` in favor of `lineterminator` in `to_csv` …
wence- Mar 8, 2023
e16cf2d
Pass `SCCACHE_S3_USE_SSL` to conda builds (#12910)
ajschmidt8 Mar 8, 2023
be0f583
Add `stream` and `mr` parameters for `structs::detail::flatten_nested…
ttnghia Mar 9, 2023
a4e58eb
Rewrite CSV writer benchmark with nvbench (#12901)
PointKernel Mar 9, 2023
9299c2b
Deprecate libcudf regex APIs accepting pattern strings directly (#12891)
davidwendt Mar 9, 2023
ea62e0e
Drop Python 3.7 handling for pickle protocol 4 (#12857)
jakirkham Mar 9, 2023
ff96cd1
Fix FST, JSON gtests & benchmarks coded in namespace cudf::test (#12907)
karthikeyann Mar 9, 2023
01375d0
Fix io/text gtests coded in namespace cudf::test (#12914)
karthikeyann Mar 9, 2023
9ec5477
Fix benchmarks coded in namespace cudf and using namespace cudf (#12915)
karthikeyann Mar 9, 2023
3048791
Change return type of `cudf::structs::detail::flatten_nested_columns`…
ttnghia Mar 9, 2023
52c675a
Allow casting from `UDFString` back to `StringView` to call methods i…
brandon-b-miller Mar 9, 2023
f5bb7b1
Fix an issue with parquet chunked reader undercounting string lengths…
nvdbaranec Mar 9, 2023
02d3751
Make string methods return a Series with a useful Index (#12814)
shwina Mar 9, 2023
e37bddb
Remove return type from @return doxygen tags (#12908)
davidwendt Mar 10, 2023
4da6b19
Fix null hive-partition behavior in dask-cudf parquet (#12866)
rjzamora Mar 10, 2023
e591f68
Fix libcudf gtests to pass null-count=0 for empty validity masks (#12…
davidwendt Mar 10, 2023
e4557cb
Update minimum `pandas` and `numpy` pinnings (#12887)
galipremsagar Mar 10, 2023
697d78a
Migrate parquet encoding to use experimental row operators (#12918)
PointKernel Mar 10, 2023
f216c0b
Add logging to libcudf (#12637)
vuule Mar 10, 2023
7bc4a7e
Fix passing seed parameter to MurmurHash3_32 in cudf::hash() function…
davidwendt Mar 13, 2023
ec746cf
Use get_current_device_resource for intermediate allocations in COLLE…
karthikeyann Mar 13, 2023
0723f3f
Ensure all of device bitmask is initialized in from_arrow (#12668)
wence- Mar 13, 2023
9c9dd54
Remove default parameters from detail headers in include (#12888)
vyasr Mar 13, 2023
3584739
Setting a threshold for KvikIO IO (#12841)
madsbk Mar 13, 2023
55ed347
Enable detection of undesired stream usage (#12089)
vyasr Mar 14, 2023
fbbf1fa
Add README symlink for dask-cudf. (#12946)
bdice Mar 14, 2023
745d35e
Refactor `io::orc::ProtobufWriter` (#12877)
ttnghia Mar 14, 2023
6c8bf45
Avoid building cython twice (#12945)
galipremsagar Mar 15, 2023
dfa9e93
Remove remaining default stream parameters (#12943)
vyasr Mar 15, 2023
e9ec83f
Fix set index error for Series rolling window operations (#12942)
galipremsagar Mar 15, 2023
ced3fdf
Preserve integer dtype of hive-partitioned column containing nulls (#…
rjzamora Mar 15, 2023
7776e0e
Update libcudf counting functions to specify cudf::size_type (#12904)
davidwendt Mar 15, 2023
1b78039
Fix cudf::segmented_reduce gtest for ANY aggregation (#12940)
davidwendt Mar 15, 2023
3c72eb0
Generate pyproject dependencies using dfg (#12906)
vyasr Mar 15, 2023
a33e368
Mark dlpack tensor deleter as noexcept to match PyCapsule_Destructor …
bdice Mar 15, 2023
6d264b2
Make Parquet writer `nullable` option application to single table wri…
vuule Mar 15, 2023
9ceecb1
Implement `groupby.head` and `groupby.tail` (#12939)
wence- Mar 16, 2023
1e377fc
Implement initial support for avro logical types (#6482) (#12788)
tpn Mar 16, 2023
3540613
Fix calculation of null counts for Parquet statistics (#12938)
etseidl Mar 17, 2023
8881cb6
Improve performance for replace-multi for long strings (#12858)
davidwendt Mar 17, 2023
d9e1b90
cudftestutil supports static gtest dependencies (#12957)
robertmaynard Mar 17, 2023
49e87b8
Declare a different name for nan_equality.UNEQUAL to prevent Cython w…
bdice Mar 17, 2023
caef9a6
Include gtest in build environment. (#12956)
vyasr Mar 17, 2023
8fbfb4a
Remove MANIFEST.in use auto-generated one for sdists and package_data…
vyasr Mar 17, 2023
1a98ada
Remove default detail mrs: part1 (#12964)
vyasr Mar 20, 2023
aeb05bf
Remove default detail mrs: part7 (#12970)
vyasr Mar 20, 2023
565efb7
Remove default detail mrs: part6 (#12969)
vyasr Mar 20, 2023
d171fda
Update to use rapids-export(COMPONENTS) feature. (#12959)
robertmaynard Mar 20, 2023
aff1c9f
Add optional text file support to ninja-log utility (#12823)
davidwendt Mar 20, 2023
5e1fae6
Fix fetching of MultiIndex values when a label is passed (#12521)
galipremsagar Mar 20, 2023
3b8064d
Remove manual artifact upload step in CI (#12869)
ajschmidt8 Mar 20, 2023
dc585de
Add skeleton API and prose documentation for dask-cudf (#12725)
wence- Mar 21, 2023
84ac1d5
Fix conda recipe post-link.sh typo (#12916)
pentschev Mar 21, 2023
6547d96
Remove default detail mrs: part5 (#12968)
vyasr Mar 21, 2023
17a2cdc
Refactor orc chunked writer (#12949)
ttnghia Mar 21, 2023
832dd27
Stop setting package version attribute in wheels (#12977)
vyasr Mar 21, 2023
f567cf5
Correctly handle scalar indices in `Index.__getitem__` (#12955)
wence- Mar 21, 2023
b2a65e8
Remove default detail mrs: part3 (#12966)
vyasr Mar 22, 2023
913302a
Update `tests/column_utilities` to use `experimental::equality` row c…
divyegala Mar 22, 2023
6e41db0
Add developer documentation forbidding default parameters in detail A…
vyasr Mar 22, 2023
bf18cea
Make timezone table independent from ORC (#12805)
vuule Mar 22, 2023
00c6000
Add `force_nullable_schema` parameter to Parquet writer. (#12952)
galipremsagar Mar 22, 2023
253f2ab
Use rapids-cmake parallel testing feature (#12451)
robertmaynard Mar 22, 2023
0d1fb96
Fix `sort_values` when column is all empty strings (#12988)
eriknw Mar 22, 2023
d9ad58c
Rework gtests SequenceTest to remove using namepace cudf (#12985)
davidwendt Mar 22, 2023
9753ace
Remove unused variable and fix memory issue in ORC writer (#12984)
ttnghia Mar 23, 2023
90d2cb1
Move detail reduction functions to cudf::reduction::detail namespace …
davidwendt Mar 23, 2023
6966fd5
Remove default detail mrs: part4 (#12967)
vyasr Mar 23, 2023
3a2609b
Fix `__setitem__` on string columns when the scalar value ends in a n…
wence- Mar 23, 2023
7456690
Implement `groupby.sample` (#12882)
wence- Mar 23, 2023
2818d45
Fix `find_common_dtype` and `values` to handle complex dtypes (#12537)
galipremsagar Mar 23, 2023
5cdb9d9
Pre-emptive fix for upstream `dask.dataframe.read_parquet` changes (#…
rjzamora Mar 23, 2023
4ab227d
Pin numba version (#13001)
vyasr Mar 23, 2023
dd5252b
Add JNI method for strings::replace multi variety (#12979)
NVnavkumar Mar 23, 2023
33e2387
Fix `DataFrame` constructor to broadcast scalar inputs properly (#12997)
galipremsagar Mar 23, 2023
fb96fc8
Fix `GroupBy.apply` doc examples rendering (#12994)
brandon-b-miller Mar 23, 2023
a0473cf
Cache JIT `GroupBy.apply` functions (#12802)
brandon-b-miller Mar 24, 2023
9fbc249
Remove default detail mrs: part2 (#12965)
vyasr Mar 24, 2023
4c4fdd2
Drop `force_nullable_schema` from chunked parquet writer (#12996)
galipremsagar Mar 24, 2023
ba8116a
Fix gtest column utility comparator diff reporting (#12995)
davidwendt Mar 24, 2023
698fcf6
Fix memcheck script to execute only _TEST files found in bin/gtests/l…
davidwendt Mar 24, 2023
12dc130
Rework some code logic to reduce iterator and comparator inlining to …
davidwendt Mar 27, 2023
bc7b89f
Handle index names while performing `groupby` (#12992)
galipremsagar Mar 27, 2023
766af66
add sphinx building and s3 uploading for dask-cudf docs (#12982)
quasiben Mar 27, 2023
173fde9
Add nunique aggregation support for cudf::segmented_reduce (#12972)
davidwendt Mar 27, 2023
2fd1ac8
Pin cupy version in wheels CI (#13041)
vyasr Mar 31, 2023
7268b5f
Pin dask and distributed for release
galipremsagar Apr 5, 2023
6563440
Fix a dask-cudf error
galipremsagar Apr 5, 2023
54e7889
Apply suggestions from code review
galipremsagar Apr 5, 2023
46a8016
Update python/dask_cudf/pyproject.toml
galipremsagar Apr 5, 2023
1d95f75
Update all_cuda-118_arch-x86_64.yaml
galipremsagar Apr 5, 2023
a3ed98a
Update python/dask_cudf/pyproject.toml
galipremsagar Apr 5, 2023
5179b8e
Update dependencies.yaml
galipremsagar Apr 5, 2023
241b560
Update conda/recipes/dask-cudf/run_test.sh
galipremsagar Apr 5, 2023
d1a0114
Update dependencies.yaml
galipremsagar Apr 5, 2023
2c3b2ab
Merge pull request #13070 from galipremsagar/pin_dask
jolorunyomi Apr 6, 2023
24 changes: 24 additions & 0 deletions .flake8
@@ -0,0 +1,24 @@
# Copyright (c) 2017-2023, NVIDIA CORPORATION.

[flake8]
filename = *.py, *.pyx, *.pxd, *.pxi
exclude = __init__.py, *.egg, build, docs, .git
force-check = True
ignore =
# line break before binary operator
W503,
# whitespace before :
E203
per-file-ignores =
# Rules ignored only in Cython:
# E211: whitespace before '(' (used in multi-line imports)
# E225: Missing whitespace around operators (breaks cython casting syntax like <int>)
# E226: Missing whitespace around arithmetic operators (breaks cython pointer syntax like int*)
# E227: Missing whitespace around bitwise or shift operator (Can also break casting syntax)
# E275: Missing whitespace after keyword (Doesn't work with Cython except?)
# E402: module level import not at top of file (works for Python, not Cython)
# E999: invalid syntax (works for Python, not Cython)
# W504: line break after binary operator (breaks lines that end with a pointer)
*.pyx: E211, E225, E226, E227, E275, E402, E999, W504
*.pxd: E211, E225, E226, E227, E275, E402, E999, W504
*.pxi: E211, E225, E226, E227, E275, E402, E999, W504
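
The split between the global `ignore` list and the Cython-only `per-file-ignores` can be modeled roughly as follows. This is a simplified sketch, not flake8's own matching code: the helper name `ignored_codes` is hypothetical, and it assumes plain glob matching on the file name.

```python
from fnmatch import fnmatch

# Codes ignored everywhere, taken from the `ignore =` entry in the config.
GLOBAL_IGNORES = {"W503", "E203"}

# Extra codes ignored only for Cython sources, from `per-file-ignores`.
CYTHON_IGNORES = {"E211", "E225", "E226", "E227", "E275", "E402", "E999", "W504"}
PER_FILE_IGNORES = {
    "*.pyx": CYTHON_IGNORES,
    "*.pxd": CYTHON_IGNORES,
    "*.pxi": CYTHON_IGNORES,
}


def ignored_codes(filename: str) -> set[str]:
    """Return the codes this simplified model would suppress for `filename`."""
    codes = set(GLOBAL_IGNORES)
    for pattern, extra in PER_FILE_IGNORES.items():
        # Hypothetical matching rule: a shell-style glob against the name.
        if fnmatch(filename, pattern):
            codes |= extra
    return codes
```

Under this model a `.py` file only gets the two global ignores, while `.pyx`, `.pxd`, and `.pxi` files also skip the eight Cython-specific codes.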
5 changes: 0 additions & 5 deletions .gitattributes

This file was deleted.

27 changes: 20 additions & 7 deletions .github/workflows/build.yaml
@@ -28,7 +28,7 @@ concurrency:
jobs:
cpp-build:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-build.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-build.yaml@branch-23.04
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
@@ -37,7 +37,7 @@ jobs:
python-build:
needs: [cpp-build]
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-build.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-build.yaml@branch-23.04
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
@@ -46,16 +46,27 @@ jobs:
upload-conda:
needs: [cpp-build, python-build]
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-upload-packages.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-upload-packages.yaml@branch-23.04
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
skip_upload_pkgs: libcudf-example
docs-build:
if: github.ref_type == 'branch' && github.event_name == 'push'
needs: python-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/[email protected]
with:
build_type: branch
node_type: "gpu-latest-1"
arch: "amd64"
container_image: "rapidsai/ci:latest"
run_script: "ci/build_docs.sh"
wheel-build-cudf:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-build.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-build.yml@branch-23.04
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
@@ -64,10 +75,11 @@ jobs:
package-name: cudf
package-dir: python/cudf
skbuild-configure-options: "-DCUDF_BUILD_WHEELS=ON -DDETECT_CONDA_ENV=OFF"
uses-setup-env-vars: false
wheel-publish-cudf:
needs: wheel-build-cudf
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-publish.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-publish.yml@branch-23.04
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
@@ -77,18 +89,19 @@ jobs:
wheel-build-dask-cudf:
needs: wheel-publish-cudf
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-build.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-build.yml@branch-23.04
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}
package-name: dask_cudf
package-dir: python/dask_cudf
uses-setup-env-vars: false
wheel-publish-dask-cudf:
needs: wheel-build-dask-cudf
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-publish.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-publish.yml@branch-23.04
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
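
Two expressions recur in this workflow: the `${{ inputs.build_type || 'branch' }}` fallback and the `if:` guard on `docs-build`. A minimal Python analogue of both, assuming the usual GitHub Actions rule that `||` yields the first truthy operand (the function names are hypothetical):

```python
def resolve_build_type(inputs: dict) -> str:
    """Mimic `inputs.build_type || 'branch'`: fall back when unset or empty."""
    return inputs.get("build_type") or "branch"


def docs_build_should_run(ref_type: str, event_name: str) -> bool:
    """Mimic the `docs-build` guard: run only for pushes to a branch."""
    return ref_type == "branch" and event_name == "push"
```

So a push-triggered run with no `build_type` input builds as `branch`, while a dispatched run that passes `build_type: nightly` keeps that value.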
53 changes: 34 additions & 19 deletions .github/workflows/pr.yaml
@@ -20,52 +20,55 @@ jobs:
- conda-python-other-tests
- conda-java-tests
- conda-notebook-tests
- docs-build
- wheel-build-cudf
- wheel-tests-cudf
- wheel-build-dask-cudf
- wheel-tests-dask-cudf
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/pr-builder.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/pr-builder.yaml@branch-23.04
checks:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/[email protected]
uses: rapidsai/shared-action-workflows/.github/workflows/[email protected]
with:
enable_check_generated_files: false
conda-cpp-build:
needs: checks
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-build.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-build.yaml@branch-23.04
with:
build_type: pull-request
conda-cpp-tests:
needs: conda-cpp-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-tests.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-tests.yaml@branch-23.04
with:
build_type: pull-request
conda-python-build:
needs: conda-cpp-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-build.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-build.yaml@branch-23.04
with:
build_type: pull-request
conda-python-cudf-tests:
needs: conda-python-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@branch-23.04
with:
build_type: pull-request
test_script: "ci/test_python_cudf.sh"
conda-python-other-tests:
# Tests for dask_cudf, custreamz, cudf_kafka are separated for CI parallelism
needs: conda-python-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@branch-23.04
with:
build_type: pull-request
test_script: "ci/test_python_other.sh"
conda-java-tests:
needs: conda-cpp-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@branch-23.04
with:
build_type: pull-request
node_type: "gpu-latest-1"
@@ -75,49 +78,61 @@ jobs:
conda-notebook-tests:
needs: conda-python-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@branch-23.04
with:
build_type: pull-request
node_type: "gpu-latest-1"
arch: "amd64"
container_image: "rapidsai/ci:latest"
run_script: "ci/test_notebooks.sh"
docs-build:
needs: conda-python-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/[email protected]
with:
build_type: pull-request
node_type: "gpu-latest-1"
arch: "amd64"
container_image: "rapidsai/ci:latest"
run_script: "ci/build_docs.sh"
wheel-build-cudf:
needs: checks
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-build.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-build.yml@branch-23.04
with:
build_type: pull-request
package-name: cudf
package-dir: python/cudf
skbuild-configure-options: "-DCUDF_BUILD_WHEELS=ON -DDETECT_CONDA_ENV=OFF"
uses-setup-env-vars: false
wheel-tests-cudf:
needs: wheel-build-cudf
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-test.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-test.yml@branch-23.04
with:
build_type: pull-request
package-name: cudf
# Install cupy-cuda11x for arm from a special index url
# Install tokenizers last binary wheel to avoid a Rust compile from the latest sdist
test-before-arm64: "pip install tokenizers==0.10.2 cupy-cuda11x -f https://pip.cupy.dev/aarch64"
test-unittest: "pytest -v -n 8 ./python/cudf/cudf/tests"
test-before-arm64: "python -m pip install cupy-cuda11x -f https://pip.cupy.dev/aarch64"
test-unittest: "python -m pytest -v -n 8 ./python/cudf/cudf/tests"
test-smoketest: "python ./ci/wheel_smoke_test_cudf.py"
wheel-build-dask-cudf:
needs: wheel-tests-cudf
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-build.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-build.yml@branch-23.04
with:
build_type: pull-request
package-name: dask_cudf
package-dir: python/dask_cudf
before-wheel: "RAPIDS_PY_WHEEL_NAME=cudf_cu11 rapids-download-wheels-from-s3 ./local-cudf && pip install --no-deps ./local-cudf/cudf*.whl"
before-wheel: "RAPIDS_PY_WHEEL_NAME=cudf_cu11 rapids-download-wheels-from-s3 ./local-cudf && python -m pip install --no-deps ./local-cudf/cudf*.whl"
uses-setup-env-vars: false
wheel-tests-dask-cudf:
needs: wheel-build-dask-cudf
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-test.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-test.yml@branch-23.04
with:
build_type: pull-request
package-name: dask_cudf
test-before: "RAPIDS_PY_WHEEL_NAME=cudf_cu11 rapids-download-wheels-from-s3 ./local-cudf-dep && pip install --no-deps ./local-cudf-dep/cudf*.whl"
test-unittest: "pytest -v -n 8 ./python/dask_cudf/dask_cudf/tests"
# Install the cudf we just built, and also test against latest dask/distributed/dask-cuda.
test-before: "RAPIDS_PY_WHEEL_NAME=cudf_cu11 rapids-download-wheels-from-s3 ./local-cudf-dep && python -m pip install --no-deps ./local-cudf-dep/cudf*.whl && pip install git+https://github.com/dask/dask.git@main git+https://github.com/dask/distributed.git@main git+https://github.com/rapidsai/[email protected]"
test-unittest: "python -m pytest -v -n 8 ./python/dask_cudf/dask_cudf/tests"
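
The `needs:` entries in this file define a dependency graph between jobs. As a rough illustration, the edges visible in the diff can be fed to Python's `graphlib` to get one valid execution order. The `NEEDS` mapping is transcribed from the hunks shown here and leaves out the aggregating job whose name falls outside the displayed context; `run_order` is a hypothetical helper.

```python
from graphlib import TopologicalSorter

# job -> jobs it waits for, as listed under `needs:` in the diff.
NEEDS = {
    "checks": [],
    "conda-cpp-build": ["checks"],
    "conda-cpp-tests": ["conda-cpp-build"],
    "conda-python-build": ["conda-cpp-build"],
    "conda-python-cudf-tests": ["conda-python-build"],
    "conda-python-other-tests": ["conda-python-build"],
    "conda-java-tests": ["conda-cpp-build"],
    "conda-notebook-tests": ["conda-python-build"],
    "docs-build": ["conda-python-build"],
    "wheel-build-cudf": ["checks"],
    "wheel-tests-cudf": ["wheel-build-cudf"],
    "wheel-build-dask-cudf": ["wheel-tests-cudf"],
    "wheel-tests-dask-cudf": ["wheel-build-dask-cudf"],
}


def run_order(needs: dict) -> list:
    """Return one ordering in which every job follows all jobs it needs."""
    return list(TopologicalSorter(needs).static_order())


if __name__ == "__main__":
    for job in run_order(NEEDS):
        print(job)
```

The ordering makes the two chains easy to see: the conda jobs fan out from `conda-cpp-build`, while the wheel jobs form a single line from `wheel-build-cudf` to `wheel-tests-dask-cudf`.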
34 changes: 24 additions & 10 deletions .github/workflows/test.yaml
@@ -16,15 +16,27 @@ on:
jobs:
conda-cpp-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-tests.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-tests.yaml@branch-23.04
with:
build_type: nightly
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
conda-cpp-memcheck-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/[email protected]
with:
build_type: nightly
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
node_type: "gpu-latest-1"
arch: "amd64"
container_image: "rapidsai/ci:latest"
run_script: "ci/test_cpp_memcheck.sh"
conda-python-cudf-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@branch-23.04
with:
build_type: nightly
branch: ${{ inputs.branch }}
@@ -34,7 +46,7 @@ jobs:
conda-python-other-tests:
# Tests for dask_cudf, custreamz, cudf_kafka are separated for CI parallelism
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@branch-23.04
with:
build_type: nightly
branch: ${{ inputs.branch }}
@@ -43,7 +55,7 @@ jobs:
test_script: "ci/test_python_other.sh"
conda-java-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@branch-23.04
with:
build_type: nightly
branch: ${{ inputs.branch }}
@@ -55,7 +67,7 @@ jobs:
run_script: "ci/test_java.sh"
conda-notebook-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@branch-23.04
with:
build_type: nightly
branch: ${{ inputs.branch }}
@@ -67,22 +79,24 @@ jobs:
run_script: "ci/test_notebooks.sh"
wheel-tests-cudf:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-test.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux-test.yml@branch-23.04
with:
build_type: nightly
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
package-name: cudf
test-before-arm64: "pip install tokenizers==0.10.2 cupy-cuda11x -f https://pip.cupy.dev/aarch64"
test-unittest: "pytest -v -n 8 ./python/cudf/cudf/tests"
test-before-arm64: "python -m pip install cupy-cuda11x -f https://pip.cupy.dev/aarch64"
test-unittest: "python -m pytest -v -n 8 ./python/cudf/cudf/tests"
wheel-tests-dask-cudf:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-test.yml@branch-23.02
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure-test.yml@branch-23.04
with:
build_type: nightly
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
package-name: dask_cudf
test-unittest: "pytest -v -n 8 ./python/dask_cudf/dask_cudf/tests"
# Test against latest dask/distributed/dask-cuda.
test-before: "pip install git+https://github.com/dask/dask.git@main git+https://github.com/dask/distributed.git@main git+https://github.com/rapidsai/[email protected]"
test-unittest: "python -m pytest -v -n 8 ./python/dask_cudf/dask_cudf/tests"
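
The test commands in these workflows move from bare `pip` and `pytest` to `python -m pip` and `python -m pytest`. The diff does not state a reason; one common motivation (an assumption here) is that `python -m` runs the tool under a specific interpreter instead of whichever script happens to be first on `PATH`. A small sketch of that idea, with `module_command` as a hypothetical helper:

```python
import subprocess
import sys


def module_command(module: str, *args: str) -> list[str]:
    """Build a `python -m <module> ...` command bound to this interpreter."""
    return [sys.executable, "-m", module, *args]


if __name__ == "__main__":
    # `platform` is a stdlib module that can be run with -m; it stands in
    # for pip or pytest so the sketch has no third-party requirements.
    result = subprocess.run(
        module_command("platform"), capture_output=True, text=True
    )
    print(result.stdout.strip())
```

Replacing `"platform"` with `"pytest"` and the test path gives the same shape of command that the workflow now uses.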
5 changes: 3 additions & 2 deletions .gitignore
@@ -36,8 +36,6 @@ python/cudf_kafka/*/_lib/**/*.cpp
python/cudf_kafka/*/_lib/**/*.h
python/custreamz/*/_lib/**/*.cpp
python/custreamz/*/_lib/**/*.h
python/strings_udf/strings_udf/_lib/*.cpp
python/strings_udf/strings_udf/*.ptx
.Python
env/
develop-eggs/
@@ -168,6 +166,9 @@ docs/cudf/source/api_docs/generated/*
docs/cudf/source/api_docs/api/*
docs/cudf/source/user_guide/example_output/*
docs/cudf/source/user_guide/cudf.*Dtype.*.rst
_html
_text
jupyter_execute

# cibuildwheel
/wheelhouse