Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge conflicts and address reviewer comments #1

Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
158 commits
Select commit Hold shift + click to select a range
be84225
Remove cuDF dependency from pylibcudf column from_device tests (#16441)
brandon-b-miller Jul 31, 2024
211dbe4
Merge pull request #16459 from rapidsai/branch-24.08
GPUtester Aug 1, 2024
9d0c57a
Add skiprows and nrows to parquet reader (#16214)
lithomas1 Aug 1, 2024
05745d0
Improve performance of hash_character_ngrams using warp-per-string ke…
davidwendt Aug 1, 2024
a8a3670
Move exception handler into pylibcudf from cudf (#16468)
lithomas1 Aug 1, 2024
cc19d8a
Use explicit construction of column subclass instead of `build_column…
mroeschke Aug 2, 2024
e0d1ac1
Fix typo in dispatch_row_equal. (#16473)
bdice Aug 2, 2024
af57286
Add missing pylibcudf strings docs (#16471)
brandon-b-miller Aug 5, 2024
837dfe5
Added batch memset to memset data and validity buffers in parquet rea…
sdrp713 Aug 5, 2024
8068a2d
Fix build failures with GCC 13 (#16488)
PointKernel Aug 5, 2024
e8156d4
Fix segmented-sort overlapped input/output indices (#16463)
davidwendt Aug 6, 2024
6b0bff4
Disallow cudf.Series to accept column in favor of `._from_column` (#1…
mroeschke Aug 7, 2024
d11d2cf
Merge pull request #16505 from rapidsai/branch-24.08
GPUtester Aug 7, 2024
3fd8783
Add `stream` param to stream compaction APIs (#16295)
JayjeetAtGithub Aug 7, 2024
b933b54
Use tool.scikit-build.cmake.version, set scikit-build-core minimum-ve…
jameslamb Aug 8, 2024
c146eed
Expose `stream` param in transform APIs (#16452)
JayjeetAtGithub Aug 8, 2024
a94512a
Add interop example for `arrow::StringViewArray` to `cudf::column` (#…
JayjeetAtGithub Aug 8, 2024
cc75b05
Change IPv4 convert APIs to support UINT32 instead of INT64 (#16489)
davidwendt Aug 8, 2024
da51cad
Improve update-version.sh (#16506)
bdice Aug 8, 2024
792dd06
Update pre-commit hooks (#16510)
KyleFromNVIDIA Aug 8, 2024
1bbe440
Add keep option to distinct nvbench (#16497)
bdice Aug 8, 2024
2c8de62
enable list to be forced as string in JSON reader. (#16472)
karthikeyann Aug 9, 2024
9ec34ad
Remove a deprecated multibyte_split API (#16501)
davidwendt Aug 9, 2024
8009dc8
Update docs of the TPC-H derived examples (#16423)
JayjeetAtGithub Aug 9, 2024
4446cf0
Update json normalization to take device_buffer (#16520)
karthikeyann Aug 9, 2024
16aa0ea
Allow DataFrame.sort_values(by=) to select an index level (#16519)
mroeschke Aug 9, 2024
4cd87d3
Fix `date_range(start, end, freq)` when end-start is divisible by fre…
mroeschke Aug 9, 2024
45b20d1
Preserve array name in MultiIndex.from_arrays (#16515)
mroeschke Aug 9, 2024
a3dc14f
Disallow indexing by selecting duplicate labels (#16514)
mroeschke Aug 12, 2024
091cb72
Remove deprecated public APIs from libcudf (#16524)
davidwendt Aug 12, 2024
cce00c0
Pass batch size to JSON reader using environment variable (#16502)
shrshi Aug 12, 2024
e5f8dd3
Update the java code to properly deal with lists being returned as st…
revans2 Aug 12, 2024
7178bf2
Rework cudf::io::text::byte_range_info class member functions (#16518)
davidwendt Aug 13, 2024
419fb99
Fix all-empty input column for strings split APIs (#16466)
davidwendt Aug 13, 2024
3a791cb
Remove unneeded pair-iterator benchmark (#16511)
davidwendt Aug 13, 2024
3801f81
Remove hardcoded versions from workflows. (#16540)
bdice Aug 13, 2024
5780c4d
Register `read_parquet` and `read_csv` with dask-expr (#16535)
rjzamora Aug 13, 2024
cf3fabf
Ensure comparisons with pyints and integer series always succeed (#16…
seberg Aug 13, 2024
1f0d0c9
Change cudf::empty_like to not include offsets for empty strings colu…
davidwendt Aug 14, 2024
c20d6b3
Remove unneeded output size parameter from internal count_matches uti…
davidwendt Aug 14, 2024
bf3372b
Switch python version to `3.10` in `cudf.pandas` pandas test scripts …
galipremsagar Aug 14, 2024
d684ae0
Raise NotImplementedError for Series.rename that's not a scalar (#16525)
mroeschke Aug 14, 2024
0253e97
[FEA] Support named aggregations in `df.groupby().agg()` (#16528)
Matt711 Aug 15, 2024
19846b6
Disallow cudf.Index accepting column in favor of ._from_column (#16549)
mroeschke Aug 15, 2024
89863a3
Align public utility function signatures with pandas 2.x (#16565)
mroeschke Aug 15, 2024
2bcb7ec
Fix `.replace(Index, Index)` raising a TypeError (#16513)
mroeschke Aug 15, 2024
ac42bc8
Hide all gtest symbols in cudftestutil (#16546)
robertmaynard Aug 15, 2024
ed31523
Merge branch 'branch-24.08' into branch-24.10-merge-24.08
bdice Aug 15, 2024
6912246
Merge pull request #16571 from bdice/branch-24.10-merge-24.08
AyodeAwe Aug 15, 2024
f4a9b1c
Use more idomatic cudf APIs in dask_cudf meta generation (#16487)
mroeschke Aug 15, 2024
1e220b7
Return Interval object in pandas compat mode for IntervalIndex reduct…
mroeschke Aug 15, 2024
5084135
Make NumericalColumn.__init__ strict (#16457)
mroeschke Aug 15, 2024
155edde
Make Timedelta/DatetimeColumn.__init__ strict (#16464)
mroeschke Aug 16, 2024
f955dd7
Rewrite remaining Python Arrow interop conversions using the C Data I…
vyasr Aug 16, 2024
1c63e1e
Initial investigation into NumPy proxying in `cudf.pandas` (#16286)
Matt711 Aug 16, 2024
e690d9d
Ensure size is always passed to NumericalColumn (#16576)
mroeschke Aug 16, 2024
e197d72
Replace `NativeFile` dependency in dask-cudf Parquet reader (#16569)
rjzamora Aug 16, 2024
623dfce
[FEA] Add support for `cudf.unique` (#16554)
Matt711 Aug 16, 2024
e16c2f2
Make (Indexed)Frame.__init__ require data (and index) (#16430)
mroeschke Aug 16, 2024
30011c5
Clean up reshaping ops (#16553)
mroeschke Aug 16, 2024
bc8ca9b
Setup pylibcudf package (#16299)
lithomas1 Aug 16, 2024
10cdd5f
Reenable arrow tests (#16556)
vyasr Aug 16, 2024
cb843db
Fix DataFrame reductions with median returning scalar instead of Seri…
mroeschke Aug 16, 2024
fd44adc
Make CategoricalColumn.__init__ strict (#16456)
mroeschke Aug 16, 2024
b63ba70
Add build job for pylibcudf (#16587)
vyasr Aug 17, 2024
dd2c12d
Revert "Make proxy NumPy arrays pass isinstance check in `cudf.pandas…
Matt711 Aug 17, 2024
592342c
Remove invalid column_view usage in string-scalar-to-column function …
davidwendt Aug 19, 2024
1b18cbc
Add `ToCudfBackend` expression to dask-cudf (#16573)
rjzamora Aug 19, 2024
0491778
MAINT: Adapt to numpy hiding flagsobject away (#16593)
seberg Aug 19, 2024
c516fc4
Make ListColumn.__init__ strict (#16465)
mroeschke Aug 19, 2024
074abcc
Add `public` qualifier for some member functions in Java class `Schem…
ttnghia Aug 19, 2024
79a5a97
Remove NativeFile support from cudf Python (#16589)
vyasr Aug 19, 2024
6ccc2c2
standardize and consolidate wheel installations in testing scripts (#…
jameslamb Aug 19, 2024
f2d13c9
make more use of YAML anchors in dependencies.yaml (#16597)
jameslamb Aug 19, 2024
3f6dd14
Make StructColumn.__init__ strict (#16467)
mroeschke Aug 19, 2024
a45af4a
Remove arrow_io_source (#16607)
vyasr Aug 20, 2024
3ac409d
Fix C++ and Cython io types (#16610)
vyasr Aug 20, 2024
2f7d354
bug-fix: cudf/io/json.hpp use after move (#16609)
NicolasDenoyelle Aug 20, 2024
1cccf3e
Replace usages of `thrust::optional` with `std::optional` (#15091)
miscco Aug 20, 2024
555734d
Remove thrust::optional from expression evaluator (#16604)
bdice Aug 20, 2024
b32bc10
do not install cudf in cudf_polars wheel tests (#16612)
jameslamb Aug 20, 2024
e450baf
remove streamz git dependency, standardize build dependency names, co…
jameslamb Aug 20, 2024
28fee97
Enable gtests previously disabled for compute-sanitizer bug (#16581)
davidwendt Aug 20, 2024
58799d6
Add stricter typing and validation to ColumnAccessor (#16602)
mroeschke Aug 20, 2024
8ab553c
Move libcudf reduction google-benchmarks to nvbench (#16564)
davidwendt Aug 21, 2024
6a2f323
Fix function parameters with common dependency modified during their …
ttnghia Aug 21, 2024
bf2ee32
DOC: Refresh pylibcudf guide (#15856)
lithomas1 Aug 21, 2024
6c4905d
Remove legacy Arrow interop APIs (#16590)
vyasr Aug 22, 2024
1fd9675
Fix overflow bug in low-memory JSON reader (#16632)
shrshi Aug 22, 2024
00ff2ee
[FEA] Add filesystem argument to `cudf.read_parquet` (#16577)
rjzamora Aug 22, 2024
81d71fc
update-version.sh fix (#16629)
AyodeAwe Aug 22, 2024
e4e867a
Annotate `ColumnAccessor._data` labels as `Hashable` (#16623)
mroeschke Aug 22, 2024
8b20298
Move pragma once in rolling/jit/operation.hpp. (#16636)
bdice Aug 22, 2024
eaefcb4
Support DecimalDtype meta in dask_cudf (#16634)
mroeschke Aug 22, 2024
83f68c9
Revert "Hide all gtest symbols in cudftestutil (#16546)" (#16644)
robertmaynard Aug 22, 2024
91f304e
Enable testing `cudf.pandas` unit tests for all minor versions of pan…
galipremsagar Aug 23, 2024
8d6b261
adding wheel build for libcudf (#15483)
msarahan Aug 23, 2024
a7ca3af
Add the missing `num_aggregations` axis for `groupby_max_cardinality`…
PointKernel Aug 23, 2024
7bd14a5
Add pylibcudf build dir in build.sh for `clean` (#16648)
galipremsagar Aug 23, 2024
7ca6a8c
fix libcudf wheel publishing, make package-type explicit in wheel pub…
jameslamb Aug 23, 2024
508bdea
Rebuild for & Support NumPy 2 (#16300)
jakirkham Aug 24, 2024
96f2cc5
Remove CUDA whole compilation ODR violations (#16603)
robertmaynard Aug 26, 2024
a250391
Revise `get_reader_filepath_or_buffer` to handle a list of data sourc…
rjzamora Aug 26, 2024
d15d470
Preserve Series name in duplicated method. (#16655)
bdice Aug 26, 2024
f511322
bug-fix: Don't enable the CUDA language if testing was requested when…
cryos Aug 26, 2024
c4591c0
Use non-mangled type names in nvbench output (#16649)
davidwendt Aug 27, 2024
115ddce
Fix integer overflow in indexalator pointer logic (#16643)
davidwendt Aug 27, 2024
efa9770
Drop Python 3.9 support (#16637)
jameslamb Aug 27, 2024
f1cc962
Fix `cudf::rank` not getting enough params (#16666)
JayjeetAtGithub Aug 27, 2024
2d494ed
Add `num_multiprocessors` utility (#16628)
PointKernel Aug 27, 2024
dd585e8
Prune workflows based on changed files (#16642)
KyleFromNVIDIA Aug 27, 2024
6747d2d
Update rapidsai/pre-commit-hooks (#16669)
KyleFromNVIDIA Aug 27, 2024
1a2aad2
Remove arrow dependency (#16640)
vyasr Aug 27, 2024
d0e5cdf
Allow for binops between two differently sized DecimalDtypes (#16638)
mroeschke Aug 27, 2024
88de8dd
Fix interval_range right child non-zero offset (#16651)
mroeschke Aug 27, 2024
e2a15cb
Fix strings::detail::copy_range when target contains nulls (#16626)
davidwendt Aug 27, 2024
d1412e0
Rework strings::slice benchmark to use nvbench (#16563)
davidwendt Aug 27, 2024
60f30d8
Use `make_host_vector` instead of `make_std_vector` to facilitate pin…
vuule Aug 28, 2024
1a96e4c
Fix loc/iloc.__setitem__[:, loc] with non cupy types (#16677)
mroeschke Aug 28, 2024
569939f
Fix slowdown in DataFrame repr in jupyter notebook (#16656)
galipremsagar Aug 28, 2024
5491b39
switch from typing.Callable to collections.abc.Callable (#16670)
jameslamb Aug 28, 2024
c600a65
Update documentation for Dask cuDF (#16671)
rjzamora Aug 28, 2024
872e01e
Fix slowdown in `CategoricalIndex.__repr__` (#16665)
galipremsagar Aug 28, 2024
dba6c1f
Remove build_categorical_column in favor of CategoricalColumn constru…
mroeschke Aug 28, 2024
925530a
Relax Arrow pin (#16681)
vyasr Aug 28, 2024
fbd6114
Support reading matching projected and filter cols from Parquet files…
mhaseeb123 Aug 28, 2024
9e9efcc
Replace raw device_memory_resource pointer in pylibcudf Cython (#16674)
harrism Aug 28, 2024
f6e2355
Handle `ordered` parameter in `CategoricalIndex.__repr__` (#16683)
galipremsagar Aug 29, 2024
f2d153b
Have interval_range use IntervalIndex.from_breaks, remove column_empt…
mroeschke Aug 29, 2024
eca5108
Disable gtests/ERROR_TEST during compute-sanitizer memcheck test (#16…
davidwendt Aug 29, 2024
21d05d7
Move apply_boolean_mask benchmark to nvbench (#16616)
davidwendt Aug 29, 2024
8c7af08
Increase timeouts for couple of tests (#16692)
galipremsagar Aug 29, 2024
53f488b
Add type annotations to Index classes, utilize _from_column more (#16…
mroeschke Aug 30, 2024
8f2d687
Refactor dictionary encoding in PQ writer to migrate to the new `cuco…
mhaseeb123 Aug 30, 2024
f932bf9
Fix Series.to_frame(name=None) setting a None name (#16698)
mroeschke Aug 30, 2024
62a53b3
[FEA] Add third-party library integration testing of cudf.pandas to c…
Matt711 Aug 30, 2024
23fb31e
Add a libcudf/thrust-based TPC-H derived datagen (#16294)
JayjeetAtGithub Aug 30, 2024
5a81a80
[BUG] Add gpu node type to cudf-pandas 3rd-party integration nightly …
Matt711 Aug 30, 2024
2d6758f
Enable batched multi-source reading of JSONL files with large records…
shrshi Aug 30, 2024
c6c720f
Implement exposed null mask APIs in pylibcudf (#15908)
charlesbluca Aug 30, 2024
5e420ff
Use merge base when calculating changed files (#16709)
KyleFromNVIDIA Aug 30, 2024
4ad4b23
remove some unnecessary libcudf nightly builds (#16714)
jameslamb Aug 31, 2024
7605958
Remove java ColumnView.copyWithBooleanColumnAsValidity (#16660)
revans2 Sep 1, 2024
557aabf
Ensure we pass the has_nulls tparam to mixed_join kernels (#16708)
abellina Sep 3, 2024
25779d9
Add boost-devel to Java CI Docker image (#16707)
jlowe Sep 3, 2024
0097b45
Fix typo in column_factories.hpp comment from 'depth 1' to 'depth 2' …
a-hirota Sep 3, 2024
e18b537
Use Series._from_column more consistently to avoid validation (#16716)
mroeschke Sep 3, 2024
a83ac6f
Add return type annotations to MultiIndex (#16696)
mroeschke Sep 3, 2024
fa1486e
Remove ERROR_TEST gtest from libcudf (#16722)
davidwendt Sep 3, 2024
26091a4
Refactor cudf pandas integration tests CI (#16728)
Matt711 Sep 4, 2024
1b6f02d
Multi-file and Parquet-aware prefetching from remote storage (#16657)
rjzamora Sep 4, 2024
ad1369d
CI: Test against old versions of key dependencies (#16570)
seberg Sep 4, 2024
e1ab1e7
Make isinstance check pass for proxy ndarrays (#16601)
Matt711 Sep 5, 2024
949f171
Performance improvement for strings::slice for wide strings (#16574)
davidwendt Sep 5, 2024
b62a786
Merge branch 'branch-24.10' into mixed-semi-join-refactor
mhaseeb123 Sep 5, 2024
8b10e65
Resolve merge conflicts
mhaseeb123 Sep 5, 2024
d49131d
Remove erroneous changelog
mhaseeb123 Sep 5, 2024
b8bf218
Address reviewer comments.
mhaseeb123 Sep 6, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
2 changes: 1 addition & 1 deletion .github/labeler.yml
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ cudf.polars:
- 'python/cudf_polars/**'

pylibcudf:
- 'python/cudf/cudf/_lib/pylibcudf/**'
- 'python/cudf/pylibcudf/**'

libcudf:
- 'cpp/**'
Expand Down
49 changes: 48 additions & 1 deletion .github/workflows/build.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -67,7 +67,51 @@ jobs:
node_type: "gpu-v100-latest-1"
run_script: "ci/build_docs.sh"
sha: ${{ inputs.sha }}
wheel-build-libcudf:
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
# build for every combination of arch and CUDA version, but only for the latest Python
matrix_filter: group_by([.ARCH, (.CUDA_VER|split(".")|map(tonumber)|.[0])]) | map(max_by(.PY_VER|split(".")|map(tonumber)))
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}
script: ci/build_wheel_libcudf.sh
wheel-publish-libcudf:
needs: wheel-build-libcudf
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}
package-name: libcudf
package-type: cpp
wheel-build-pylibcudf:
needs: [wheel-publish-libcudf]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}
script: ci/build_wheel_pylibcudf.sh
wheel-publish-pylibcudf:
needs: wheel-build-pylibcudf
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
build_type: ${{ inputs.build_type || 'branch' }}
branch: ${{ inputs.branch }}
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}
package-name: pylibcudf
package-type: python
wheel-build-cudf:
needs: wheel-publish-pylibcudf
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
Expand All @@ -86,6 +130,7 @@ jobs:
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}
package-name: cudf
package-type: python
wheel-build-dask-cudf:
needs: wheel-publish-cudf
secrets: inherit
Expand All @@ -108,8 +153,9 @@ jobs:
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}
package-name: dask_cudf
package-type: python
wheel-build-cudf-polars:
needs: wheel-publish-cudf
needs: wheel-publish-pylibcudf
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
Expand All @@ -130,6 +176,7 @@ jobs:
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}
package-name: cudf_polars
package-type: python
trigger-pandas-tests:
if: inputs.build_type == 'nightly'
needs: wheel-build-cudf
Expand Down
3 changes: 2 additions & 1 deletion .github/workflows/pandas-tests.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,8 @@ jobs:
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
matrix_filter: map(select(.ARCH == "amd64" and .PY_VER == "3.9" and (.CUDA_VER | startswith("12.5.")) ))
# This selects "ARCH=amd64 + the latest supported Python + CUDA".
matrix_filter: map(select(.ARCH == "amd64")) | group_by(.CUDA_VER|split(".")|map(tonumber)|.[0]) | map(max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))]))
build_type: nightly
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
Expand Down
121 changes: 108 additions & 13 deletions .github/workflows/pr.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,7 @@ concurrency:
jobs:
pr-builder:
needs:
- changed-files
- checks
- conda-cpp-build
- conda-cpp-checks
Expand All @@ -23,6 +24,8 @@ jobs:
- static-configure
- conda-notebook-tests
- docs-build
- wheel-build-libcudf
- wheel-build-pylibcudf
- wheel-build-cudf
- wheel-tests-cudf
- wheel-build-cudf-polars
Expand All @@ -35,6 +38,70 @@ jobs:
- pandas-tests-diff
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: always()
with:
needs: ${{ toJSON(needs) }}
changed-files:
runs-on: ubuntu-latest
name: "Check changed files"
outputs:
test_cpp: ${{ steps.changed-files.outputs.cpp_any_changed == 'true' }}
test_java: ${{ steps.changed-files.outputs.java_any_changed == 'true' }}
test_notebooks: ${{ steps.changed-files.outputs.notebooks_any_changed == 'true' }}
test_python: ${{ steps.changed-files.outputs.python_any_changed == 'true' }}
steps:
- name: Get PR info
id: get-pr-info
uses: rapidsai/shared-actions/get-pr-info@main
- name: Checkout code repo
uses: actions/checkout@v4
with:
fetch-depth: 0
persist-credentials: false
- name: Calculate merge base
id: calculate-merge-base
env:
PR_SHA: ${{ fromJSON(steps.get-pr-info.outputs.pr-info).head.sha }}
BASE_SHA: ${{ fromJSON(steps.get-pr-info.outputs.pr-info).base.sha }}
run: |
(echo -n "merge-base="; git merge-base "$BASE_SHA" "$PR_SHA") > "$GITHUB_OUTPUT"
- name: Get changed files
id: changed-files
uses: tj-actions/changed-files@v45
with:
base_sha: ${{ steps.calculate-merge-base.outputs.merge-base }}
sha: ${{ fromJSON(steps.get-pr-info.outputs.pr-info).head.sha }}
files_yaml: |
cpp:
- '**'
- '!CONTRIBUTING.md'
- '!README.md'
- '!docs/**'
- '!img/**'
- '!java/**'
- '!notebooks/**'
- '!python/**'
java:
- '**'
- '!CONTRIBUTING.md'
- '!README.md'
- '!docs/**'
- '!img/**'
- '!notebooks/**'
- '!python/**'
notebooks:
- '**'
- '!CONTRIBUTING.md'
- '!README.md'
- '!java/**'
python:
- '**'
- '!CONTRIBUTING.md'
- '!README.md'
- '!docs/**'
- '!img/**'
- '!java/**'
- '!notebooks/**'
checks:
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
Expand All @@ -54,9 +121,10 @@ jobs:
build_type: pull-request
enable_check_symbols: true
conda-cpp-tests:
needs: conda-cpp-build
needs: [conda-cpp-build, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_cpp == 'true'
with:
build_type: pull-request
conda-python-build:
Expand All @@ -66,24 +134,27 @@ jobs:
with:
build_type: pull-request
conda-python-cudf-tests:
needs: conda-python-build
needs: [conda-python-build, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_python == 'true'
with:
build_type: pull-request
script: "ci/test_python_cudf.sh"
conda-python-other-tests:
# Tests for dask_cudf, custreamz, cudf_kafka are separated for CI parallelism
needs: conda-python-build
needs: [conda-python-build, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_python == 'true'
with:
build_type: pull-request
script: "ci/test_python_other.sh"
conda-java-tests:
needs: conda-cpp-build
needs: [conda-cpp-build, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_java == 'true'
with:
build_type: pull-request
node_type: "gpu-v100-latest-1"
Expand All @@ -101,9 +172,10 @@ jobs:
container_image: "rapidsai/ci-wheel:latest"
run_script: "ci/configure_cpp_static.sh"
conda-notebook-tests:
needs: conda-python-build
needs: [conda-python-build, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_notebooks == 'true'
with:
build_type: pull-request
node_type: "gpu-v100-latest-1"
Expand All @@ -120,22 +192,39 @@ jobs:
arch: "amd64"
container_image: "rapidsai/ci-conda:latest"
run_script: "ci/build_docs.sh"
wheel-build-cudf:
wheel-build-libcudf:
needs: checks
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
# build for every combination of arch and CUDA version, but only for the latest Python
matrix_filter: group_by([.ARCH, (.CUDA_VER|split(".")|map(tonumber)|.[0])]) | map(max_by(.PY_VER|split(".")|map(tonumber)))
build_type: pull-request
script: "ci/build_wheel_libcudf.sh"
wheel-build-pylibcudf:
needs: [checks, wheel-build-libcudf]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
build_type: pull-request
script: "ci/build_wheel_pylibcudf.sh"
wheel-build-cudf:
needs: wheel-build-pylibcudf
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
build_type: pull-request
script: "ci/build_wheel_cudf.sh"
wheel-tests-cudf:
needs: wheel-build-cudf
needs: [wheel-build-cudf, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_python == 'true'
with:
build_type: pull-request
script: ci/test_wheel_cudf.sh
wheel-build-cudf-polars:
needs: wheel-build-cudf
needs: wheel-build-pylibcudf
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
Expand All @@ -144,9 +233,10 @@ jobs:
build_type: pull-request
script: "ci/build_wheel_cudf_polars.sh"
wheel-tests-cudf-polars:
needs: wheel-build-cudf-polars
needs: [wheel-build-cudf-polars, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_python == 'true'
with:
# This selects "ARCH=amd64 + the latest supported Python + CUDA".
matrix_filter: map(select(.ARCH == "amd64")) | group_by(.CUDA_VER|split(".")|map(tonumber)|.[0]) | map(max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))]))
Expand All @@ -164,9 +254,10 @@ jobs:
build_type: pull-request
script: "ci/build_wheel_dask_cudf.sh"
wheel-tests-dask-cudf:
needs: wheel-build-dask-cudf
needs: [wheel-build-dask-cudf, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_python == 'true'
with:
# This selects "ARCH=amd64 + the latest supported Python + CUDA".
matrix_filter: map(select(.ARCH == "amd64")) | group_by(.CUDA_VER|split(".")|map(tonumber)|.[0]) | map(max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))]))
Expand All @@ -183,20 +274,24 @@ jobs:
build-all -DBUILD_BENCHMARKS=ON --verbose;
sccache -s;
unit-tests-cudf-pandas:
needs: wheel-build-cudf
needs: [wheel-build-cudf, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_python == 'true'
with:
# This selects "ARCH=amd64 + the latest supported Python + CUDA".
matrix_filter: map(select(.ARCH == "amd64")) | group_by(.CUDA_VER|split(".")|map(tonumber)|.[0]) | map(max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))]))
build_type: pull-request
script: ci/cudf_pandas_scripts/run_tests.sh
pandas-tests:
# run the Pandas unit tests using PR branch
needs: wheel-build-cudf
needs: [wheel-build-cudf, changed-files]
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
if: needs.changed-files.outputs.test_python == 'true'
with:
matrix_filter: map(select(.ARCH == "amd64" and .PY_VER == "3.9" and (.CUDA_VER | startswith("12.5.")) ))
# This selects "ARCH=amd64 + the latest supported Python + CUDA".
matrix_filter: map(select(.ARCH == "amd64")) | group_by(.CUDA_VER|split(".")|map(tonumber)|.[0]) | map(max_by([(.PY_VER|split(".")|map(tonumber)), (.CUDA_VER|split(".")|map(tonumber))]))
build_type: pull-request
script: ci/cudf_pandas_scripts/pandas-tests/run.sh pr
# Hide test failures because they exceed the GITHUB_STEP_SUMMARY output limit.
Expand Down
12 changes: 12 additions & 0 deletions .github/workflows/test.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -124,3 +124,15 @@ jobs:
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
script: ci/cudf_pandas_scripts/run_tests.sh
third-party-integration-tests-cudf-pandas:
secrets: inherit
uses: rapidsai/shared-workflows/.github/workflows/[email protected]
with:
build_type: nightly
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
node_type: "gpu-v100-latest-1"
container_image: "rapidsai/ci-conda:latest"
run_script: |
ci/cudf_pandas_scripts/third-party-integration/test.sh python/cudf/cudf_pandas_tests/third_party_integration_tests/dependencies.yaml
6 changes: 6 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -79,6 +79,8 @@ Debug
build/
cpp/build/
cpp/examples/*/install/
cpp/examples/*/build/
cpp/examples/tpch/datagen/datafusion
cpp/include/cudf/ipc_generated/*.h
cpp/thirdparty/googletest/

Expand Down Expand Up @@ -176,3 +178,7 @@ jupyter_execute
# clang tooling
compile_commands.json
.clangd/

# pytest artifacts
rmm_log.txt
python/cudf/cudf_pandas_tests/data/rmm_log.txt
2 changes: 1 addition & 1 deletion .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -144,7 +144,7 @@ repos:
- id: ruff-format
files: python/.*$
- repo: https://github.com/rapidsai/pre-commit-hooks
rev: v0.2.0
rev: v0.4.0
hooks:
- id: verify-copyright
exclude: |
Expand Down
Loading
Loading