[RELEASE] cudf v21.12 #9689
Merged
Signed-off-by: Peixin Li <[email protected]> update cudf JNI version to 21.12.0 Authors: - Peixin (https://github.com/pxLi) Approvers: - Jason Lowe (https://github.com/jlowe) - Robert (Bobby) Evans (https://github.com/revans2) URL: #9248
Fix Automerger for `branch-21.12` from `branch-21.10`
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
This PR addresses issue #8608 by adding a deprecation warning before we remove the functionality entirely. Authors: - Mayank Anand (https://github.com/mayankanand007) Approvers: - Ashwin Srinath (https://github.com/shwina) URL: #9289
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
…g. (#9300) Currently mypy is set as a "local" hook in pre-commit, meaning that it uses `mypy` from the user's environment. I have mypy 0.910 installed (released Jun 22, 2021) locally and it fails with the following error:
```
mypy.....................................................................Failed
- hook id: mypy
- exit code: 2

python/cudf/cudf/tests/conftest.py: error: Duplicate module named "conftest" (also at "python/cudf/cudf/benchmarks/conftest.py")
python/cudf/cudf/tests/conftest.py: note: Are you missing an __init__.py? Alternatively, consider using --exclude to avoid checking one of them.
Found 1 error in 1 file (errors prevented further checking)
```
The conda environment pinning is [currently 0.782](https://github.com/rapidsai/cudf/blob/ba763105e006494a536c1a2fafc5112ab3dae362/conda/environments/cudf_dev_cuda11.2.yml#L39) (released Jun 23, 2020). Newer versions of mypy (around 0.900) have substantial behavior changes, so my newer version throws errors that are not seen in the older mypy that is pinned for CI checks. This PR changes the behavior of `.pre-commit-config.yaml` to use mypy from a mirror repository, at the same pinning used in the conda environment, instead of the local executable.
Authors: - Bradley Dice (https://github.com/bdice) Approvers: - Ashwin Srinath (https://github.com/shwina) URL: #9300
This PR fixes bug #9281. The issue is that we should be using `std::common_type_t` for explicit casting between different types to avoid overflow. That fix has been applied. Authors: - Conor Hoekstra (https://github.com/codereport) Approvers: - Jake Hemstad (https://github.com/jrhemstad) - Nghia Truong (https://github.com/ttnghia) - Robert Maynard (https://github.com/robertmaynard) - https://github.com/nvdbaranec - GALI PREM SAGAR (https://github.com/galipremsagar) URL: #9287
Fixes #9270 Authors: - Rong Ou (https://github.com/rongou) Approvers: - Jason Lowe (https://github.com/jlowe) - Alessandro Bellina (https://github.com/abellina) URL: #9311
This PR unpins `dask` and `distributed` in CI. Authors: - GALI PREM SAGAR (https://github.com/galipremsagar) Approvers: - Charles Blackmon-Luca (https://github.com/charlesbluca) - AJ Schmidt (https://github.com/ajschmidt8) - https://github.com/jakirkham URL: #9307
Depends on rapidsai/integration#352 Authors: - Devavret Makkar (https://github.com/devavret) Approvers: - AJ Schmidt (https://github.com/ajschmidt8) - https://github.com/nvdbaranec - GALI PREM SAGAR (https://github.com/galipremsagar) URL: #9181
…view (#9312)
Fixes #9140
- Added `shallow_hash(column_view)`
- Added unit tests
- SWIPAT approval complete

It computes hash values based on the shallow states of `column_view`: type, size, data pointer, null_mask pointer, offset, and the hash value of the children. `null_count` is not used since it is a cached value and it may vary based on contents of `null_mask`, and may be pre-computed or not.

Fixes #9139
- Added `is_shallow_equivalent(column_view, column_view)` ~shallow_equal~
- Added unit tests

It compares two column_views based on the shallow states of `column_view`: type, size, data pointer, null_mask pointer, offset, and the column_view of the children. `null_count` is not used since it is a cached value and it may vary based on contents of `null_mask`, and may be pre-computed or not.
Authors: - Karthikeyan (https://github.com/karthikeyann) Approvers: - Vyas Ramasubramani (https://github.com/vyasr) URL: #9312
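The shallow comparison described above can be sketched in Python. This is only an illustration of the idea, not libcudf's implementation: the `ColumnView` dataclass, its field names, and the helper `shallow_state` are hypothetical stand-ins for `cudf::column_view`, and the pointer values are made up.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class ColumnView:
    """Hypothetical stand-in for cudf::column_view (not the real class)."""
    dtype: str
    size: int
    data_ptr: int
    null_mask_ptr: int
    offset: int
    children: Tuple["ColumnView", ...] = ()
    null_count: Optional[int] = None  # cached value, deliberately ignored below


def shallow_state(col: ColumnView) -> tuple:
    # Only the shallow state takes part: no element data and no null_count.
    return (col.dtype, col.size, col.data_ptr, col.null_mask_ptr, col.offset)


def shallow_hash(col: ColumnView) -> int:
    # Combine the shallow state with the shallow hashes of the children.
    return hash(shallow_state(col) + tuple(shallow_hash(c) for c in col.children))


def is_shallow_equivalent(lhs: ColumnView, rhs: ColumnView) -> bool:
    # Same shallow state, same number of children, children pairwise equivalent.
    return (
        shallow_state(lhs) == shallow_state(rhs)
        and len(lhs.children) == len(rhs.children)
        and all(is_shallow_equivalent(a, b)
                for a, b in zip(lhs.children, rhs.children))
    )


# Two views over the same buffers; only the cached null_count differs.
a = ColumnView("int32", 4, 0x1000, 0x2000, 0, null_count=None)
b = ColumnView("int32", 4, 0x1000, 0x2000, 0, null_count=1)
# A view over a different data buffer.
c = ColumnView("int32", 4, 0x3000, 0x2000, 0)
```

In this sketch `a` and `b` are shallow-equivalent and hash identically even though one has a pre-computed `null_count` and the other does not, while `c` is not equivalent to `a` because its data pointer differs.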
This PR standardizes some more methods that were previously duplicated between different Frame classes into Frame. Many of the changes should be straightforward to review, with the main possible exception being the changes to serialization logic. Note that because the format of the serialized data has been standardized across different Frame types, objects serialized with old versions of cudf will be incompatible with the new format. I've included some backwards compatibility layers to be removed in a future release. Additionally, this PR adds some methods to Frame that are really only applicable to DataFrame and Series. Since some such methods already exist in Frame, I think it's fine to continue adding to those pending the eventual creation of a common IndexedFrame class. Authors: - Vyas Ramasubramani (https://github.com/vyasr) Approvers: - Ashwin Srinath (https://github.com/shwina) - GALI PREM SAGAR (https://github.com/galipremsagar) URL: #9305
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
Tiny fix to use an f-string in a warning message. Current output:
```
.../cudf/python/cudf/cudf/core/join/_join_helpers.py:126: UserWarning: Can't safely cast column from float32 to int8, upcasting to {common_type}.
```
Authors: - Bradley Dice (https://github.com/bdice) Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - https://github.com/brandon-b-miller URL: #9325
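A minimal sketch of the kind of bug this describes. The message text is taken from the output above, but the variable names and the value `"float32"` are placeholders, not the actual code in `_join_helpers.py`.

```python
# Placeholder value; the real common type is computed by cudf at runtime.
common_type = "float32"

# Without the f prefix, the braces are emitted literally.
broken = "Can't safely cast column from float32 to int8, upcasting to {common_type}."

# With the f prefix, the variable is interpolated into the message.
fixed = f"Can't safely cast column from float32 to int8, upcasting to {common_type}."

print(broken)
print(fixed)
```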
Some unused variable warnings are appearing in 11.4 builds from recent PR merges. Authors: - David Wendt (https://github.com/davidwendt) Approvers: - Vyas Ramasubramani (https://github.com/vyasr) - Nghia Truong (https://github.com/ttnghia) URL: #9326
Replaces C++ implementation of masked UDF pipeline with a pure python version which compiles and launches the entire kernel using numba. This solves a bunch of problems:
- CUDA 11.0 support is now available since the impl no longer needs `cuda::std::tuple` to work with NVRTC 11.0.
- Support for special functions which compile to multiple function definitions, such as `pow`, `sin`, and `cos` is now provided since all the PTX is compiled and linked inside numba (Fixes #8470)
- Allows us to support this corner case, which would require a separate C++ kernel in the previous implementation:
```python
def f(x):
    return 42
```
- Makes developing/adding features to the impl much easier

Authors: - https://github.com/brandon-b-miller Approvers: - Robert Maynard (https://github.com/robertmaynard) - GALI PREM SAGAR (https://github.com/galipremsagar) - Graham Markall (https://github.com/gmarkall) - Ashwin Srinath (https://github.com/shwina) URL: #9174
This PR fixes three small issues relating to uninitialized variables in Cython code. This removes warnings from the build log and fixes potential bugs.
- Parquet code had a boolean variable `is_range_index` that was not initialized if `json_str != ""`.
- Reduction scan type wasn't correctly initialized (`if True:` / `elif False:` pattern)
- Undesired comparison of a signed int and an unsigned value in a utility function.

Authors: - Bradley Dice (https://github.com/bdice) Approvers: - Ashwin Srinath (https://github.com/shwina) URL: #9327
[gpuCI] Forward-merge branch-21.10 to branch-21.12 [skip gpuci]
A similar fix for this problem was recently submitted in #9159 and closed in favor of #9203. It seems that the test added in the latter PR was not actually capturing the original problem. However, after [dask#8072](dask/dask#8072) is merged, the new test will certainly start failing. Authors: - Richard (Rick) Zamora (https://github.com/rjzamora) Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - Benjamin Zaitlen (https://github.com/quasiben) URL: #9314
Closes #8896 #4908. It creates a new `isocalendar` method in cudf python, and solves related datetime issues for obtaining week, year and day in isocalendar form. Authors: - Marlene (https://github.com/marlenezw) - GALI PREM SAGAR (https://github.com/galipremsagar) Approvers: - Ram (Ramakrishna Prabhu) (https://github.com/rgsl888prabhu) - GALI PREM SAGAR (https://github.com/galipremsagar) URL: #9169
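For readers unfamiliar with ISO calendar form, here is a small illustration using Python's standard-library `datetime.date.isocalendar()` rather than cudf itself. The source does not show the signature or return type of the new cudf method, so nothing here should be read as cudf's API.

```python
from datetime import date

# January 1, 2021 fell on a Friday that belongs to the last ISO week of 2020,
# so the ISO year differs from the calendar year.
year, week, day = date(2021, 1, 1).isocalendar()

print(year, week, day)  # 2020 53 5
```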
The behaviour of `cudf::lists::segmented_gather()` is currently undefined for any index value `i` that falls outside the range `[-n, n)`, where `n` is the number of elements in the list row. This commit adds support to explicitly specify an `out_of_bounds_policy`, like in `cudf::gather()`. The erstwhile behaviour is retained when the bounds policy is set to `DONT_CHECK`. If the bounds policy is specified as `NULLIFY`, then for any index falling outside the range `[-n, n)`, the list element is set to `null`. E.g.
```c++
auto source_column = [{"a", "b", "c", "d"}, {"1", "2", "3", "4"}, {"x", "y", "z"}];
auto gather_map = [{0, -1, 4, -5}, {1, 3, 5}, {}];
auto result = segmented_gather(source_column, gather_map, NULLIFY);
result == [{"a", "d", null, null}, {"2", "4", null}, {}];
```
Authors: - MithunR (https://github.com/mythrocks) Approvers: - Conor Hoekstra (https://github.com/codereport) - Karthikeyan (https://github.com/karthikeyann) - Nghia Truong (https://github.com/ttnghia) URL: #9318
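The `NULLIFY` semantics described above can be modelled on plain Python lists. This is a host-side sketch of the behaviour only: the function name `segmented_gather_nullify` is hypothetical, `None` stands in for `null`, and the real libcudf function operates on GPU list columns.

```python
from typing import List, Optional, Sequence


def segmented_gather_nullify(
    source: Sequence[Sequence[str]],
    gather_map: Sequence[Sequence[int]],
) -> List[List[Optional[str]]]:
    """Gather per row; any index outside [-n, n) yields None (null)."""
    result = []
    for row, indices in zip(source, gather_map):
        n = len(row)
        # Negative indices count from the end of the row, as in the example.
        result.append([row[i] if -n <= i < n else None for i in indices])
    return result


source_column = [["a", "b", "c", "d"], ["1", "2", "3", "4"], ["x", "y", "z"]]
gather_map = [[0, -1, 4, -5], [1, 3, 5], []]
result = segmented_gather_nullify(source_column, gather_map)

print(result)  # [['a', 'd', None, None], ['2', '4', None], []]
```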
This PR replaces the memory resource for temporaries in `reduction.cpp` with default memory resources. Authors: - Michael Wang (https://github.com/isVoid) Approvers: - David Wendt (https://github.com/davidwendt) - Bradley Dice (https://github.com/bdice) - MithunR (https://github.com/mythrocks) URL: #9344
harrism approved these changes Nov 18, 2021
ttnghia approved these changes Nov 18, 2021
Fix Changelog Merge Conflicts for `branch-21.12`
Now that we only do `11.5` builds for RAPIDS, the `DEFAULT_CUDA_VER` variable in `ci/cpu/prebuild.sh` should be set to `11.5` so that the rest of the logic in the file works correctly.
* pin max dask version
* Update conda/recipes/dask-cudf/meta.yaml
Co-authored-by: Bradley Dice <[email protected]>
* Apply suggestions from code review
Co-authored-by: jakirkham <[email protected]>
Co-authored-by: Bradley Dice <[email protected]>
Co-authored-by: jakirkham <[email protected]>
This PR adds the `arrow_dataset` and `parquet` targets to the build export when Arrow is built from source by CPM, similar to what we have to do today for `arrow` and `arrow_cuda` targets.
vyasr approved these changes Nov 29, 2021
CI runs all tests over a variety of different hardware. Tests that have to use NVRTC to re-compile cudf C++ code are only supported on CUDA 11.5+.
…_operation Remove jit code of binary operation
Signed-off-by: sperlingxx <[email protected]>
…#9816)
* move NullEquals to separate file
* To improve runtime performance move more binary_ops dispatch to host
* make sure to forceinline the operator_dispatcher
* Correct style issues found by ci
* Expand the binary-op compiled benchmark suite
* Ensure forceinline is on binary ops device dispatch functions
* Correct style issues found by ci
Co-authored-by: Karthikeyan Natarajan <[email protected]>
Co-authored-by: Karthikeyan <[email protected]>
* skip some parts of writing if no rows; test
* expand test
* Apply suggestions from code review
Co-authored-by: Conor Hoekstra <[email protected]>
* fix
* remove integer-suffix
Co-authored-by: Conor Hoekstra <[email protected]>
❄️ Code freeze for `branch-21.12` and v21.12 release

What does this mean?
Only critical/hotfix level issues should be merged into `branch-21.12` until release (merging of this PR).

What is the purpose of this PR?
`branch-21.12` into `main` for the release