[RELEASE] cudf v22.06 #10969

Merged
merged 281 commits into from
Jun 7, 2022
Changes from 250 commits
90882d3
Update pre-commit to run black 22.3.0 (#10523)
vyasr Mar 28, 2022
8103a91
add accidentally removed comment. (#10526)
vyasr Mar 28, 2022
7150106
Fix Series.str.findall behavior for expand=False. (#10459)
bdice Mar 28, 2022
19f324c
Update conda environment. (#10525)
vyasr Mar 28, 2022
62360cb
Fix temp data cleanup in `test_text.py` (#10524)
brandon-b-miller Mar 29, 2022
a0495f4
Deprecate `Series.applymap` (#10497)
brandon-b-miller Mar 29, 2022
b926f51
Consolidate C++ `conda` recipes and add `libcudf-tests` package (#10326)
ajschmidt8 Mar 29, 2022
cc986f7
Remove statically linked CUDA runtime check in Java build (#10532)
jlowe Mar 29, 2022
9a62a37
Merge pull request #10533 from rapidsai/branch-22.04
GPUtester Mar 29, 2022
bc6239b
Remove deprecated `decimal_cols_as_float` in the ORC reader (#10515)
vuule Mar 29, 2022
3f1d1ca
Merge pull request #10539 from rapidsai/branch-22.04
GPUtester Mar 30, 2022
2d8d913
Merge pull request #10546 from rapidsai/branch-22.04
GPUtester Mar 30, 2022
1f0967e
Remove Click pinnings that are unnecessary after upgrading black. (#1…
vyasr Mar 30, 2022
4f3ab29
Remove pip requirements files. (#10543)
bdice Mar 30, 2022
1355191
Refactor `memory_usage` to improve performance (#10537)
galipremsagar Mar 31, 2022
bc8f578
Adjust the valid range of group index for replace_with_backrefs (#10530)
sperlingxx Mar 31, 2022
73bc7d7
Merge branch-22.04 into branch-22.06
bdice Mar 31, 2022
4775f11
Merge pull request #10557 from bdice/branch-22.06-merge-22.04
sevagh Mar 31, 2022
d50d1ef
Re-enable Build Metrics Report (#10562)
davidwendt Mar 31, 2022
ee03c1a
Add Replace Backreferences section to Regex Features page (#10560)
davidwendt Apr 1, 2022
b614a9a
pin more cmake versions (#10570)
robertmaynard Apr 1, 2022
ca952f8
Update to Thrust 1.16 (#10489)
bdice Apr 1, 2022
f66c99a
Merge pull request #10575 from rapidsai/branch-22.04
GPUtester Apr 1, 2022
291fbcf
Upgrade `arrow-cpp` & `pyarrow` to `7.0.0` (#10503)
galipremsagar Apr 1, 2022
d7602c3
Update `Programming Language :: Python` Versions to 3.8 & 3.9 (#10579)
madsbk Apr 4, 2022
adec535
Fix doxygen Modules page for cudf::lists::sequences (#10561)
davidwendt Apr 4, 2022
fa0938f
Fix missing RMM_STATIC_CUDART define when compiling JNI with static C…
jlowe Apr 4, 2022
ff1ff80
Add patch for thrust-cub 1.16 to fix sort compile times (#10577)
davidwendt Apr 4, 2022
5d2e206
Remove ColumnBase.__getitem__ (#10516)
vyasr Apr 4, 2022
090f6b8
Revert Thrust 1.16 to Thrust 1.15 (#10586)
bdice Apr 5, 2022
0aef0c1
Refactor binary ops for timedelta and datetime columns (#10581)
vyasr Apr 5, 2022
e8d189c
Rename CUDA_TRY macro to CUDF_CUDA_TRY, rename CHECK_CUDA macro to CU…
bdice Apr 5, 2022
9ee7617
Add support for struct columns to the random table generator (#10566)
vuule Apr 5, 2022
6675c75
Merge branch-22.04 into branch-22.06
galipremsagar Apr 5, 2022
faff5de
Fix has_atomic_support check in can_use_hash_groupby() (#10588)
jbrennan333 Apr 5, 2022
3631860
Merge pull request #10598 from galipremsagar/branch-22.06-merge-22.04
ajschmidt8 Apr 5, 2022
f359ec7
Support nvComp 2.3 if local, otherwise use nvcomp 2.2 (#10513)
robertmaynard Apr 5, 2022
5f4f232
Enable building static libs (#10545)
trxcllnt Apr 5, 2022
956c7b5
Fix strings strip() to accept only str Scalar for to_strip parameter …
davidwendt Apr 6, 2022
261879f
Add default= kwarg to .list.get() accessor method (#10547)
shwina Apr 6, 2022
fb03c8b
Move binop methods from Frame to IndexedFrame and standardize the doc…
vyasr Apr 6, 2022
018924f
Verify compression type in Parquet reader (#10610)
vuule Apr 7, 2022
26c1810
Allow libcudfjni to be built as a static library (#10619)
jlowe Apr 7, 2022
c4a9b6a
Reduce kernel calls to build strings findall results (#10559)
davidwendt Apr 8, 2022
1cc3d8b
Update strings contains benchmark to measure varying match rates (#10…
davidwendt Apr 8, 2022
00fce5a
Branch 22.06 merge 22.04 (#10624)
vyasr Apr 8, 2022
50cabc9
Fix type-o in docstring for json_reader_options (#10627)
dagardner-nv Apr 8, 2022
35e25d8
Merge branch 'branch-22.04' into branch-22.06-merge-22.04_2
vyasr Apr 8, 2022
7643a0b
Merge pull request #10628 from vyasr/branch-22.06-merge-22.04_2
ajschmidt8 Apr 8, 2022
bc43e6a
Fix struct row comparator's exception on empty structs (#10604)
sperlingxx Apr 8, 2022
bf4ffc9
Add python bindings for cudf::list::index_of (#10549)
ChrisJar Apr 10, 2022
df6bd3c
Refactor cudf::strings::count_re API to use count_matches utility (#1…
davidwendt Apr 11, 2022
28aa895
Update guide to UDFs with notes about `Series.applymap` deprecation a…
brandon-b-miller Apr 11, 2022
97ac349
Unpin `dask` & `distributed` for development (#10623)
galipremsagar Apr 11, 2022
10b26b2
Merge pull request #10634 from rapidsai/branch-22.04
GPUtester Apr 11, 2022
c8ffece
Fix to_timestamps to support Z for %z format specifier (#10617)
davidwendt Apr 11, 2022
012af64
Slightly improve accuracy of stod in to_floats (#10622)
davidwendt Apr 11, 2022
09b080d
Automate Java cudf jar build with statically linked dependencies (#10…
gerashegalov Apr 11, 2022
3c13ef1
Document cudf.read_text and cudf.read_avro. (#10638)
bdice Apr 12, 2022
2348277
Enable passing a sequence for the `index` argument to `.list.get()` (…
shwina Apr 12, 2022
d6c15e4
Use cudf.read_json as documented API name. (#10640)
bdice Apr 12, 2022
9e8e92c
Fix docstring section headings. (#10639)
bdice Apr 12, 2022
8c825f5
Remove `concurrent_unordered_multimap`. (#10642)
bdice Apr 12, 2022
64a811e
Add missing APIs to documentation. (#10643)
bdice Apr 12, 2022
c9e16c7
Simplify preprocessing of arguments for DataFrame binops (#10563)
vyasr Apr 12, 2022
0ea6f8e
List element Equality comparator (#10289)
devavret Apr 13, 2022
dd7143a
Bump hadoop-common from 3.1.4 to 3.2.3 in /java (#10645)
dependabot[bot] Apr 13, 2022
c72868e
Remove implementation details from `apply` docstrings (#10651)
brandon-b-miller Apr 13, 2022
ce56bc3
Add `cudf.DataFrame.applymap` (#10542)
brandon-b-miller Apr 13, 2022
489e41f
Deprecate various functions that don't need to be defined for Index. …
vyasr Apr 13, 2022
03e84ef
Update pinning to allow newer CMake versions. (#10646)
vyasr Apr 14, 2022
22a6679
Improve cudf::cuda_error (#10630)
sperlingxx Apr 14, 2022
ac27757
Cleanup libcudf strings regex classes (#10573)
davidwendt Apr 14, 2022
f7c35d5
Add support for numeric_only in DataFrame._reduce (#10629)
martinfalisse Apr 14, 2022
77fa49e
Clean up C++ includes to use <> instead of "". (#10658)
bdice Apr 14, 2022
14a3261
Improve User Guide docs (#10663)
bdice Apr 14, 2022
6e6c325
Fix some docstrings formatting (#10660)
galipremsagar Apr 14, 2022
8f5a044
Add option to drop cache in cuIO benchmarks (#10488)
vuule Apr 15, 2022
b542678
cuco isn't a cudf dependency when we are built shared (#10662)
robertmaynard Apr 15, 2022
4e668f2
Update UDF notebook in User Guide. (#10668)
bdice Apr 15, 2022
9e1258d
Use `std::filesystem` for temporary directory location and deletion (…
vuule Apr 15, 2022
d5a982b
Add column field ID control in parquet writer (#10504)
PointKernel Apr 15, 2022
94a5d41
Add support for null and non-numeric types in Series.diff and DataFra…
Matt711 Apr 15, 2022
9409559
Rework JNI CMake to leverage rapids_find_package (#10649)
jlowe Apr 18, 2022
45c003d
Fix list of testing requirements in setup.py. (#10678)
bdice Apr 18, 2022
c322cba
Standardize imports. (#10680)
bdice Apr 18, 2022
6c79b59
Standardize usage of collections.abc. (#10679)
bdice Apr 18, 2022
17d49fa
Enable segmented_gather in Java package (#10669)
sperlingxx Apr 19, 2022
9dc728a
Use Lists of Columns for Various Files (#10463)
isVoid Apr 19, 2022
ba1173d
cleanup benchmark includes (#10661)
karthikeyann Apr 19, 2022
08cd428
Add device_memory_resource parameter to create_string_vector_from_col…
davidwendt Apr 19, 2022
565f474
Split up mixed-join kernels source files (#10671)
davidwendt Apr 19, 2022
304711a
Handle RuntimeError thrown by CUDA Python in `validate_setup` (#10653)
shwina Apr 19, 2022
31a5f44
Cython API Refactor: `transpose.pyx`, `sort.pyx` (#10675)
isVoid Apr 19, 2022
65b1cbd
add data generation to benchmark documentation (#10677)
karthikeyann Apr 19, 2022
017d52a
Improve parquet dictionary encoding (#10635)
PointKernel Apr 20, 2022
5f6b70a
Fix sphinx/jupyter heading issue in UDF notebook (#10690)
brandon-b-miller Apr 20, 2022
c8c7271
Additional refactoring of hash functions (#10462)
bdice Apr 20, 2022
01d08af
KvikIO as an alternative GDS backend (#10593)
madsbk Apr 20, 2022
d5e6941
Remove cudf::strings::string namespace (#10684)
davidwendt Apr 21, 2022
8a4d1b2
Introduce deprecation policy to developer guide. (#10252)
vyasr Apr 21, 2022
0a7c141
Support `args` in groupby apply (#10682)
brandon-b-miller Apr 21, 2022
070aef1
Add `.list.astype()` to cast list leaves to specified dtype (#10693)
shwina Apr 21, 2022
5053a1a
Remove or simplify various utility functions (#10705)
vyasr Apr 21, 2022
d6e3068
Cython API refactor: `merge.pyx` (#10698)
isVoid Apr 22, 2022
5264f95
Fix rounding to zero error in stod on very small float numbers (#10672)
davidwendt Apr 23, 2022
ae7e979
JNI: throw CUDA errors more specifically (#10551)
sperlingxx Apr 24, 2022
399c910
Allow building arrow with parquet and not python (#10702)
revans2 Apr 25, 2022
75f3873
Use upstream clang-format pre-commit hook. (#10659)
bdice Apr 25, 2022
8505a13
Fix `fillna` to retain `columns` when it is `MultiIndex` (#10729)
galipremsagar Apr 25, 2022
fdb1f47
Correct build dir `cudf-config` dependency issues for static builds (…
robertmaynard Apr 25, 2022
93070fa
Migrated user guide notebooks to MyST-NB and added sphinx extension (…
mmccarty Apr 26, 2022
a1c7cbe
Rename tests from multiIndex to multiindex. (#10732)
bdice Apr 26, 2022
62005f2
Retain series name in `Series.apply` (#10716)
brandon-b-miller Apr 26, 2022
57b9d0b
First step toward statistics in ORC files with chunked writes (#10567)
hyperbolic2346 Apr 26, 2022
cc0bf12
JNI: Add generateListOffsets API (#10683)
sperlingxx Apr 26, 2022
a914b58
Update cmake-format version. (#10440)
vyasr Apr 26, 2022
e2cd6b0
Fix random string data length to become variable (#10697)
galipremsagar Apr 26, 2022
420da0a
Missing `f` prefix on f-strings fix (#10721)
code-review-doctor Apr 26, 2022
1fdca07
Fix `cupy` function in notebook (#10737)
ajschmidt8 Apr 26, 2022
47740bc
Improve use of isinstance. (#10734)
bdice Apr 26, 2022
41dfdc2
Use canonical "magic methods" (replace `x.__repr__()` with `repr(x)`)…
bdice Apr 26, 2022
75a675b
Deprecate index merging (#10689)
vyasr Apr 26, 2022
dc1435b
Use structured bindings instead of std::tie (#10726)
karthikeyann Apr 27, 2022
09995a5
Add bindings for index_of with column search key (#10696)
ChrisJar Apr 27, 2022
1f8a03e
Replace std::make_pair with std::pair (C++17 CTAD) (#10727)
karthikeyann Apr 27, 2022
3d92bf2
Fix scatter for all-empty-string column case (#10724)
davidwendt Apr 27, 2022
f0b9117
Revise CONTRIBUTING.md (#10644)
bdice Apr 27, 2022
03d419d
Prepare dask_cudf test_parquet.py for upcoming API changes (#10709)
rjzamora Apr 28, 2022
a43fb9e
Implement DataFrame.eval using libcudf ASTs (#8022)
vyasr Apr 28, 2022
20569f6
Add `detail::hash_join` (#10695)
PointKernel Apr 28, 2022
9ac2477
Implement all methods of groupby rank aggregation in libcudf, python …
karthikeyann Apr 28, 2022
280acdf
Partial cuIO GPU decompression refactor (#10699)
vuule Apr 28, 2022
84f88ce
Support purging non-empty null elements from LIST/STRING columns (#10…
mythrocks Apr 29, 2022
3c208a6
Enable pydocstyle rules involving quotes (#10748)
vyasr Apr 29, 2022
15e4982
Enable pydocstyle for all packages. (#10759)
bdice Apr 29, 2022
3c4e72e
Add row hasher with nested column support (#10641)
devavret Apr 29, 2022
9b8d26f
Fix an issue with one_level_list schemas in parquet reader. (#10750)
nvdbaranec Apr 29, 2022
9112907
Support Segmented Min/Max Reduction on String Type (#10447)
isVoid Apr 29, 2022
bf10a94
Flush output streams before creating a process to drop caches (#10762)
vuule Apr 29, 2022
027c34a
Use generator expressions in any/all functions. (#10736)
bdice Apr 30, 2022
6128e0d
Use warp per string for long strings in cudf::strings::contains() (#1…
davidwendt May 2, 2022
0ddb3d9
Add row bitmask as a `detail::hash_join` member (#10248)
PointKernel May 2, 2022
a9eb47c
Deprecate `merge_sorted`, change dask cudf usage to internal method (…
isVoid May 2, 2022
0e32624
Add multiple rows to subword tokenizer benchmark (#10767)
davidwendt May 3, 2022
ad12606
Fix replace error when regex has only zero match quantifiers (#10760)
davidwendt May 3, 2022
8d861ce
Fixing deprecation warnings in test_orc.py (#10772)
hyperbolic2346 May 3, 2022
d3a39b3
Add struct utility functions. (#10776)
bdice May 4, 2022
0d11591
Use column_views instead of column_device_views in binary operations.…
bdice May 4, 2022
dd68db3
Reorganize cuDF Python docs (#10691)
shwina May 4, 2022
1a457ef
In-place updates with loc or iloc don't work correctly when the LHS h…
skirui-source May 4, 2022
14b5169
Enable ccache for cudfjni build in Docker (#10790)
gerashegalov May 5, 2022
d994905
Update `cuco` git tag (#10788)
PointKernel May 5, 2022
e52a1eb
Segmented `apply_boolean_mask` for `LIST` columns (#10773)
mythrocks May 5, 2022
ee26fbe
Optimize `left_semi_join` by materializing the gather mask (#10511)
cheinger May 5, 2022
4ce7b65
simplifying skiprows test in test_orc.py (#10783)
hyperbolic2346 May 5, 2022
d574c69
Persist string statistics data across multiple calls to orc chunked w…
hyperbolic2346 May 6, 2022
b12fd56
HostColumnVectoreCore#isNull should return true for out-of-range rows…
gerashegalov May 6, 2022
de0f7e0
Change stack-based regex state data to use global memory (#10600)
davidwendt May 6, 2022
a8f0976
Make the JNI API to get list offsets as a view public. (#10807)
revans2 May 6, 2022
4913a9b
Add NumPy to intersphinx references. (#10809)
bdice May 6, 2022
0fdb6dc
Use conda to build python packages during GPU tests (#10648)
jjacobelli May 9, 2022
6280ef0
Fix element access const correctness in `hostdevice_vector` (#10804)
vuule May 9, 2022
c4ed468
Return per-file metadata from readers (#10782)
vuule May 9, 2022
ac7492e
Upgrade `cudf` to support `pandas` 1.4.x versions (#10584)
galipremsagar May 9, 2022
566f29a
Add `max_file_size` parameter to chunked parquet dataset writer (#10718)
galipremsagar May 9, 2022
9def28c
Generic serialization of all column types (#10784)
wence- May 10, 2022
1519108
Mention 2 cpp-reviewer requirement in pull request template (#10768)
davidwendt May 10, 2022
19c5bad
Rework `Scalar` imports (#10791)
brandon-b-miller May 10, 2022
ee8cd59
Improve coverage of dask-cudf's groupby aggregation, add tests for `d…
charlesbluca May 10, 2022
0fcd364
Enable writing to `s3` storage in chunked parquet writer (#10769)
galipremsagar May 10, 2022
4539e5e
Refactor `cudf::contains`, renaming and switching parameters role (#1…
ttnghia May 10, 2022
dc0c3cd
Use `ThreadedMotoServer` instead of `subprocess` in spinning up `s3` …
galipremsagar May 10, 2022
366206d
Import `NA` from `missing` rather than using `cudf.NA` everywhere (#1…
brandon-b-miller May 10, 2022
2aaa863
Add cudf JNI docker build github action (#10806)
pxLi May 10, 2022
efd2c39
Refactor regex builtin character-class identifiers (#10814)
davidwendt May 11, 2022
2b204d0
Fix compile warning in search.cu (#10827)
davidwendt May 11, 2022
0cc29a0
Refactor binaryop/compiled/util.cpp (#10756)
bdice May 11, 2022
325fa77
Return weak orderings from `device_row_comparator`. (#10793)
rwlee May 11, 2022
1889133
Add missing cuda-python dependency to cudf (#10833)
bdice May 11, 2022
16d9a92
Add handling for string by-columns in dask-cudf groupby (#10830)
charlesbluca May 11, 2022
3e1a345
Add tests for null scalar binaryops (#10828)
brandon-b-miller May 11, 2022
e0d94f3
Add JNI support for apply_boolean_mask (#10812)
res-life May 12, 2022
1bb3aac
Adds the JNI call for Cuda.deviceSynchronize (#10839)
abellina May 12, 2022
fe9aaeb
Cleanup regex compile optimize functions (#10825)
davidwendt May 12, 2022
b64452a
Change pattern parameter for regex APIs from std::string to std::stri…
davidwendt May 13, 2022
4ad1e51
Allow string aggs for `dask_cudf.CudfDataFrameGroupBy.aggregate` (#10…
charlesbluca May 13, 2022
1302ec0
Split up search.cu to improve compile time (#10831)
davidwendt May 13, 2022
13c21d0
Fix constness / references in weak ordering operator() signatures. (#…
bdice May 13, 2022
0802451
Revise 10 minutes notebook. (#10738)
bdice May 13, 2022
6901b12
Suppress sizeof-array-div warnings in thrust found by gcc-11 (#10840)
robertmaynard May 13, 2022
ae2062e
Remove typo in ngram documentation (#10859)
miguelusque May 14, 2022
6591a6a
fix doxygen warnings (#10842)
karthikeyann May 14, 2022
e58d049
update mangle_dupe_cols behavior in csv reader to match pandas 1.4.0 …
karthikeyann May 16, 2022
9e004c3
More error checking in `from_dlpack` (#10850)
wence- May 16, 2022
d0d7193
Fix a bug in `distinct`: using nested nulls logic (#10848)
PointKernel May 16, 2022
09b7045
Add a section to the docs that compares cuDF with Pandas (#10796)
shwina May 16, 2022
db07df1
Add more unit tests for `cudf::distinct` for nested types with sliced…
ttnghia May 16, 2022
42438de
Changing `list_view.cuh` to `list_view.hpp` (#10854)
ttnghia May 16, 2022
712e77f
Refactor host decompression in ORC reader (#10764)
vuule May 16, 2022
df76c8c
Fix construction of nested structs with EMPTY child (#10761)
shwina May 16, 2022
6352b4e
spell check fixes (#10865)
karthikeyann May 16, 2022
80e4262
Fix dask-cudf groupby handling when grouping by all columns (#10866)
charlesbluca May 17, 2022
1dbd3ca
Fix some docs build warnings (#10674)
galipremsagar May 17, 2022
369d005
Support for Zstandard decompression in Parquet reader (#10847)
vuule May 17, 2022
dee435f
Segmented Min/Max for Fixed Point Types (#10794)
isVoid May 18, 2022
7f9d51b
Two-table comparators with strong index types (#10730)
bdice May 18, 2022
851418c
Rename `sliced_child` to `get_sliced_child`. (#10885)
bdice May 18, 2022
7b393a7
Add missing documentation in scalar/ headers (#10861)
karthikeyann May 18, 2022
ac77940
Fix segmented_reduce on empty column with non-empty offsets (#10876)
davidwendt May 18, 2022
c9bc82e
Remove C style artifacts in cuIO (#10886)
vuule May 19, 2022
54789ee
Strong index types for equality comparator (#10883)
ttnghia May 19, 2022
1db83e3
Change std::string parameters in cudf::strings APIs to std::string_vi…
davidwendt May 19, 2022
a808d9c
Update Dask + Pandas to Dask + cuDF path (#10897)
miguelusque May 19, 2022
c0895c1
Make cudf::test::expect_columns_equal() to fail when comparing unsani…
nvdbaranec May 19, 2022
6acf226
Use full name of GPUDirect Storage SDK in docs (#10904)
vuule May 19, 2022
b405154
Use conda compilers
jjacobelli Feb 12, 2022
bb76e71
Revert reordering in conda recipes
jjacobelli May 20, 2022
8634789
PR review
jjacobelli May 21, 2022
f10f380
Merge pull request #10275 from Ethyling/conda-comp
jjacobelli May 23, 2022
d1a0191
Add strong index iterator docs. (#10888)
bdice May 23, 2022
5067cc7
Fix an issue with reading raw string in `cudf.read_json` (#10924)
galipremsagar May 23, 2022
cee5cc9
Use `conda` compilers in env file (#10915)
galipremsagar May 23, 2022
a9220bd
Add a library_design.md file documenting the core Python data structu…
vyasr May 23, 2022
b26aaf7
Handle nested types in cudf::concatenate_rows() (#10890)
nvdbaranec May 23, 2022
35f9d5e
Support for Zstandard decompression in ORC reader (#10873)
vuule May 23, 2022
b053618
Clarify append deprecation notice. (#10930)
bdice May 23, 2022
da74744
Fix `gcc_linux` version pinning in dev environment (#10943)
galipremsagar May 24, 2022
9f06de6
Make SerializedTableHeader(numRows) public (#10949)
gerashegalov May 24, 2022
29f0b5a
Add missing documentation in cudf/types.hpp (#10895)
karthikeyann May 24, 2022
379cc9f
Add parameters to control page size in Parquet writer (#10882)
etseidl May 24, 2022
e096345
Replace defaulted stream value for libcudf APIs that use NVCOMP (#10877)
jbrennan333 May 24, 2022
f0d43e5
Use pre-built nvcomp 2.3 binaries by default (#10851)
robertmaynard May 25, 2022
4b95c4f
Fix single column `MultiIndex` issue in `sort_index` (#10957)
galipremsagar May 25, 2022
6a64ce1
Cleanup regex compiler fixed quantifiers source (#10843)
davidwendt May 25, 2022
dbd2b08
Update `groupby::hash` to use new row operators for keys (#10770)
PointKernel May 25, 2022
df5dc08
Removed `mr` parameter from inplace bitmask operations (#10805)
AtlantaPepsi May 25, 2022
31e1739
Enable Zstandard decompression only when all nvcomp integrations are …
vuule May 25, 2022
5165319
Handle closed property in IntervalDtype.from_pandas (#10798)
wence- May 25, 2022
62c4b58
[REVIEW] Pin `dask` & `distributed` for release (#10965)
galipremsagar May 25, 2022
a57d1e1
Fix the issue of empty lists having empty offsets (#10935)
devavret May 25, 2022
b4674a1
String support for jcudf row to cudf column conversion (#10871)
hyperbolic2346 May 25, 2022
fd5724f
Remove extra instance of list_view.hpp in meta.yaml. (#10981)
vyasr May 26, 2022
c6260eb
Use mambabuild to build conda packages
jjacobelli May 20, 2022
840c9d8
Merge pull request #10911 from Ethyling/mamba
jjacobelli May 27, 2022
0a34f67
Register `cudf.core.groupby.Grouper` objects to dask `grouper_dispatc…
brandon-b-miller May 27, 2022
dcb0470
pin dask (#10987)
galipremsagar May 27, 2022
d0b4e30
pin protobuf version (#10996)
galipremsagar May 31, 2022
82c062a
Fix a row index entry error in ORC writer issue (#10989) (#11014)
vuule May 31, 2022
9742260
update changelog
raydouglass Jun 7, 2022
4 changes: 4 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE.md
@@ -47,6 +47,10 @@ Here are some guidelines to help the review process go smoothly.
    If conflicts occur against the target branch they should be resolved by
    merging the target branch into the branch used for making the pull request.
 
+8. Pull requests that modify cpp source that are marked ready for review
+   will automatically be assigned two cudf-cpp-codeowners reviewers.
+   Ensure at least two approvals from cudf-cpp-codeowners before merging.
+
 Many thanks in advance for your cooperation!
 
 -->
53 changes: 53 additions & 0 deletions .github/workflows/jni-docker-build.yml
@@ -0,0 +1,53 @@
+# Copyright (c) 2022, NVIDIA CORPORATION.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+name: JNI Docker Build
+
+on:
+  workflow_dispatch: # manual trigger only
+
+concurrency:
+  group: jni-docker-build-${{ github.ref }}
+  cancel-in-progress: true
+
+jobs:
+  docker-build:
+    if: github.repository == 'rapidsai/cudf'
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v2
+
+      - name: Set up QEMU
+        uses: docker/setup-qemu-action@v2
+
+      - name: Set up Docker Buildx
+        uses: docker/setup-buildx-action@v2
+
+      - name: Login to DockerHub
+        uses: docker/login-action@v2
+        with:
+          username: ${{ secrets.GPUCIBOT_DOCKERHUB_USER }}
+          password: ${{ secrets.GPUCIBOT_DOCKERHUB_TOKEN }}
+
+      - name: Set ENVs
+        run: |
+          echo "IMAGE_NAME=rapidsai/cudf-jni-build" >> $GITHUB_ENV
+          echo "IMAGE_REF=${GITHUB_REF_NAME}" >> $GITHUB_ENV
+
+      - name: Build and Push
+        uses: docker/build-push-action@v3
+        with:
+          push: true
+          file: java/ci/Dockerfile.centos7
+          tags: "${{ env.IMAGE_NAME }}:${{ env.IMAGE_REF }}"
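Reviewer note: the last step pushes to a tag assembled from the two variables that the `Set ENVs` step appends to `$GITHUB_ENV`. The sketch below mirrors that composition in Python so the resulting tag is easy to eyeball. It is only an illustration: `image_tag` is a hypothetical helper that does not exist in the workflow, and `branch-22.06` is an assumed example value for `GITHUB_REF_NAME`.

```python
def image_tag(image_name: str, ref_name: str) -> str:
    """Mirror the workflow's `tags:` expression: "<IMAGE_NAME>:<IMAGE_REF>"."""
    return f"{image_name}:{ref_name}"


# Values the "Set ENVs" step would export. The ref name is an assumed example
# standing in for ${GITHUB_REF_NAME}; it is not taken from the PR.
IMAGE_NAME = "rapidsai/cudf-jni-build"
IMAGE_REF = "branch-22.06"

print(image_tag(IMAGE_NAME, IMAGE_REF))
```

Because the ref name is used verbatim, a ref containing characters that are not valid in a Docker tag (for example a `/`) would presumably need sanitizing before the push step; the workflow as written does not appear to do that.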
36 changes: 20 additions & 16 deletions .pre-commit-config.yaml
@@ -1,3 +1,5 @@
+# Copyright (c) 2019-2022, NVIDIA CORPORATION.
+
 repos:
   - repo: https://github.com/PyCQA/isort
     rev: 5.6.4
@@ -28,12 +30,10 @@ repos:
         args: ["--settings-path=python/dask_cudf/setup.cfg"]
         files: python/dask_cudf/.*
   - repo: https://github.com/psf/black
-    rev: 19.10b0
+    rev: 22.3.0
     hooks:
       - id: black
         files: python/.*
-        additional_dependencies:
-          - click==8.0.4
   - repo: https://github.com/PyCQA/flake8
     rev: 3.8.3
     hooks:
@@ -58,20 +58,24 @@ repos:
     hooks:
       - id: pydocstyle
         args: ["--config=python/.flake8"]
-  - repo: local
+        exclude: |
+          (?x)^(
+            ci|
+            cpp|
+            conda|
+            docs|
+            java|
+            notebooks
+          )
+  - repo: https://github.com/pre-commit/mirrors-clang-format
+    rev: v11.1.0
     hooks:
       - id: clang-format
-        # Using the pre-commit stage to simplify invocation of all
-        # other hooks simultaneously (via any other hook stage). This
-        # can be removed if we also move to running clang-format
-        # entirely through pre-commit.
-        stages: [commit]
-        name: clang-format
-        description: Format files with ClangFormat.
-        entry: clang-format -i
-        language: system
         files: \.(cu|cuh|h|hpp|cpp|inl)$
-        args: ['-fallback-style=none']
+        types_or: [file]
+        args: ['-fallback-style=none', '-style=file', '-i']
+  - repo: local
+    hooks:
       - id: cmake-format
         name: cmake-format
         entry: ./cpp/scripts/run-cmake-format.sh cmake-format
@@ -80,7 +84,7 @@ repos:
         # Note that pre-commit autoupdate does not update the versions
         # of dependencies, so we'll have to update this manually.
         additional_dependencies:
-          - cmake-format==0.6.11
+          - cmakelang==0.6.13
       - id: cmake-lint
         name: cmake-lint
         entry: ./cpp/scripts/run-cmake-format.sh cmake-lint
@@ -89,7 +93,7 @@ repos:
         # Note that pre-commit autoupdate does not update the versions
         # of dependencies, so we'll have to update this manually.
         additional_dependencies:
-          - cmake-format==0.6.11
+          - cmakelang==0.6.13
       - id: copyright-check
         name: copyright-check
         # This hook's use of Git tools appears to conflict with
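Reviewer note on the `exclude:` block in the `.pre-commit-config.yaml` diff above: the pattern uses the `(?x)` verbose flag, so its line breaks and indentation are ignored and it reduces to `^(ci|cpp|conda|docs|java|notebooks)`. A minimal Python sketch of how such a pattern filters paths follows. It assumes pre-commit applies `exclude` as a Python regular expression with `re.search` semantics; `is_excluded` and the sample paths are hypothetical and used only for illustration.

```python
import re

# Pattern copied from the `exclude:` block in the diff. (?x) enables verbose
# mode, so whitespace and newlines inside the pattern are not significant.
EXCLUDE = re.compile(r"""(?x)^(
    ci|
    cpp|
    conda|
    docs|
    java|
    notebooks
)""")


def is_excluded(path: str) -> bool:
    """Return True if a repo-relative path matches the exclude pattern."""
    # Assumption: the hook runner uses re.search, so ^ anchors the match to
    # the start of the path and only top-level directory prefixes match.
    return EXCLUDE.search(path) is not None


# Hypothetical sample paths, not taken from the PR.
for path in ("cpp/src/example.cu", "python/cudf/cudf/core/frame.py"):
    print(path, is_excluded(path))
```

Note that the alternatives are bare prefixes with no trailing `/`, so any top-level name that merely starts with one of them (for example a hypothetical `cpp_extras/`) would match as well.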
4 changes: 4 additions & 0 deletions CHANGELOG.md
@@ -1,3 +1,7 @@
+# cuDF 22.06.00 (Date TBD)
+
+Please see https://github.com/rapidsai/cudf/releases/tag/v22.06.00a for the latest changes to this development branch.
+
 # cuDF 22.04.00 (6 Apr 2022)
 
 ## 🚨 Breaking Changes