Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Initial round of updates #1

Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
385 commits
Select commit Hold shift + click to select a range
1effe19
Removing int8 column option from parquet byte_array writing (#11539)
hyperbolic2346 Oct 18, 2022
5d57159
Initial draft of policies and guidelines for libcudf usage. (#11853)
vyasr Oct 18, 2022
425fb02
Update flake8 to 5.0.4 and use flake8-force to check Cython. (#11736)
bdice Oct 18, 2022
6ca2ceb
Adds retryCount to RmmEventHandler.onAllocFailure (#11940)
abellina Oct 18, 2022
08e4ec2
Refactor pad/zfill functions for reuse with strings udf (#11914)
davidwendt Oct 19, 2022
08ffecc
Fix some gtests incorrectly coded in namespace cudf::test (part I) (#…
davidwendt Oct 19, 2022
416d4d5
Enable backend dispatching for Dask-DataFrame creation (#11920)
rjzamora Oct 20, 2022
ff41841
Remove validation that requires introspection (#11938)
vyasr Oct 20, 2022
536ddd0
Tell jitify_preprocess where to search for libnvrtc (#11787)
robertmaynard Oct 20, 2022
98185fe
Fix writing of Parquet files with many fragments (#11869)
etseidl Oct 20, 2022
ee9ffd0
Default to equal NaNs in make_collect_set_aggregation. (#11621)
bdice Oct 20, 2022
5803015
Rename libcudf++ to libcudf. (#11953)
bdice Oct 20, 2022
b9ba9e3
Update Unit Testing in libcudf guidelines to code tests outside the c…
davidwendt Oct 21, 2022
dec8bde
Add tests ensuring that cudf's default stream is always used (#11875)
vyasr Oct 21, 2022
9c06330
Accept const refs instead of const unique_ptr refs in reduce and scan…
vyasr Oct 21, 2022
7940b5b
Fix maximum page size estimate in Parquet writer (#11962)
vuule Oct 21, 2022
f1ab5e9
add V2 page header support to parquet reader (#11778)
etseidl Oct 21, 2022
5c2150e
Default to equal NaNs in make_merge_sets_aggregation. (#11952)
bdice Oct 21, 2022
5a190b9
Switch over to rapids-cmake patches for thrust (#11921)
robertmaynard Oct 24, 2022
4c0f2fd
Fix lists and structs gtests coded in namespace cudf::test (#11956)
davidwendt Oct 24, 2022
c806b10
Use gather-based strings factory in cudf::strings::strip (#11954)
davidwendt Oct 24, 2022
1e93af8
Add gpu memory watermark apis to JNI (#11950)
abellina Oct 24, 2022
11918ae
Add dtype docs pages and docstrings for `cudf` specific dtypes (#11974)
galipremsagar Oct 24, 2022
2ee41d0
Replace most of preprocessor usage in nvcomp adapter with `constexpr`…
vuule Oct 25, 2022
dc5924c
Add pool memory resource to libcudf basic example (#11966)
davidwendt Oct 25, 2022
2d89f43
Add missing noexcepts to column_in_metadata methods (#11973)
vyasr Oct 25, 2022
285cb9e
Replace default_stream_value with get_default_stream in docs. (#11985)
vyasr Oct 25, 2022
a37f27b
Ensure better compiler cache results between cudf cal-ver branches (#…
robertmaynard Oct 25, 2022
ffd130a
Remove stale labeler (#11995)
raydouglass Oct 25, 2022
6a5c77b
Minor cleanup of root CMakeLists.txt for better organization (#11988)
robertmaynard Oct 25, 2022
5bfc9a4
Move protobuf compilation to CMake (#11986)
vyasr Oct 25, 2022
6b9c026
Use rapids-cmake for google benchmark. (#11997)
vyasr Oct 25, 2022
b7d0115
Switch to DISABLE_DEPRECATION_WARNINGS to match other RAPIDS projects…
robertmaynard Oct 25, 2022
b89c0e2
Add inplace arithmetic operators to `MaskedType` (#11987)
brandon-b-miller Oct 26, 2022
c146d21
Revert "Replace most of preprocessor usage in nvcomp adapter with `co…
vuule Oct 26, 2022
fac35b4
Fix some libcudf calls to cudf::detail::gather (#11963)
davidwendt Oct 26, 2022
72572a8
Determine if Arrow has S3 support at runtime in unit test. (#11560)
bdice Oct 26, 2022
07eb723
Feature/remove default streams (#11967)
vyasr Oct 26, 2022
646a7e3
Fix doxygen text for cudf::dictionary::encode (#11991)
davidwendt Oct 26, 2022
cd21ce7
Remove unnecessary code from dask-cudf _Frame (#12001)
rjzamora Oct 27, 2022
8d49db5
Ignore python docs build artifacts (#12000)
galipremsagar Oct 27, 2022
b4ca894
Add `strip_delimiters` option to `read_text` (#11946)
upsj Oct 27, 2022
43eb7a0
Refactor multibyte_split `output_builder` (#11945)
upsj Oct 27, 2022
bac2004
Add pivot_table and crosstab to docs. (#12014)
bdice Oct 27, 2022
1b1ca7c
Provide `data_chunk_source` wrapper for `datasource` (#11886)
upsj Oct 27, 2022
f17ea94
Fix bug where `df.loc` resulting in single row could give wrong index…
eriknw Oct 27, 2022
69fac8a
Remove unused `managed_allocator` (#12005)
vyasr Oct 27, 2022
1017045
Add DataFrame.pivot_table. (#12015)
bdice Oct 28, 2022
ee53458
New GHA to add issues/prs to project board (#12016)
jarmak-nv Oct 28, 2022
c915523
Add deprecation warning for set_allocator. (#11958)
vyasr Oct 28, 2022
aaf251d
Performance improvement in JSON Tree traversal (#11919)
karthikeyann Oct 28, 2022
7620fb1
Add method argument to DataFrame.quantile (#11957)
rjzamora Oct 28, 2022
0603167
Add cython-lint to pre-commit checks. (#12020)
bdice Oct 28, 2022
1c057bc
Use pragma once (#12019)
bdice Oct 31, 2022
f0b4c4f
Pass column names to `write_csv` instead of `table_metadata` pointer …
vuule Oct 31, 2022
a5aaa52
Remove default parameters for cudf::dictionary::detail functions (#12…
davidwendt Nov 1, 2022
991c86b
Remove default parameters for nvtext::detail functions (#12007)
davidwendt Nov 1, 2022
7af461c
Update cuda-python dependency to 11.7.1 (#12030)
galipremsagar Nov 1, 2022
d236779
Reduce/Remove reliance on `**kwargs` and `*args` in `IO` readers & wr…
galipremsagar Nov 1, 2022
41fca6e
Add `read_orc_metadata` to libcudf (#11815)
vuule Nov 1, 2022
2fe06bc
Leverage rapids_cython for more automated RPATH handling (#11996)
vyasr Nov 1, 2022
80c238c
Fix black exclusions. (#12036)
bdice Nov 1, 2022
f19bdbc
Remove smart quotes from all docstrings. (#12035)
bdice Nov 1, 2022
f3bf872
Merge branch 'branch-22.10' into branch-22.12-merge-22.10
vyasr Nov 1, 2022
1c2ad6a
Fix Parquet support for seconds and milliseconds duration types (#11854)
vuule Nov 1, 2022
c04dbef
Merge pull request #12045 from vyasr/branch-22.12-merge-22.10
msadang Nov 1, 2022
ac3f205
Port thrust's pinned_allocator to cudf, since Thrust 1.17 removes the…
robertmaynard Nov 1, 2022
03034af
Standardize newlines at ends of files. (#12042)
bdice Nov 1, 2022
a20bbfb
Trim trailing whitespace from all files. (#12041)
bdice Nov 2, 2022
5ace809
Add strings udf C++ classes and functions for phase II (#11912)
davidwendt Nov 2, 2022
d6a9e4a
Rollback of `DeviceBufferLike` (#12009)
madsbk Nov 2, 2022
a3d2276
Fixes bug in csv_reader_options construction in cython (#12021)
karthikeyann Nov 2, 2022
49fc3c7
Enable CEC for `strings_udf` (#11884)
brandon-b-miller Nov 2, 2022
856ac3f
Add full page indexes to Parquet writer benchmarks (#11955)
etseidl Nov 2, 2022
d949cd2
Make all `nvcc` warnings into errors (#8916)
trxcllnt Nov 2, 2022
eaa0706
Add developer docs for writing tests (#11199)
vyasr Nov 3, 2022
e402448
Trim quotes for non-string values in nested json parsing (#11898)
karthikeyann Nov 3, 2022
baa645d
Add strings `like` jni and native method (#12032)
cindyyuanjiang Nov 3, 2022
b156c25
Add `memory_usage` & `items` implementation for `Struct` column & dty…
galipremsagar Nov 3, 2022
765279d
[HOTFIX] Update cuda-python dependency to 11.7.1 (#11994)
shwina Nov 3, 2022
ad1ba13
Force using old fmt in nvbench. (#12064)
vyasr Nov 3, 2022
2a58ff6
Force using old fmt in nvbench. (#12067)
vyasr Nov 4, 2022
1d6931a
Allow falling back to `shim_60.ptx` by default in `strings_udf` (#12056)
brandon-b-miller Nov 4, 2022
0278485
Remove default parameters for cudf::strings::detail functions (#12003)
davidwendt Nov 4, 2022
b1c2520
Remove overflow error during decimal binops (#12063)
galipremsagar Nov 4, 2022
e788f36
Fixes List offset bug in Nested JSON reader (#12060)
karthikeyann Nov 4, 2022
a3e9c1c
Mark nvcomp zstd compression stable (#12059)
jbrennan333 Nov 4, 2022
6e13139
Add debug-only onAllocated/onDeallocated to RmmEventHandler (#12054)
abellina Nov 4, 2022
9df2eba
Adding feature Truncate to DataFrame and Series (#11435)
VamsiTallam95 Nov 4, 2022
11b875b
Fix type casting in Series.__setitem__ (#11904)
wence- Nov 4, 2022
52dbb63
Fix link to c++ developer guide from `CONTRIBUTING.md` (#12084)
brandon-b-miller Nov 7, 2022
262631b
Fix ingest_raw_data performance issue in Nested JSON reader due to RV…
karthikeyann Nov 7, 2022
17b6b2e
Add checks for HLG layers in dask-cudf groupby tests (#10853)
charlesbluca Nov 7, 2022
f9a2512
Fix quantile gtests coded in namespace cudf::test (#12049)
davidwendt Nov 7, 2022
a72627a
Throw an error when libcudf is built without cuFile and `LIBCUDF_CUFI…
vuule Nov 7, 2022
ec46e7f
Move and update `dask` nigthly install in CI (#12082)
galipremsagar Nov 7, 2022
2ced214
Use nosync policy in gather and scatter implementations. (#12038)
bdice Nov 7, 2022
b16b4ff
Remove macros that inspect the contents of exceptions (#12076)
vyasr Nov 8, 2022
35077f5
Enable returning string data from UDFs used through `apply` (#11933)
brandon-b-miller Nov 8, 2022
c900fed
Bifurcate Dependency Lists [skip-gpuci] (#11674)
bdice Nov 8, 2022
8ee5f51
Enable building against the libarrow contained in pyarrow (#12034)
vyasr Nov 8, 2022
7535f31
Remove CUDA 10 compatibility code. (#12088)
bdice Nov 8, 2022
628cd4f
Change cudf::detail::tdigest to cudf::tdigest::detail (#12050)
davidwendt Nov 9, 2022
74053f4
Add regex_program class for use with all regex APIs (#11927)
davidwendt Nov 9, 2022
a2c428c
Fix an error in IO with `GzipFile` type (#12085)
galipremsagar Nov 9, 2022
26d449c
Update Numba docs links. (#12107)
bdice Nov 9, 2022
fbac4b4
Add `truncate` API to python doc pages (#12109)
galipremsagar Nov 9, 2022
6f78e74
Expose engine argument in dask_cudf.read_json (#12101)
rjzamora Nov 9, 2022
4de279d
Fix reading of CSV files with blank second row (#12098)
vuule Nov 9, 2022
59bd5c3
Support `strip`, `lstrip`, and `rstrip` in `strings_udf` (#12091)
brandon-b-miller Nov 10, 2022
4497ed6
Workaround groupby aggregate thrust::copy_if overflow (#12079)
davidwendt Nov 10, 2022
8ca2bd9
First pass of `pd.read_orc` changes in tests (#12103)
galipremsagar Nov 10, 2022
b3429fb
Remove "Multi-GPU with Dask-cuDF" notebook. (#12095)
bdice Nov 10, 2022
480b4cc
DOC
raydouglass Nov 10, 2022
b30664b
Fix conditional_full_join benchmark (#12121)
Nov 10, 2022
7f2a471
Fix regex working-memory-size refactor error (#12119)
davidwendt Nov 10, 2022
70c7b7a
Refactor Parquet reader (#12046)
ttnghia Nov 10, 2022
f87d2b4
Add symlinks to notebooks. (#12128)
bdice Nov 11, 2022
3894427
Add JNI for `substring` without 'end' parameter. (#12113)
firestarman Nov 11, 2022
d335aa3
Fix alignment of compressed blocks in ORC writer (#12077)
vuule Nov 11, 2022
8668752
Adds an EventHandler to Java MemoryBuffer to be invoked on close (#12…
abellina Nov 11, 2022
825f049
Fix singleton-range `__setitem__` edge case (#12075)
wence- Nov 14, 2022
5081fb1
Enable automatic column projection in groupby().agg (#12124)
rjzamora Nov 14, 2022
b20a6e6
Add support for `DataFrame.from_dict`\`to_dict` and `Series.to_dict` …
galipremsagar Nov 14, 2022
b2e5069
Create an `int8` column in `read_csv` when all elements are missing (…
vuule Nov 15, 2022
1efde6a
Merge pull request #12146 from rapidsai/branch-22.12
GPUtester Nov 15, 2022
fd488cd
Cleanup common parsing code in JSON, CSV reader (#12022)
karthikeyann Nov 15, 2022
b1f38f8
Merge pull request #12147 from rapidsai/branch-22.12
GPUtester Nov 15, 2022
bae9e39
Fix/disable jitify lto (#12122)
robertmaynard Nov 15, 2022
c0a1de3
Merge pull request #12151 from rapidsai/branch-22.12
GPUtester Nov 15, 2022
186e129
Add in negative size checks for columns (#12118)
revans2 Nov 15, 2022
4b7f5a7
Safely allocate `udf_string` pointers in `strings_udf` (#12138)
brandon-b-miller Nov 15, 2022
98880d2
Update cp.clip call (#12148)
quasiben Nov 15, 2022
215b809
Merge pull request #12152 from rapidsai/branch-22.12
GPUtester Nov 15, 2022
90f0a77
Accelerate libcudf segmented sort with CUB segmented sort (#11969)
davidwendt Nov 15, 2022
3038b37
Merge pull request #12153 from rapidsai/branch-22.12
GPUtester Nov 15, 2022
414140b
check number of rows on empty data
vuule Nov 16, 2022
c574ddf
Fix decimal binary operations (#12142)
galipremsagar Nov 16, 2022
b51af32
Merge pull request #12161 from rapidsai/branch-22.12
GPUtester Nov 16, 2022
a8c0f4b
Fix type promotion edge cases in numerical binops (#12074)
wence- Nov 16, 2022
520c0e0
Merge pull request #12163 from rapidsai/branch-22.12
GPUtester Nov 16, 2022
742093e
Support `+` in `strings_udf` (#12117)
brandon-b-miller Nov 16, 2022
6ad5752
Use rapidsai CODE_OF_CONDUCT.md (#12166)
bdice Nov 16, 2022
e27c102
Merge pull request #12169 from rapidsai/branch-22.12
GPUtester Nov 16, 2022
defad5e
byte_range support for JSON Lines format (#12017)
karthikeyann Nov 16, 2022
5741306
Merge pull request #12171 from rapidsai/branch-22.12
GPUtester Nov 16, 2022
afb3c97
Support nested types as groupby keys in libcudf (#11792)
PointKernel Nov 16, 2022
c19eb5c
Merge pull request #12176 from rapidsai/branch-22.12
GPUtester Nov 16, 2022
95a348b
Spilling to host memory (#12106)
madsbk Nov 16, 2022
5d0d8ac
Merge pull request #12177 from rapidsai/branch-22.12
GPUtester Nov 16, 2022
73d73a7
Refactor `purge_nonempty_nulls` (#12111)
ttnghia Nov 16, 2022
c993fd8
Merge pull request #12178 from rapidsai/branch-22.12
GPUtester Nov 16, 2022
ae101cc
Don't rely on GNU find in headers_test.sh (#12164)
wence- Nov 16, 2022
9291263
Merge pull request #12179 from rapidsai/branch-22.12
GPUtester Nov 16, 2022
ce97a54
Merge branch 'branch-22.12' of https://github.com/rapidsai/cudf into …
vuule Nov 17, 2022
6de2c4e
Fix issues when both `usecols` and `names` options are used in `read_…
vuule Nov 17, 2022
bee8bfd
Merge pull request #12183 from rapidsai/branch-22.12
GPUtester Nov 17, 2022
aa13b95
Support `upper` and `lower` in `strings_udf` (#12099)
brandon-b-miller Nov 17, 2022
d3f296d
Merge pull request #12184 from rapidsai/branch-22.12
GPUtester Nov 17, 2022
2f2685f
Allow setting malloc heap size in string udfs (#12094)
brandon-b-miller Nov 17, 2022
e45198e
Merge pull request #12189 from rapidsai/branch-22.12
GPUtester Nov 17, 2022
db0d045
Ensure dlpack include is provided to cudf interop lib (#12139)
robertmaynard Nov 17, 2022
b912dbf
Merge pull request #12192 from rapidsai/branch-22.12
GPUtester Nov 17, 2022
ec8888c
fix selection of original vs compressed blocks, padding
vuule Nov 18, 2022
e29ea84
style
vuule Nov 18, 2022
3fb09d1
Implement chunked Parquet reader (#11867)
ttnghia Nov 18, 2022
6d2a4f0
Add wheel builds (#12096)
vyasr Nov 18, 2022
12f7fe1
Merge pull request #12195 from rapidsai/branch-22.12
GPUtester Nov 18, 2022
cc4b4dd
Don't use CMake 3.25.0 as it has a show stopping FindCUDAToolkit bug …
robertmaynard Nov 18, 2022
075976a
Merge pull request #12196 from rapidsai/branch-22.12
GPUtester Nov 18, 2022
30bc05c
Merge branch 'branch-22.12' of https://github.com/rapidsai/cudf into …
vuule Nov 18, 2022
cbd07a5
Merge branch-22.10 into branch-22.12
davidwendt Nov 18, 2022
2af481c
Update JNI version to 23.02.0-SNAPSHOT (#12129)
pxLi Nov 18, 2022
85a82ac
Clean up existing JNI scalar to column code (#12173)
revans2 Nov 18, 2022
3c94071
Merge pull request #12198 from davidwendt/branch-22.12-merge-22.10
ajschmidt8 Nov 18, 2022
22087b3
Merge pull request #12199 from rapidsai/branch-22.12
GPUtester Nov 18, 2022
a2f69e4
Reduce number of tests marked `spilling` (#12197)
madsbk Nov 18, 2022
c37d33c
Merge pull request #12202 from rapidsai/branch-22.12
GPUtester Nov 18, 2022
782fba3
Implement JNI for chunked Parquet reader (#11961)
ttnghia Nov 18, 2022
124a8d5
Merge pull request #12204 from rapidsai/branch-22.12
GPUtester Nov 18, 2022
c79c2d1
Merge branch 'branch-22.12' of https://github.com/rapidsai/cudf into …
vuule Nov 18, 2022
08c0c5a
comment
vuule Nov 18, 2022
9292b50
Merge branch 'branch-22.12' of https://github.com/rapidsai/cudf into …
vuule Nov 18, 2022
21ba312
Fix dask backend dispatch (#12203)
galipremsagar Nov 18, 2022
0bab474
Merge pull request #12209 from rapidsai/branch-22.12
GPUtester Nov 18, 2022
42a6ad4
Remove Python dependencies from Java CI. (#12193)
bdice Nov 18, 2022
a8afc75
fix is_data_empty
vuule Nov 19, 2022
dd39a44
Adds in new java APIs for appending byte arrays to host columnar data…
revans2 Nov 21, 2022
ad878ac
merge latest, resolve conflicts, pass tests
brandon-b-miller Nov 21, 2022
d876ad7
pass style, cleanup
brandon-b-miller Nov 21, 2022
769dfbb
Merge pull request #12194 from vuule/bug-write_orc-compressission
jolorunyomi Nov 21, 2022
a33e7fd
Merge pull request #12215 from rapidsai/branch-22.12
GPUtester Nov 21, 2022
f600196
start to move files
brandon-b-miller Nov 21, 2022
e670c10
remove assert; separate empty stripe and level
vuule Nov 21, 2022
cd6dff3
Workaround for CUB segmented-sort bug with boolean keys
davidwendt Nov 21, 2022
b743f2f
Add `Series.hasnans` and `Index.hasnans` (#12214)
galipremsagar Nov 22, 2022
6756b02
Merge branch 'branch-22.12' of https://github.com/rapidsai/cudf into …
vuule Nov 22, 2022
d49e412
Add codespell as a linter (#12097)
benfred Nov 22, 2022
f15080f
test
vuule Nov 22, 2022
6cbdaf8
starting to refactor
brandon-b-miller Nov 22, 2022
49f983d
Merge pull request #12217 from davidwendt/bug-cub-segmented-sort
jolorunyomi Nov 22, 2022
331e36e
Merge pull request #12225 from rapidsai/branch-22.12
GPUtester Nov 22, 2022
87553b9
Simplify repetitive boolean logic (#12218)
vuule Nov 22, 2022
ed35f67
Merge pull request #12160 from vuule/bug-read_orc-empty-map-column
jolorunyomi Nov 22, 2022
f9b6984
Merge pull request #12226 from rapidsai/branch-22.12
GPUtester Nov 22, 2022
02ead0a
Move strings children functions from cudf/strings/detail/utilities.cu…
davidwendt Nov 23, 2022
0c60819
Make dask pinning looser (#12231)
vyasr Nov 23, 2022
104309d
Merge pull request #12233 from rapidsai/branch-22.12
GPUtester Nov 23, 2022
d970c5a
Add cudf::strings:udf::replace function (#12210)
davidwendt Nov 23, 2022
50c9577
Clean up of `test_spilling.py` (#12220)
madsbk Nov 23, 2022
c0feb02
Wrapping concat and file writes in `@acquire_spill_lock()` (#12232)
madsbk Nov 23, 2022
7426a06
Add support to read binary encoded decimals in parquet (#12205)
PointKernel Nov 23, 2022
7ec0dc0
Fix memory leak in udf_string::assign(&&) function (#12206)
davidwendt Nov 28, 2022
82b646e
Workaround thrust-copy-if limit in wordpiece-tokenizer (#12168)
davidwendt Nov 28, 2022
f063d9e
Support regex EOL where the string ends with a new-line character (#1…
davidwendt Nov 28, 2022
541ba3c
Spill Statistics (#12223)
madsbk Nov 28, 2022
3a11fe1
continue to refactor typing
brandon-b-miller Nov 28, 2022
8161548
move lowering to its own file
brandon-b-miller Nov 28, 2022
52656ab
continue refactoring idxmin and idxmax functions
brandon-b-miller Nov 28, 2022
b9096f3
add tests for idxmin and idxmax, not currently passing
brandon-b-miller Nov 28, 2022
d21a099
normalize call_cuda_functions keys
brandon-b-miller Nov 28, 2022
d02601a
Update 10 minutes to cuDF (#12114)
wence- Nov 28, 2022
62aad1e
continued refactoring
brandon-b-miller Nov 28, 2022
9ff058a
refactoring lowering
brandon-b-miller Nov 28, 2022
3bf6d1d
Use CUDF_JNI_ENABLE_PROFILING to conditionally enable profiling suppo…
bdice Nov 28, 2022
c83ff55
Fix include line for io/numpy.
vyasr Nov 28, 2022
2a5411a
Patch CUB DeviceSegmentedSort and remove workaround (#12234)
davidwendt Nov 29, 2022
9af3ed8
Replace column/table test utilities with macros (#12242)
PointKernel Nov 29, 2022
715aa9a
Fix sort gtests coded in namespace cudf::test (#12237)
davidwendt Nov 29, 2022
eb27104
Merge pull request #12250 from vyasr/fix/io_numpy_link
AyodeAwe Nov 29, 2022
c8f7982
Merge pull request #12253 from rapidsai/branch-22.12
GPUtester Nov 29, 2022
5f07ca2
continued refactoring
brandon-b-miller Nov 29, 2022
e67b94b
Fix spill to device limit (#12252)
madsbk Nov 29, 2022
f4bb574
Fix page size calculation in Parquet writer (#12182)
etseidl Nov 30, 2022
5f83a84
Support `replace` in `strings_udf` (#12207)
brandon-b-miller Nov 30, 2022
e321bf1
Only exclude join keys that are indices from key columns (#12271)
wence- Nov 30, 2022
cb8d9e1
Move `_label_encoding` from Series to Column (#12040)
shwina Nov 30, 2022
6af6563
Remove default parameters from cudf::detail::sort function declaratio…
davidwendt Nov 30, 2022
a442dd8
Rename `cudf::structs::detail::superimpose_parent_nulls` APIs (#12230)
ttnghia Nov 30, 2022
ff3b643
Cover parsing to decimal types in `read_json` tests (#12229)
vuule Dec 1, 2022
8e0ee53
Rework cudf::strings::pad and zfill to use make_strings_children (#12…
davidwendt Dec 1, 2022
e25eecb
Fix compile issue in `json_chunked_reader.cpp` (#12280)
ttnghia Dec 1, 2022
be26260
Add cudf::strings::reverse function (#12227)
davidwendt Dec 1, 2022
a731de9
Add environment variable to control host memory allocation in `hostde…
vuule Dec 2, 2022
1cb87f3
Remove deprecated code for 23.02 (#12281)
vyasr Dec 2, 2022
0563463
Consolidate dask-cudf `groupby_agg` calls in one place (#10835)
charlesbluca Dec 2, 2022
2bba970
Add GitHub Actions Workflows (#12002)
ajschmidt8 Dec 2, 2022
7026638
Enable specifying exceptions in error macros (#12078)
vyasr Dec 2, 2022
2a9aaa9
Fix reduction gtests coded in namespace cudf::test (#12257)
davidwendt Dec 2, 2022
38c3560
Merge branch 'branch-23.02' into groupby-apply-updates
brandon-b-miller Dec 5, 2022
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
1 change: 1 addition & 0 deletions .github/CODEOWNERS
Validating CODEOWNERS rules …
Original file line number Diff line number Diff line change
Expand Up @@ -18,3 +18,4 @@ java/ @rapidsai/cudf-java-codeowners
.github/ @rapidsai/ops-codeowners
/ci/ @rapidsai/ops-codeowners
conda/ @rapidsai/ops-codeowners
dependencies.yaml @rapidsai/ops-codeowners
6 changes: 3 additions & 3 deletions .github/labeler.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,18 +3,18 @@
cuDF (Python):
- 'python/**'
- 'notebooks/**'

libcudf:
- 'cpp/**'

CMake:
- '**/CMakeLists.txt'
- '**/cmake/**'

cuDF (Java):
- 'java/**'

gpuCI:
ci:
- 'ci/**'

conda:
Expand Down
20 changes: 20 additions & 0 deletions .github/workflows/add_to_project.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
name: Add new issue/PR to project

on:
issues:
types:
- opened

pull_request_target:
types:
- opened

jobs:
add-to-project:
name: Add issue or PR to project
runs-on: ubuntu-latest
steps:
- uses: actions/[email protected]
with:
project-url: https://github.com/orgs/rapidsai/projects/51
github-token: ${{ secrets.ADD_TO_PROJECT_GITHUB_TOKEN }}
57 changes: 57 additions & 0 deletions .github/workflows/build.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,57 @@
name: build

on:
push:
branches:
- "branch-*"
tags:
- v[0-9][0-9].[0-9][0-9].[0-9][0-9]
workflow_call:
inputs:
branch:
required: true
type: string
date:
required: true
type: string
sha:
required: true
type: string
build_type:
type: string
default: nightly

concurrency:
group: ${{ github.workflow }}-${{ github.ref }}
cancel-in-progress: true

jobs:
cpp-build:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-matrix-build.yaml@main
with:
build_type: ${{ inputs.build_type || 'branch' }}
repo: rapidsai/cudf
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
python-build:
needs: [cpp-build]
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-matrix-build.yaml@main
with:
build_type: ${{ inputs.build_type || 'branch' }}
repo: rapidsai/cudf
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
upload-conda:
needs: [cpp-build, python-build]
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-upload-packages.yaml@main
with:
build_type: ${{ inputs.build_type || 'branch' }}
repo: rapidsai/cudf
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
71 changes: 71 additions & 0 deletions .github/workflows/pr.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,71 @@
name: pr

on:
push:
branches:
- "pull-request/[0-9]+"

concurrency:
group: ${{ github.workflow }}-${{ github.ref }}
cancel-in-progress: true

jobs:
pr-builder:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/pr-builder.yaml@main
checks:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/checks.yaml@main
conda-cpp-build:
needs: checks
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-matrix-build.yaml@main
with:
build_type: pull-request
conda-cpp-tests:
needs: conda-cpp-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-tests.yaml@main
with:
build_type: pull-request
conda-python-build:
needs: conda-cpp-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-matrix-build.yaml@main
with:
build_type: pull-request
conda-python-cudf-tests:
needs: conda-python-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@main
with:
build_type: pull-request
test_script: "ci/test_python_cudf.sh"
conda-python-other-tests:
# Tests for dask_cudf, custreamz, cudf_kafka are separated for CI parallelism
needs: conda-python-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@main
with:
build_type: pull-request
test_script: "ci/test_python_other.sh"
conda-java-tests:
needs: conda-cpp-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@main
with:
build_type: pull-request
node_type: "gpu-v100-495-1"
arch: "amd64"
container_image: "rapidsai/ci:latest"
run_script: "ci/test_java.sh"
conda-notebook-tests:
needs: conda-python-build
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@main
with:
build_type: pull-request
node_type: "gpu-v100-495-1"
arch: "amd64"
container_image: "rapidsai/ci:latest"
run_script: "ci/test_notebooks.sh"
57 changes: 0 additions & 57 deletions .github/workflows/stale.yaml

This file was deleted.

72 changes: 72 additions & 0 deletions .github/workflows/test.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,72 @@
name: test

on:
workflow_call:
inputs:
branch:
required: true
type: string
date:
required: true
type: string
sha:
required: true
type: string

jobs:
conda-cpp-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-cpp-tests.yaml@main
with:
build_type: nightly
repo: rapidsai/cudf
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
conda-python-cudf-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@main
with:
build_type: nightly
repo: rapidsai/cudf
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
test_script: "ci/test_python_cudf.sh"
conda-python-other-tests:
# Tests for dask_cudf, custreamz, cudf_kafka are separated for CI parallelism
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/conda-python-tests.yaml@main
with:
build_type: nightly
repo: rapidsai/cudf
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
test_script: "ci/test_python_other.sh"
conda-java-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@main
with:
build_type: nightly
repo: rapidsai/cudf
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
node_type: "gpu-v100-495-1"
arch: "amd64"
container_image: "rapidsai/ci:latest"
run_script: "ci/test_java.sh"
conda-notebook-tests:
secrets: inherit
uses: rapidsai/shared-action-workflows/.github/workflows/custom-job.yaml@main
with:
build_type: nightly
repo: rapidsai/cudf
branch: ${{ inputs.branch }}
date: ${{ inputs.date }}
sha: ${{ inputs.sha }}
node_type: "gpu-v100-495-1"
arch: "amd64"
container_image: "rapidsai/ci:latest"
run_script: "ci/test_notebooks.sh"
77 changes: 77 additions & 0 deletions .github/workflows/wheels.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,77 @@
name: cuDF wheels

on:
workflow_call:
inputs:
versioneer-override:
type: string
default: ''
build-tag:
type: string
default: ''
branch:
required: true
type: string
date:
required: true
type: string
sha:
required: true
type: string
build-type:
type: string
default: nightly

concurrency:
group: "cudf-${{ github.workflow }}-${{ github.ref }}"
cancel-in-progress: true

jobs:
cudf-wheels:
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-manylinux.yml@main
with:
repo: rapidsai/cudf

build-type: ${{ inputs.build-type }}
branch: ${{ inputs.branch }}
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}

package-dir: python/cudf
package-name: cudf

python-package-versioneer-override: ${{ inputs.versioneer-override }}
python-package-build-tag: ${{ inputs.build-tag }}

skbuild-configure-options: "-DCUDF_BUILD_WHEELS=ON -DDETECT_CONDA_ENV=OFF"

test-extras: test

# Have to manually specify the cupy install location on arm.
# Have to also manually install tokenizers==0.10.2, which is the last tokenizers
# to have a binary aarch64 wheel available on PyPI
# Otherwise, the tokenizers sdist is used, which needs a Rust compiler
test-before-arm64: "pip install tokenizers==0.10.2 cupy-cuda11x -f https://pip.cupy.dev/aarch64"

test-unittest: "pytest -v -n 8 ./python/cudf/cudf/tests"
secrets: inherit
dask_cudf-wheel:
needs: cudf-wheels
uses: rapidsai/shared-action-workflows/.github/workflows/wheels-pure.yml@main
with:
repo: rapidsai/cudf

build-type: ${{ inputs.build-type }}
branch: ${{ inputs.branch }}
sha: ${{ inputs.sha }}
date: ${{ inputs.date }}

package-dir: python/dask_cudf
package-name: dask_cudf

python-package-versioneer-override: ${{ inputs.versioneer-override }}
python-package-build-tag: ${{ inputs.build-tag }}

test-extras: test
test-unittest: "pytest -v -n 8 ./python/dask_cudf/dask_cudf/tests"
secrets: inherit
Loading