Change log

Generated on 2022-02-14

Release 22.02

Features


#4305	[FEA] write nvidia tool wrappers to allow old YARN versions to work with MIG
#4360	[FEA] Add explain api for Spark 2.X
#3541	[FEA] Support max on single-level struct in aggregation context
#4238	[FEA] Add a Spark 3.X Explain only mode to the plugin
#3952	[Audit] [FEA][SPARK-32986][SQL] Add bucketed scan info in query plan of data source v1
#4412	[FEA] Improve support for \A, \Z, and \z in regular expressions
#3979	[FEA] Improvements for CPU(Row) based UDF
#4467	[FEA] Add support for regular expression with repeated digits (`\d+`, `\d*`, `\d?`)
#4439	[FEA] Enable GPU broadcast exchange reuse for DPP when AQE enabled
#3512	[FEA] Support org.apache.spark.sql.catalyst.expressions.Sequence
#3475	[FEA] Spark 3.2.0 reads Parquet unsigned int64(UINT64) as Decimal(20,0) but CUDF does not support it
#4091	[FEA] regexp_replace: Improve support for ^ and $
#4104	[FEA] Support org.apache.spark.sql.catalyst.expressions.ReplicateRows
#4027	[FEA] Support SubqueryBroadcast on GPU to enable exchange reuse during DPP
#4284	[FEA] Support idx = 0 in GpuRegExpExtract
#4002	[FEA] Implement regexp_extract on GPU
#3221	[FEA] Support GpuFirst and GpuLast on nested types under reduction aggregations
#3944	[FEA] Full support for sum with overflow on Decimal 128
#4028	[FEA] support GpuCast from non-nested ArrayType to StringType
#3250	[FEA] Make CreateMap duplicate key handling compatible with Spark and enable CreateMap by default
#4170	[FEA] Make regular expression behavior with `$` and `\r` consistent with CPU
#4001	[FEA] Add regexp support to regexp_replace
#3962	[FEA] Support null characters in regular expressions in RLIKE
#3797	[FEA] Make RLike support consistent with Apache Spark

Performance


#4392	[FEA] could the parquet scan code avoid acquiring the semaphore for an empty batch?
#679	[FEA] move some deserialization code out of the scope of the gpu-semaphore to increase cpu concurrent
#4350	[FEA] Optimize the all-true and all-false cases in GPU `If` and `CaseWhen`
#4309	[FEA] Leverage cudf conditional nested loop join to implement semi/anti hash join with condition
#4395	[FEA] acquire the semaphore after concatToHost in GpuShuffleCoalesceIterator
#4134	[FEA] Allow `EliminateJoinToEmptyRelation` in `GpuBroadcastExchangeExec`
#4189	[FEA] understand why between is so expensive

Bugs Fixed


#4675	[BUG] Jenkins integration build timed out at 10 hours
#4665	[BUG] Spark321Shims.getParquetFilters failed with NoSuchMethodError
#4635	[BUG] nvidia-smi wrapper script ignores ENABLE_NON_MIG_GPUS=1 on a heterogeneous multi-GPU machine
#4500	[BUG] Build failures against Spark 3.2.1 rc1 and make 3.2.1 non snapshot
#4631	[BUG] Release build with mvn option `-P source-javadoc` FAILED
#4625	[BUG] NDS query 5 fails with AdaptiveSparkPlanExec assertion
#4632	[BUG] Build failing for Spark 3.3.0 due to deprecated method warnings
#4599	[BUG] test_group_apply_udf and test_group_apply_udf_more_types hangs on Databricks 9.1
#4600	[BUG] crash if we have a decimal128 in a struct in an array
#4581	[BUG] Build error "GpuOverrides.scala:924: wrong number of arguments" on DB9.1.x spark-3.1.2
#4593	[BUG] dup GpuHashJoin.diff case-folding issue
#4503	[BUG] regexp_replace with back references produces incorrect results on GPU
#4567	[BUG] Profile tool hangs in compare mode
#4315	[BUG] test_hash_reduction_decimal_overflow_sum[30] failed OOM in integration tests
#4551	[BUG] protobuf-java version changed to 3.x
#4499	[BUG]GpuSequence blows up when nulls exist in any of the inputs (start, stop, step)
#4454	[BUG] Shade warnings when building the tools artifact
#4541	[BUG] Column vector leak in conditionals_test.py
#4514	[BUG] test_hash_reduction_pivot_without_nans failed
#4521	[BUG] Inconsistencies in handling of newline characters and string and line anchors
#4548	[BUG] ai.rapids.cudf.CudaException: an illegal instruction was encountered in databricks 9.1
#4475	[BUG] `\D` and `\W` match newline in Spark but not in cuDF
#4524	[BUG] RegExp transpiler fails to detect some choice expressions that cuDF cannot compile
#3226	[BUG]OOM happened when do cube operations
#2504	[BUG] OOM when running NDS queries with UCX and GDS
#4273	[BUG] Rounding past the size that can be stored in a type produces incorrect results
#4060	[BUG] test_hash_groupby_approx_percentile_long_repeated_keys failed intermittently
#4039	[BUG] Spark 3.3.0 IT Array test failures
#3849	[BUG] In ANSI mode we can fail in cases Spark would not due to conditionals
#4421	[BUG] the driver is trying to load CUDA with latest 22.02
#4455	[BUG] join_test.py::test_struct_self_join[IGNORE_ORDER({'local': True})] failed in spark330
#4442	[BUG] mvn build FAILED with option `-P noSnapshotsWithDatabricks`
#4281	[BUG] q9 regression between 21.10 and 21.12
#4280	[BUG] q88 regression between 21.10 and 21.12
#4422	[BUG] Host column vectors are being leaked during tests
#4446	[BUG] GpuCast crashes when casting from Array with unsupportable child type
#4432	[BUG] nightly build 3.3.0 failed: HashClusteredDistribution is not a member of org.apache.spark.sql.catalyst.plans.physical
#4443	[BUG] SPARK-37705 breaks parquet filters from Spark 3.3.0 and Spark 3.2.2 onwards
#4316	[BUG] Exception: Unable to find py4j, your SPARK_HOME may not be configured correctly intermittently
#4378	[BUG] udf_test udf_cudf_test failed require_minimum_pandas_version check in spark 320+
#4423	[BUG] Build is failing due to FileScanRDD changes in Spark 3.3.0-SNAPSHOT
#4401	[BUG]array_test.py::test_array_contains failures
#4403	[BUG] NDS query 72 logs codegen fallback exception and produces incorrect results
#4386	[BUG] conditionals_test.py FAILED with side_effects_cast[Integer/Long] on Databricks 9.1 Runtime
#3934	[BUG] Dependencies of published integration tests jar are missing
#4341	[BUG] GpuCast.scala:nnn warning: discarding unmoored doc comment
#4356	[BUG] nightly spark303 deploy pulling spark301 aggregator
#4347	[BUG] Dist jar pom lists aggregator jar as dependency
#4176	[BUG] ParseDateTimeSuite UT failed
#4292	[BUG] no meaningful message is surfaced to maven when binary-dedupe fails
#4351	[BUG] Tests FAILED On SPARK-3.2.0, com.nvidia.spark.rapids.SerializedTableColumn cannot be cast to com.nvidia.spark.rapids.GpuColumnVector
#4346	[BUG] q73 decimal was twice as slow in weekly results
#4334	[BUG] GpuColumnarToRowExec will always be tagged False for exportColumnarRdd after Spark311
#4339	The parameter `dataType` is not necessary in `resolveColumnVector` method.
#4275	[BUG] Row-based Hive UDF will fail if arguments contain a foldable expression.
#4229	[BUG] regexp_replace `[^a]` has different behavior between CPU and GPU for multiline strings
#4294	[BUG] parquet_write_test.py::test_ts_write_fails_datetime_exception failed in spark 3.1.1 and 3.1.2
#4205	[BUG] Get different results when casting from timestamp to string
#4277	[BUG] cudf_udf nightly cudf import rmm failed
#4246	[BUG] Regression in CastOpSuite due to cuDF change in parsing NaN
#4243	[BUG] test_regexp_replace_null_pattern_fallback[ALLOW_NON_GPU(ProjectExec,RegExpReplace)] failed in databricks
#4244	[BUG] Cast from string to float using hand-picked values failed
#4227	[BUG] RAPIDS Shuffle Manager doesn't fallback given encryption settings
#3374	[BUG] minor deprecation warnings in a 3.2 shim build
#3613	[BUG] release312db profile pulls in 311until320-apache
#4213	[BUG] unused method with a misleading outdated comment in ShimLoader
#3609	[BUG] GpuShuffleExchangeExec in v2 shims has inconsistent packaging
#4127	[BUG] CUDF 22.02 nightly test failure

PRs


#4771	revert cudf api links from legacy to stable[skip ci]
#4767	Update 22.02 changelog to latest [skip ci]
#4750	Updated doc for decimal support
#4757	Update qualification tool to remove DECIMAL 128 as potential problem
#4755	Fix databricks doc for limitations.[skip ci]
#4751	Fix broken hyperlinks in documentation [skip ci]
#4706	Update 22.02 changelog to latest [skip ci]
#4700	Update cudfjni version to released 22.02.0
#4701	Decrease nighlty tests upper limitation to 7 [skip ci]
#4639	Update changelog for 22.02 and archive info of some older releases [skip ci]
#4572	Add download page for 22.02 [skip ci]
#4672	Revert "Disable 311cdh build due to missing dependency (#4659)"
#4662	Update the deploy script [skip ci]
#4657	Upmerge spark2 directory to the latest 22.02 changes
#4659	Disable 311cdh build by default because of a missing dependency
#4508	Fix Spark 3.2.1 build failures and make it non-snapshot
#4652	Remove non-deterministic test order in nightly [skip ci]
#4643	Add profile release301 when mvn help:evaluate
#4630	Fix the incomplete capture of SubqueryBroadcast
#4633	Suppress newTaskTempFile method warnings for Spark 3.3.0 build
#4618	[DB31x] Pick the correct Python runner for flatmap-group Pandas UDF
#4622	Fallback to CPU when encoding is not supported for JSON reader
#4470	Add in HashPartitioning support for decimal 128
#4535	Revert "Disable orc write by default because of https://issues.apache.org/jira/browse/ORC-1075 (#4471)"
#4583	Avoid unapply on PromotePrecision
#4573	Correct version from 21.12 to 22.02[skip ci]
#4575	Correct and update links in UDF doc[skip ci]
#4501	Switch and/or to use new cudf binops to improve performance
#4594	Resolve case-folding issue [skip ci]
#4585	Spark2 module upmerge, deploy script, and updates for Jenkins
#4589	Increase premerge databricks IDLE_TIMEOUT to 4 hours [skip ci]
#4485	Add json reader support
#4556	regexp_replace with back-references should fall back to CPU
#4569	Fix infinite loop with Profiling tool compare mode and app with no sql ids
#4529	Add support for Spark 2.x Explain Api
#4577	Revert "Fix CVE-2021-22569 (#4545)"
#4520	GpuSequence refactor
#4570	A few quick fixes to try to reduce max memory usage in the tests
#4477	Use libcudf mixed joins for conditional hash joins
#4566	remove scala-library from combined tools jar
#4552	Fix resource leak in GpuCaseWhen
#4553	Reenable test_hash_reduction_pivot_without_nans
#4530	Fix correctness issues in regexp and add `\r` and `\n` to fuzz tests
#4549	Fix typos in integration tests README [skip ci]
#4545	Fix CVE-2021-22569
#4543	Enable auto-merge from branch-22.02 to branch-22.04 [skip ci]
#4540	Remove user kuhushukla
#4434	Support max on single-level struct in aggregation context
#4534	Temporarily disable integration test - test_hash_reduction_pivot_without_nans
#4322	Add an explain only mode to the plugin
#4497	Make better use of pinned memory pool
#4512	remove hadoop version requirement[skip ci]
#4527	Fall back to CPU for regular expressions containing \D or \W
#4525	Properly close data writer in GpuFileFormatWriter
#4502	Removed the redundant test for element_at and fixed the failing one
#4523	Add more integration tests for decimal 128
#3762	Call the right method to convert table from row major <=> col major
#4482	Simplified the construction of zero scalar in GpuUnaryMinus
#4510	Update copyright in NOTICE [skip ci]
#4484	Update GpuFileFormatWriter to stay in sync with recent Spark changes, but still not support writing Hive bucketed table on GPU.
#4492	Fall back to CPU for regular expressions containing hex digits
#4495	Enable approx_percentile by default
#4420	Fix up incorrect results of rounding past the max digits of data type
#4483	Update test case of reading nested unsigned parquet file
#4490	Remove warning about RMM default allocator
#4461	[Audit] Add bucketed scan info in query plan of data source v1
#4489	Add arrays of decimal128 to join tests
#4476	Don't acquire the semaphore for empty input while scanning
#4424	Improve support for regular expression string anchors `\A`, `\Z`, and `\z`
#4491	Skip the test for spark versions 3.1.1, 3.1.2 and 3.2.0 only
#4459	Use merge sort for struct types in non-key columns
#4494	Append new authorized user to blossom-ci whitelist [skip ci]
#4400	Enable approx percentile tests
#4471	Disable orc write by default because of https://issues.apache.org/jira/browse/ORC-1075
#4462	Rename DECIMAL_128_FULL and rework usage of TypeSig.gpuNumeric
#4479	Change signoff check image to slim-buster [skip ci]
#4464	Throw SparkArrayIndexOutOfBoundsException for Spark 3.3.0+
#4469	Support repetition of \d and \D in regexp functions
#4472	Modify docs for 22.02 to address issue-4319[skip ci]
#4440	Enable GPU broadcast exchange reuse for DPP when AQE enabled
#4376	Add sequence support
#4460	Abstract the text based PartitionReader
#4383	Fix correctness issue with CASE WHEN with expressions that have side-effects
#4465	Refactor for shims 320+
#4463	Avoid replacing a hash join if build side is unsupported by the join type
#4456	Fix build issues: 1 clean non-exists target dirs; 2 remove duplicated plugin
#4416	Unshim join execs
#4172	Support String to Decimal 128
#4458	Exclude some metadata operators when checking GPU replacement
#4451	Some metrics improvements and timeline reporting
#4435	Disable add profile src execution by default to make the build log clean
#4436	Print error log to stderr output
#4155	Add partial support for line begin and end anchors in regexp_replace
#4428	Exhaustively iterate ColumnarToRow iterator to avoid leaks
#4430	update pca example link in ml-integration.md[skip ci]
#4452	Limit parallelism of nightly tests [skip ci]
#4449	Add recursive type checking and fallback tests for casting array with unsupported element types to string
#4437	Change logInfo to logWarning
#4447	Fix 330 build error and add 322 shims layer
#4417	Fix an Intellij debug issue
#4431	Add DateType support for AST expressions
#4433	Import the right pandas from conda [skip ci]
#4419	Import the right pandas from conda
#4427	Update getFileScanRDD shim for recent changes in Spark 3.3.0
#4397	Ignore cufile.log
#4388	Add support for ReplicateRows
#4399	Update docs for Profiling and Qualification tool to change wording
#4407	Fix GpuSubqueryBroadcast on multi-fields relation
#4396	GpuShuffleCoalesceIterator acquire semaphore after host concat
#4361	Accommodate altered semantics of `cudf::lists::contains()`
#4394	Use correct column name in GpuIf test
#4385	Add missing GpuSubqueryBroadcast replacement rule for spark31x
#4387	Fix auto merge conflict 4384[skip ci]
#4374	Fix the IT module depends on the tests module
#4365	Not publishing integration_tests jar to Maven Central [skip ci]
#4358	Update GpuIf to support expressions with side effects
#4382	Remove unused scallop dependency from integration_tests
#4364	Replace Scala document with Scala comment for inner functions
#4373	Add pytest tags for nightly test parallel run [skip ci]
#4150	Support GpuSubqueryBroadcast for DPP
#4372	Move casting to string tests from array_test.py and struct_test.py to cast_test.py
#4371	Fix typo in skipTestsFor330 calculation [skip ci]
#4355	Dedicated deploy-file with reduced pom in nightly build [skip ci]
#4352	Revert "Ignore failing string to timestamp tests temporarily (#4197)"
#4359	Audit - SPARK-37268 - Remove unused variable in GpuFileScanRDD [Databricks]
#4327	Print meaningful message when calling scripts in maven
#4354	Fix regression in AQE optimizations
#4343	Fix issue with binding to hash agg columns with computation
#4285	Add support for regexp_extract on the GPU
#4349	Fix PYTHONPATH in pre-merge
#4269	The option for the nightly script not deploying jars [skip ci]
#4335	Fix the issue of exporting Column RDD
#4336	Split expensive pytest files in cases level [skip ci]
#4328	Change the explanation of why the operator will not work on GPU
#4338	Use scala Int.box instead of Integer constructors
#4340	Remove the unnecessary parameter `dataType` in `resolveColumnVector` method
#4256	Allow returning an EmptyHashedRelation when a broadcast result is empty
#4333	Add tests about writing empty table to ORC/PAQUET
#4337	Support GpuFirst and GpuLast on nested types under reduction aggregations
#4331	Fix parquet options builder calls
#4310	Fix typo in shim class name
#4326	Fix 4315 decrease concurrentGpuTasks to avoid sum test OOM
#4266	Check revisions for all shim jars while build all
#4282	Use data type to create an inspector for a foldable GPU expression.
#3144	Optimize AQE with Spark 3.2+ to avoid redundant transitions
#4317	[BUG] Update nightly test script to dynamically set mem_fraction [skip ci]
#4206	Porting GpuRowToColumnar converters to InternalColumnarRDDConverter
#4272	Full support for SUM overflow detection on decimal
#4255	Make regexp pattern `[^a]` consistent with Spark for multiline strings
#4306	Revert commonizing the int96ParquetRebase* functions
#4299	Fix auto merge conflict 4298 [skip ci]
#4159	Optimize sample perf
#4235	Commonize v2 shim
#4274	Add tests for timestamps that overflowed before.
#4271	Skip test_regexp_replace_null_pattern_fallback on Spark 3.1.1 and later
#4278	Use mamba for cudf conda install [skip ci]
#4270	Document exponent differences when casting floating point to string [skip ci]
#4268	Fix merge conflict with branch-21.12
#4093	Add tests for regexp() and regexp_like()
#4259	fix regression in cast from string to float that caused signed NaN to be considered valid
#4241	fix bug in parsing regex character classes that start with `^` and contain an unescaped `]`
#4224	Support row-based Hive UDFs
#4221	GpuCast from ArrayType to StringType
#4007	Implement duplicate key handling for GpuCreateMap
#4251	Skip test_regexp_replace_null_pattern_fallback on Databricks
#4247	Disable failing CastOpSuite test
#4239	Make EOL anchor behavior match CPU for strings ending with newline
#4153	Regexp: Only transpile once per expression rather than once per batch
#4230	Change to build tools module with all the versions by default
#4223	Fixes a minor deprecation warning
#4215	Rebalance testing load
#4214	Fix pre_merge ci_2 [skip ci]
#4212	Remove an unused method with its outdated comment
#4211	Update test_floor_ceil_overflow to be more lenient on exception type
#4203	Move all the GpuShuffleExchangeExec shim v2 classes to org.apache.spark
#4193	Rename 311until320-apache to 311until320-noncdh
#4197	Ignore failing string to timestamp tests temporarily
#4160	Fix merge issues for branch 22.02
#4081	Convert String to DecimalType without casting to FloatType
#4132	Fix auto merge conflict 4131 [skip ci]
#4099	[REVIEW] Init version 22.02.0
#4113	Fix pre-merge CI 2 conditions [skip ci]
#4064	Regex: transpile `.` to `[^\r\n]` in cuDF
#4044	RLike: Fall back to CPU for regex that would produce incorrect results

Release 21.12

Features


#1571	[FEA] Better precision range for decimal multiply, and possibly others
#3953	[FEA] Audit: Add array support to union by name
#4085	[FEA] Decimal 128 Support: Concat
#4073	[FEA] Decimal 128 Support: MapKeys, MapValues, MapEntries
#3432	[FEA] Qualification tool checks if there is any "Scan JDBCRelation" and count it as "problematic"
#3824	[FEA] Support MapType in ParquetCachedBatchSerializer
#4048	[FEA] WindowExpression support for Decimal 128 in Spark 320
#4047	[FEA] Literal support for Decimal 128 in Spark 320
#3863	[FEA] Add Spark 3.3.0-SNAPSHOT Shim
#3814	[FEA] stddev stddev_samp and std should be supported over a window
#3370	[FEA] Add support for Databricks 9.1 runtime
#3876	[FEA] Support REGEXP_REPLACE to replace null values
#3784	[FEA] Support ORC write Map column(single level)
#3470	[FEA] Add shims for 3.2.1-SNAPSHOT
#3855	[FEA] CPU based UDF to run efficiently and transfer data back to GPU for supported operations
#3739	[FEA] Provide an explicit config for fallback on CPU if plan rewrite fails
#3888	[FEA] Decimal 128 Support: Add a "Trust me I know it will not overflow config"
#3088	[FEA] Profile tool print problematic operations
#3886	[FEA] Decimal 128 Support: Extend the range for Decimal Multiply and Divide
#79	[FEA] Support Size operation
#3880	[FEA] Decimal 128 Support: Average aggregation
#3659	[FEA] External tool integration with Qualification tool
#2	[FEA] RLIKE support
#3192	[FEA] Support decimal type in ORC writer
#3419	[FEA] Add support for org.apache.spark.sql.execution.SampleExec
#3535	[FEA] Qualification tool can detect RDD APIs in SQL plan
#3494	[FEA] Support structs in ORC writer
#3514	[FEA] Support collect_set on struct in aggregation context
#3515	[FEA] Support CreateArray to produce array(struct)
#3116	[FEA] Support Maps, Lists, and Structs as non-key columns on joins
#2054	[FEA] Add support for Arrays to ParquetCachedBatchSerializer
#3573	[FEA] Support Cache(PCBS) Array-of-Struct

Performance


#3768	[DOC] document databricks init script required for UCX
#2867	[FEA] Make LZ4_CHUNK_SIZE configurable
#3832	[FEA] AST enabled GpuBroadcastNestedLoopJoin left side can't be small
#3798	[FEA] bounds checking in joins can be expensive
#3603	[FEA] Allocate UCX bounce buffers outside of RMM if ASYNC allocator is enabled

Bugs Fixed


#4253	[BUG] Dependencies missing of spark-rapids v21.12.0 release jars
#4216	[BUG] AQE Crashing Spark RAPIDS when using filter() and union()
#4188	[BUG] data corruption in GpuBroadcastNestedLoopJoin with empty relations edge case
#4191	[BUG] failed to read DECIMAL128 within MapType from ORC
#4175	[BUG] arithmetic_ops_test failed in spark 3.2.0
#4162	[BUG] isCastDecimalToStringEnabled is never called
#3894	[BUG] test_pandas_scalar_udf and test_pandas_map_udf failed in UCX standalone CI run
#3970	[BUG] mismatching timezone settings on executor and driver can cause ORC read data corruption
#4141	[BUG] Unable to start the RapidsShuffleManager in databricks 9.1
#4102	[BUG] udf-example build failed: Unknown CMake command "cpm_check_if_package_already_added".
#4084	[BUG] window on unbounded preceeding and unbounded following can produce incorrect results.
#3990	[BUG] Scaladoc link warnings in ParquetCachedBatchSerializer and ExplainPlan
#4108	[BUG] premerge fails due to Spark 3.3.0 HadoopFsRelation after SPARK-37289
#4042	[BUG] cudf_udf tests fail on nightly Integration test run
#3743	[BUG] Implicitly catching all exceptions warning in GpuOverrides
#4069	[BUG] parquet_test.py pytests FAILED on Databricks-9.1-ML-spark-3.1.2
#3461	[BUG] Cannot build project from a sub-directory
#4053	[BUG] buildall uses a stale aggregator dependency during test compilation
#3703	[BUG] test_hash_groupby_approx_percentile_long_repeated_keys failed with TypeError
#3706	[BUG] approx_percentile returns array of zero percentiles instead of null in some cases
#4017	[BUG] Why is the hash aggregate not handling empty result expressions
#3994	[BUG] can't open notebook 'docs/demo/GCP/mortgage-xgboost4j-gpu-scala.ipynb'
#3996	[BUG] Exception happened when getting a null row
#3999	[BUG] Integration cache_test failures - ArrayIndexOutOfBoundsException
#3532	[BUG] DatabricksShimVersion must carry runtime version info
#3834	[BUG] Approx_percentile deserialize error when calling "show" rather than "collect"
#3992	[BUG] failed create-parallel-world in databricks build
#3987	[BUG] "mvn clean package -DskipTests" is no longer working
#3866	[BUG] RLike integration tests failing on Azure Databricks 7.3
#3980	[BUG] udf-example build failed due to maven-antrun-plugin upgrade
#3966	[BUG] udf-examples module fails on `mvn compile` and `mvn test`
#3977	[BUG] databricks aggregator jar deployed failed
#3915	[BUG] typo in verify_same_sha_for_unshimmed prevents the offending class file name from being logged.
#1304	[BUG] Query fails with HostColumnarToGpu doesn't support Structs
#3924	[BUG] ExpressionEncoder does not work for input in `GpuScalaUDF`
#3911	[BUG] CI fails on an inconsistent set of partial builds
#2896	[BUG] Extra GpuColumnarToRow when using ParquetCachedBatchSerializer on databricks
#3864	[BUG] test_sample_produce_empty_batch failed in dataproc
#3823	[BUG]binary-dedup.sh script fails on mac
#3658	[BUG] DataFrame actions failing with error: Error : java.lang.NoClassDefFoundError: Could not initialize class com.nvidia.spark.rapids.GpuOverrides withlatest 21.10 jars
#3857	[BUG] nightly build push dist packge w/ single version of spark
#3854	[BUG] not found: type PoissonDistribution in databricks build
#3852	spark-nightly-build deploys all modules due to typo in `-pl`
#3844	[BUG] nightly spark311cdh build failed
#3843	[BUG] databricks nightly deploy failed
#3705	[BUG] Change `nullOnDivideByZero` from runtime parameter to aggregate expression for `stddev` and `variance` aggregation families
#3614	[BUG] ParquetMaterializer.scala appears in both v1 and v2 shims
#3430	[BUG] Profiling tool silently stops without producing any output on a Synapse Spark event log
#3311	[BUG] cache_test.py failed w/ cache.serializer in spark 3.1.2
#3710	[BUG] Usage of Class.forName without specifying a classloader
#3462	[BUG] IDE complains about duplicate ShimBasePythonRunner instances
#3476	[BUG] test_non_empty_ctas fails on yarn

PRs


#4362	Decimal128 support for Parquet
#4391	update gcp custom dataproc image version to avoid log4j issue[skip ci]
#4379	update hot fix cudf link v21.12.2
#4367	update 21.12 branch for doc [skip ci]
#4245	Update changelog 21.12 to latest [skip ci]
#4258	Sanitize column names in ParquetCachedBatchSerializer before writing to Parquet
#4308	Bump up GPU reserve memory to 640MB
#4307	Update Download page for 21.12 [skip ci]
#4261	Update cudfjni version to released 21.12.0
#4265	Remove aggregator dependency before deploying dist artifact
#4030	Support code coverage report with single version jar [skip ci]
#4287	Update 21.12 compatibility guide for known regexp issue [skip ci]
#4242	Fix indentation issue in getting-started-k8s guide [skip ci]
#4263	Add missing ORC write tests on Map of Decimal
#4257	Implement getShuffleRDD and fixup mismatched output types on shuffle reuse
#4250	Update the release script [skip ci]
#4222	Add arguments support to 'databricks/run-tests.py'
#4233	Add databricks init script for UCX
#4231	RAPIDS Shuffle Manager fallback if security is enabled
#4228	Fix unconditional nested loop joins on empty tables
#4217	Enable event log for qualification & profiling tools testing from IT
#4202	Parameter for the Databricks zone-id [skip ci]
#4199	modify some words for synapse getting started guide[skip ci]
#4200	Disable approx percentile tests that intermittently fail
#4187	Added a getting started guide for Synapse[skip ci]
#4192	Fix ORC read DECIMAL128 inside MapType
#4173	Update approx percentile docs to link to issue 4060 [skip ci]
#4174	Document Bloop, Metals and VS code as an IDE option [skip ci]
#4181	Fix element_at for 3.2.0 and array/struct cast
#4110	Add a getting started guide on workload qualification [skip ci]
#4106	Add docs for MIG on YARN [skip ci]
#4100	Add PCA example to ml-integration page [skip ci]
#4177	Decimal128: added missing decimal128 signature on Spark 32X
#4161	More integration tests with decimal128
#4165	Fix type checks for get array item in 3.2.0
#4163	Enable config to check for casting decimals to strings
#4154	Use num_slices to guarantee partition shape in the pandas udf tests
#4129	Check executor timezone is same as driver timezone when running on GPU
#4139	Decimal128 Support
#4128	Fix build errors in udf-examples native build
#4063	Regexp_replace support regexp
#4125	Remove unused imports
#4052	Support null safe host column vector
#4116	Add in tests to check for overflow in unbounded window
#4111	Added external doc links for JRE and Spark
#4105	Enforce checks for unused imports and missed interpolation
#4107	Set the task context in background reader threads
#4114	Refactoring cudf_udf test setup
#4109	Stop using redundant partitionSchemaOption dropped in 3.3.0
#4097	Enable auto-merge from branch-21.12 to branch-22.02 [skip ci]
#4094	Remove spark311db shim layer
#4082	Add abfs and abfss to the cloud scheme
#4071	Treat scalac warnings as errors
#4043	Promote cudf as dist direct dependency, mark aggregator provided
#4076	Sets the GPU device id in the UCX early start thread
#4087	Regex parser improvements and bug fixes
#4079	verify "Add array support to union by name " by adding an integration test
#4090	Update pre-merge expression for 2022+ CI [skip ci]
#4049	Change Databricks image from 8.2 to 9.1 [skip ci]
#4051	Upgrade ORC version from 1.5.8 to 1.5.10
#4080	Add case insensitive when clipping parquet blocks
#4083	Fix compiler warning in regex transpiler
#4070	Support building from sub directory
#4072	Fix overflow checking on optimized decimal sum
#4067	Append new authorized user to blossom-ci whitelist [skip ci]
#4066	Temply disable cudf_udf test
#4057	Restore original ASL 2.0 license text
#3937	Qualification tool: Detect JDBCRelation in eventlog
#3925	verify AQE and DPP both on
#3982	Fix the issue of parquet reading with case insensitive schema
#4054	Use install for the base version build thread [skip ci]
#4008	[Doc] Update the getting started guide for databricks: Change from 8.2 to 9.1 runtime [skip ci]
#4010	Enable MapType for ParquetCachedBatchSerializer
#4046	lower GPU memory reserve to 256MB
#3770	Enable approx percentile tests
#4038	Change the `catalystConverter` to be a Scala `val`.
#4035	Hash aggregate fix empty resultExpressions
#3998	Check for CPU cores and free memory in IT script
#3984	Check for data write command before inserting hash sort optimization
#4019	initialize RMM with a single pool size
#3993	Qualification tool: Remove "unsupported" word for nested complex types
#4033	skip spark 330 tests temporarily in nightly [skip ci]
#4029	Update buildall script and the build doc [skip ci]
#4014	fix can't open notebook 'docs/demo/GCP/mortgage-xgboost4j-gpu-scala.ipynb'[skip ci]
#4024	Allow using a custom Spark Resource Name for a GPU
#4012	Add Apache Spark 3.3.0-SNAPSHOT Shims
#4021	Explicitly use the public version of ParquetCachedBatchSerializer
#3869	Add Std dev samp for windowing
#3960	Use a fixed RMM pool size
#3767	Add shim for Databricks 9.1
#3862	Prevent approx_percentile aggregate from being split between CPU and GPU
#3871	Add integration test for RLike with embedded null in input
#3968	Allow null character in regexp_replace pattern
#3821	Support ORC write Map column
#3991	Fix aggregator jar copy logic
#3973	Add shims for Apache Spark 3.2.1-SNAPSHOT builds
#3967	Bring back AST support for BNLJ inner joins
#3947	Enable rlike tests on databricks
#3981	Replace tasks w/ target of maven-antrun-plugin in udf-example
#3976	Replace long artifact lists with an ant loop
#3972	Revert udf-examples dependency change to restore test build phase
#3978	Update aggregator jar name in databricks deploy script
#3965	Add how-to resolve auto-merge conflict [skip ci]
#3963	Add a dedicated RapidsConf option to tolerate GpuOverrides apply failures
#3923	Prepare for 3.2.1 shim, various shim build fixes and improvements
#3969	add doc on using compute-sanitizer
#3964	Qualification tool: Catch exception for invalid regex patterns
#3961	Avoid using HostColumnarToGpu for nested types
#3910	Refactor the aggregate API
#3897	Support running CPU based UDF efficiently
#3950	Fix failed auto-merge #3939
#3946	Document compatability of operations with side effects.
#3945	Update udf-examples dependencies to use dist jar
#3938	remove GDS alignment code
#3943	Add artifact revisions check for nightly tests [skip ci]
#3933	Profiling tool: Print potential problems
#3926	Add zip unzip to integration tests dockerfiles [skip ci]
#3757	Update to nvcomp-2.x JNI APIs
#3922	Stop using -U in build merges aggregator jars of nightly [skip ci]
#3907	Add version properties to integration tests modules
#3912	Stop using -U in the build that merges all aggregator jars
#3909	Fix warning when catching all throwables in GpuOverrides
#3766	Use JCudfSerialization to deserialize a table to host columns
#3820	Advertise CPU orderingSatisfies
#3858	update emr 6.4 getting started doc and pic[skip ci]
#3899	Fix sample test cases
#3896	Xfail the sample tests temporarily
#3848	Fix binary-dedupe failures and improve its performance on macOS
#3867	Disable rlike integration tests on Databricks
#3850	Add explain Plugin API for CPU plan
#3868	Fix incorrect schema of nested types of union - audit SPARK-36673
#3860	Add unit test for GpuKryoRegistrator
#3847	Add Running Qualification App API
#3861	Revert "Fix typo in nightly deploy project list (#3853)" [skip ci]
#3796	Add Rlike support
#3856	Fix not found: type PoissonDistribution in databricks build
#3853	Fix typo in nightly deploy project list
#3831	Support decimal type in ORC writer
#3789	GPU sample exec
#3846	Include pluginRepository for cdh build
#3819	Qualification tool: Detect RDD Api's in SQL plan
#3835	Minor cleanup: do not set cuda stream to null
#3845	Include 'DB_SHIM_NAME' from Databricks jar path to fix nightly deploy [skip ci]
#3523	Interpolate spark.version.classifier in build.dir
#3813	Change `nullOnDivideByZero` from runtime parameter to aggregate expression for `stddev` and `variance` aggregations
#3791	Add audit script to get list of commits from Apache Spark master branch
#3744	Add developer documentation for setting up Microk8s [skip ci]
#3817	Fix auto-merge conflict 3816 [skip ci]
#3804	Missing statistics in GpuBroadcastNestedLoopJoin
#3799	Optimize out bounds checking for joins when the gather map has only valid entries
#3801	Update premerge to use the combined snapshots jar
#3696	Support nested types in ORC writer
#3790	Fix overflow when casting integral to neg scale decimal
#3779	Enable some union of structs tests that were marked xfail
#3787	Fix auto-merge conflict 3786 from branch-21.10 [skip ci]
#3782	Fix auto-merge conflict 3781 [skip ci]
#3778	Remove extra ParquetMaterializer.scala file
#3773	Restore disabled ORC and Parquet tests
#3714	Qualification tool: Error handling while processing large event logs
#3758	Temporarily disable timestamp read tests for Parquet and ORC
#3748	Fix merge conflict with branch-21.10
#3700	CollectSet supports structs
#3740	Throw Exception if failure to load ParquetCachedBatchSerializer class
#3726	Replace Class.forName with ShimLoader.loadClass
#3690	Added support for Array[Struct] to GpuCreateArray
#3728	Qualification tool: Fix bug to process correct listeners
#3734	Fix squashed merge from #3725
#3725	Fix merge conflict with branch-21.10
#3680	cudaMalloc UCX bounce buffers when async allocator is used
#3681	Clean up and document metrics
#3674	Move file TestingV2Source.Scala
#3617	Update Version to 21.12.0-SNAPSHOT
#3612	Add support for nested types as non-key columns on joins
#3619	Added support for Array of Structs

Release 21.10

Features


#1601	[FEA] Support AggregationFunction StddevSamp
#3223	[FEA] Rework the shim layer to robustly handle ABI and API incompatibilities across Spark releases
#13	[FEA] Percentile support
#3606	[FEA] Support approx_percentile on GPU with decimal type
#3552	[FEA] extend allowed datatypes for add and multiply in ANSI mode
#3450	[FEA] test the UCX shuffle with the new build changes
#3043	[FEA] Qualification tool: Add support to filter specific configuration values
#3413	[FEA] Add in support for transform_keys
#3297	[FEA] ORC reader supports reading Map columns.
#3367	[FEA] Support GpuRowToColumnConverter on BinaryType
#3380	[FEA] Support CollectList/CollectSet on nested input types in GroupBy aggregation
#1923	[FEA] Fall back to the CPU when LEAD/LAG wants to IGNORE NULLS
#3044	[FEA] Qualification tool: Report the nested data types
#3045	[FEA] Qualification tool: Report the write data formats.
#3224	[FEA] Add maven compile/package plugin executions, one for each supported Spark dependency version
#3047	[FEA] Profiling tool: Structured output format
#2877	[FEA] Support HashAggregate on struct and nested struct
#2916	[FEA] Support GpuCollectList and GpuCollectSet as TypedImperativeAggregate
#463	[FEA] Support NESTED_SCHEMA_PRUNING_ENABLED for ORC
#1481	[FEA] ORC Predicate pushdown for Nested fields
#2879	[FEA] ORC reader supports reading Struct columns.
#27	[FEA] test current_date and current_timestamp
#3229	[FEA] Improve CreateMap to support multiple key and value expressions
#3111	[FEA] Support conditional nested loop joins
#3177	[FEA] Support decimal type in ORC reader
#3014	[FEA] Add initial support for CreateMap
#3110	[FEA] Support Map as input to explode and pos_explode
#3046	[FEA] Profiling tool: Scale to run large number of event logs.
#3156	[FEA] Support casting struct to struct
#2876	[FEA] Support joins(SHJ and BHJ) on struct as join key with nested struct in the selected column list
#68	[FEA] support StringRepeat
#3042	[FEA] Qualification tool: Add conjunction and disjunction filters.
#2615	[FEA] support collect_list and collect_set as groupby aggregation
#2943	[FEA] Support PreciseTimestampConversion when using windowing function
#2878	[FEA] Support Sort on nested struct
#2133	[FEA] Join support for passing MapType columns along when not join keys
#3041	[FEA] Qualification tool: Add filters based on Regex and user name.
#576	[FEA] Spark 3.1 orc nested predicate pushdown support

Performance


#3651	[DOC] Point users to UCX 1.11.2
#2370	[FEA] RAPIDS Shuffle Manager enable/disable config
#2923	[FEA] Move to dispatched binops instead of JIT binops

Bugs Fixed


#3929	[BUG] published rapids-4-spark dist artifact references aggregator
#3837	[BUG] Spark-rapids v21.10.0 release candidate jars failed on the OSS validation check.
#3769	[BUG] dedupe fails with find: './parallel-world/spark301/ ...' No such file or directory
#3783	[BUG] spark-rapids v21.10.0 release build failed on script "dist/scripts/binary-dedupe.sh"
#3775	[BUG] Hash aggregate with structs crashes with IllegalArgumentException
#3704	[BUG] Executor-side ClassCastException when testing with Spark 3.2.1-SNAPSHOT in k8s environment
#3760	[BUG] Databricks class cast exception failure
#3736	[BUG] Crossjoin performance degraded a lot on RAPIDS 21.10 snapshot
#3369	[BUG] UDF compiler can cause crashes with unexpected class input
#3713	[BUG] AQE shuffle coalesce optimization is broken with Spark 3.2
#3720	[BUG] Qualification tool warnings
#3718	[BUG] plugin failing to build for CDH due to missing dependency
#3653	[BUG] Issue seen with AQE on in Q5 (possibly others) using Spark 3.2 rc3
#3686	[BUG] binary-dedupe doesn't fail the build on errors
#3520	[BUG] Scaladoc warnings emitted during build
#3516	[BUG] MultiFileParquetPartitionReader can fail while trying to write the footer
#3648	[BUG] test_cast_decimal_to failing in databricks 7.3
#3670	[BUG] mvn test failed compiling rapids-4-spark-tests-next-spark_2.12
#3640	[BUG] q82 regression after #3288
#3642	[BUG] Shims improperly overridden
#3611	[BUG] test_no_fallback_when_ansi_enabled failed in databricks
#3601	[BUG] Latest 21.10 snapshot jars failing with java.lang.ClassNotFoundException: com.nvidia.spark.rapids.ColumnarRdd with XGBoost
#3589	[BUG] Latest 21.10 snapshot jars failing with java.lang.ClassNotFoundException: com.nvidia.spark.ExclusiveModeGpuDiscoveryPlugin
#3424	[BUG] Aggregations in ANSI mode do not detect overflows
#3592	[BUG] Failed to find data source: com.nvidia.spark.rapids.tests.datasourcev2.parquet.ArrowColumnarDataSourceV2
#3580	[BUG] Class deduplication pulls wrong class for ProxyRapidsShuffleInternalManagerBase
#3331	[BUG] Failed to read file into buffer in `CuFile.readFromFile` in gds standalone test
#3376	[BUG] Unit test failures in Spark 3.2 shim build
#3382	[BUG] Support years with up to 7 digits when casting from String to Date in Spark 3.2
#3266	CDP - Flakiness in JoinSuite in Integration tests
#3415	[BUG] Fix regressions in WindowFunctionSuite with Spark 3.2.0
#3548	[BUG] GpuSum overflow on 3.2.0+
#3472	[BUG] GpuAdd and GpuMultiply do not include failOnError
#3502	[BUG] Spark 3.2.0 TimeAdd/TimeSub fail due to new DayTimeIntervalType
#3511	[BUG] "Sequence" function fails with "java.lang.UnsupportedOperationException: Not supported on UnsafeArrayData"
#3518	[BUG] Nightly tests failed with RMM outstanding allocations on shutdown
#3383	[BUG] ParseDateTime should not support special dates with Spark 3.2
#3384	[BUG] AQE does not work with Spark 3.2 due to unrecognized GPU partitioning
#3478	[BUG] CastOpSuite and ParseDateTimeSuite failures spark 302 and others
#3495	Fix shim override config
#3482	[BUG] ClassNotFound error when running a job
#1867	[BUG] In Spark 3.2.0 and above dynamic partition pruning and AQE are not mutually exclusive
#3468	[BUG] GpuKryoRegistrator ClassNotFoundException
#3488	[BUG] databricks 8.2 runtime build failed
#3429	[BUG] test_sortmerge_join_struct_mixed_key_with_null_filter LeftSemi/LeftAnti fails
#3400	[BUG] Canonicalized GPU plans sometimes not consistent when using Spark 3.2
#3440	[BUG] Followup comments from PR3411
#3372	[BUG] 3.2.0 shim: ShuffledBatchRDD.scala:141: match may not be exhaustive.
#3434	[BUG] Fix the unit test failure of KnownNotNull in Scala UDF for Spark 3.2
#3084	[AUDIT] [SPARK-32484][SQL] Fix log info BroadcastExchangeExec.scala
#3463	[BUG] 301+-nondb is named incorrectly
#3435	[BUG] tools - test dsv1 complex and decimal test fails
#3388	[BUG] maven scalastyle checks don't appear to work for alterneate source directories
#3416	[BUG] Resource cleanup issues with Spark 3.2
#3339	[BUG] Databricks test fails test_hash_groupby_collect_partial_replace_fallback
#3375	[BUG] SPARK-35742 Replace semanticEquals with canonicalize
#3334	[BUG] UCX join_test FAILED on spark standalone
#3058	[BUG] GPU ORC reader complains errors when specifying columns that do not exist in file schema.
#3385	[BUG] misc_expr_test FAILED on Dataproc
#2052	[BUG] Spark 3.2.0 test fails due to SPARK-34906 Refactor TreeNode's children handling methods into specialized traits
#3401	[BUG] Qualification tool failed with java.lang.ArrayIndexOutOfBoundsException
#3333	[BUG]Mortgage ETL input_file_name is not correct when using CPU's CsvScan
#3391	[BUG] UDF example build fail
#3379	[BUG] q93 failed w/ UCX
#3364	[BUG] analysis tool cannot handle a job with no tasks.
#3235	Classes directly in Apache Spark packages
#3237	BasicColumnWriteJobStatsTracker might be affected by spark change SPARK-34399
#3134	[BUG] Add more checkings before coalescing ORC files
#3324	[BUG] Databricks builds failing with missing dependency issue
#3244	[BUG] join_test LeftAnti failing on Databricks
#3268	[BUG] CDH ParquetCachedBatchSerializer fails to build due to api change in VectorizedColumnReader
#3305	[BUG] test_case_when failed on Databricks 7.3 nightly build
#3139	[BUG] case when on some nested types can produce a crash
#3253	[BUG] ClassCastException for unsupported TypedImperativeAggregate functions
#3256	[BUG] udf-examples native build broken
#3271	[BUG] Databricks 301 shim compilation error
#3255	[BUG] GpuRunningWindowExecMeta is missing ExecChecks for partitionSpec in databricks runtime
#3222	[BUG] `test_running_window_function_exec_for_all_aggs` failed in the UCX EGX run
#3195	[BUG] failures parquet_test test:read_round_trip
#3176	[BUG] test_window_aggs_for_rows_collect_list[IGNORE_ORDER({'local': True})] FAILED on EGX Yarn cluster
#3187	[BUG] NullPointerException in SLF4J on startup
#3166	[BUG] Unable to build rapids-4-spark jar from source due to missing 3.0.3-SNAPSHOT for spark-sql
#3131	[BUG] hash_aggregate_test TypedImperativeAggregate tests failed
#3147	[BUG] window_function_test.py::test_window_ride_along failed in databricks runtime
#3094	[BUG] join_test.py::test_sortmerge_join_with_conditionals failed in databricks 8.2 runtime
#3078	[BUG] test_hash_join_map, test_sortmerge_join_map failed in databricks runtime
#3059	[BUG] orc_test:test_pred_push_round_trip failed

PRs


#3940	Update changelog [skip ci]
#3930	Dist artifact with provided aggregator dependency
#3918	Update changelog [skip ci]
#3906	Doc updated for v2110[skip ci]
#3840	Update changelog [skip ci]
#3838	Update deploy script [skip ci]
#3827	Update changelog 21.10 to latest [skip ci]
#3808	Rewording qualification and profiling tools doc files[skip ci]
#3815	Correct 21.10 docs such as PCBS related FAQ [skip ci]
#3807	Update 21.10.0 release doc [skip ci]
#3800	Update approximate percentile documentation
#3810	Update to include Spark 3.2.0 in nosnapshots target so it gets released officially.
#3806	Update spark320.version to 3.2.0
#3795	Reduce usage of escaping in xargs
#3785	[BUG] Update cudf version in version-dev script [skip ci]
#3771	Update cudfjni version to 21.10.0
#3777	Ignore nullability when checking for need to cast aggregation input
#3763	Force parallel world in Shim caller's classloader
#3756	Simplify shim classloader logic
#3746	Avoid using AST on inner joins and avoid coalesce after nested loop join filter
#3719	Advertise CPU sort order and partitioning expressions to Catalyst
#3737	Add note referencing known issues in approx_percentile implementation
#3729	Update to ucx 1.11.2 for 21.10
#3711	Surface problems with overrides and fallback
#3722	CDH build stopped working due to missing jars in maven repo
#3691	Fix issues with AQE and DPP enabled on Spark 3.2
#3373	Support `stddev` and `variance` aggregations families
#3708	disable percentile approx tests
#3695	Remove duplicated data types for collect_list tests
#3687	Improve dedupe script
#3646	Debug utility method to dump a table or columnar batch to Parquet
#3683	Change deploy scripts for new build system
#3301	Approx Percentile
#3673	Add the Scala jar as an external lib for a linkage warning
#3668	Improve the diagnostics in udf compiler for try-and-catch.
#3666	Recompute Parquet block metadata when estimating footer from multiple file input
#3671	Fix tests-spark310+ dependency
#3663	Add back the tests-spark310+
#3657	Revert "Use cudf to compute exact hash join output row sizes (#3288)"
#3643	Properly override Shims for int96Rebase
#3645	Verify unshimmed classes are bitwise-identical
#3650	Fix dist copy dependencies
#3649	Add ignore_order to other fallback tests for the aggregate
#3631	Change premerge to build all Spark versions
#3630	Fix CDH Build
#3636	Change nightly build to not deploy dist for each classifier version [skip ci]
#3632	Revert disabling of ctas test
#3628	Fix 313 ShuffleManager build
#3618	Update changelog script to strip ambiguous annotation [skip ci]
#3626	Add in support for casting decimal to other number types
#3615	Ignore order for the test_no_fallback_when_ansi_enabled
#3602	Dedupe proxy rapids shuffle manager byte code
#3330	Support `int96RebaseModeInWrite` and `int96RebaseModeInRead`
#3438	Parquet read unsigned int: uint8, uin16, uint32
#3607	com.nvidia.spark.rapids.ColumnarRdd not exposed to user for XGBoost
#3566	Enable String Array Max and Min
#3590	Unshim ExclusiveModeGpuDiscoveryPlugin
#3597	ANSI check for aggregates
#3595	Update the overflow check algorithm for Subtract
#3588	Disable test_non_empty_ctas test
#3577	Commonize more shim module files
#3594	Fix nightly integration test script for specfic artifacts
#3544	Add test for nested grouping sets, rollup, cube
#3587	Revert shared class list modifications in PR#3545
#3570	ANSI Support for Abs, UnaryMinus, and Subtract
#3574	Add in ANSI date time fallback
#3578	Deploy all of the classifier versions of the jars [skip ci]
#3569	Add commons-lang3 dependency to tests
#3568	Enable 3.2.0 unit test in premerge and nightly
#3559	Commonize shim module join and shuffle files
#3565	Auto-dedupe ASM-relocated shim dependencies
#3531	Fall back to the CPU for date/time parsing we cannot support yet
#3561	Follow on to ANSI Add
#3557	Add IDEA profile switch workarounds
#3504	Fix reserialization of broadcasted tables
#3556	Fix databricks test.sh script for passing spark shim version
#3545	Dynamic class file deduplication across shims in dist jar build
#3551	Fix window sum overflow for 3.2.0+
#3537	GpuAdd supports ANSI mode.
#3533	Define a SPARK_SHIM_VER to pick up specific rapids-4-spark-integration-tests jars
#3547	Range window supports DayTime on 3.2+
#3534	Fix package name and sql string issue for GpuTimeAdd
#3536	Enable auto-merge from branch 21.10 to 21.12 [skip ci]
#3521	Qualification tool: Report nested complex types in Potential Problems and improve write csv identification.
#3507	TimeAdd supports DayTimeIntervalType
#3529	Support UnsafeArrayData in scalars
#3528	Update NOTICE copyrights to 2021
#3527	Ignore CBO tests that fail against Spark 3.2.0
#3439	Stop parsing special dates for Spark 3.2+
#3524	Update hashing to normalize -0.0 on 3.2+
#3508	Auto abort dup pre-merge builds [skip ci]
#3501	Add limitations for Databricks doc
#3517	Update empty CTAS testing to avoid Hive if possible
#3513	Allow spark320 tests to run with 320 or 321
#3493	Initialze RAPIDS Shuffle Manager at driver/executor startup
#3496	Update parse date to leverage cuDF support for single digit components
#3454	Catch UDF compiler exceptions and fallback to CPU
#3505	Remove doc references to cudf JIT
#3503	Have average support nulls for 3.2.0
#3500	Fix GpuSum type to match resultType
#3485	Fix regressions in cast from string to date and timestamp
#3487	Add databricks build tests to pre-merge CI [skip ci]
#3497	Re-enable spark.rapids.shims-provider-override
#3499	Fix Spark 3.2.0 test_div_by_zero_ansi failures
#3418	Qualification tool: Add filtering based on configuration parameters
#3498	Update the scala repl loader to avoid issues with broadcast.
#3479	Test with Spark 3.2.1-SNAPSHOT
#3474	Build fixes and IDE instructions
#3460	Add DayTimeIntervalType/YearMonthIntervalType support
#3491	Shim GpuKryoRegistrator
#3489	Fix 311 databricks shim for AnsiCastOpSuite failures
#3456	Fallback to CPU when datasource v2 enables RuntimeFiltering
#3417	Adds pre/post steps for merge and update aggregate
#3431	Reinstate test_sortmerge_join_struct_mixed_key_with_null_filter
#3477	Update supported docs to clarify casting floating point to string
#3447	Add CUDA async memory resource as an option
#3473	Create non-shim specific version of ParquetCachedBatchSerializer
#3471	Fix canonicalization of GpuScalarSubquery
#3480	Temporarily disable failing cast string to date tests
#3377	Fix AnsiCastOpSuite failures with Spark 3.2
#3467	Update docs to better describe support for floating point aggregation and NaNs
#3459	Use Shims v2 for ShuffledBatchRDD
#3457	Update the children unpacking pattern for GpuIf.
#3464	Add test for empty relation propagation
#3458	Fix log info GPU BroadcastExchangeExec
#3466	Databricks build fixes for missing shouldFailDivOverflow and removal of needed imports
#3465	Fix name of 301+-nondb directory to stop at Spark 3.2.0
#3452	Enable AQE/DPP test for Spark 3.2
#3436	Qualification tool: Update expected result for test
#3455	Decrease pre_merge_ci parallelism to 4 and reordering time-consuming tests
#3420	`IntegralDivide` throws an exception on overflow in ANSI mode
#3433	Batch scalastyle checks across all modules upfront
#3453	Fix spark-tests script for classifier
#3445	Update nightly build to pull Databricks jars
#3446	Format aggregator pom and commonize some configuration
#3444	Add in tests for unaligned parquet pages
#3451	Fix typo in spark-tests.sh
#3443	Remove 301emr shim
#3441	update deploy script for Databricks
#3414	Add in support for transform_keys
#3320	Add AST support for logical AND and logical OR
#3425	Throw an error by default if CREATE TABLE AS SELECT overwrites data
#3422	Stop double closing SerializeBatchDeserializeHostBuffer host buffers when running with Spark 3.2
#3411	Make new build default and combine into dist package
#3368	Extend TagForReplaceMode to adapt Databricks runtime
#3428	Remove commented-out semanticEquals overrides
#3421	Revert to CUDA runtime image for build
#3381	Implement per-shim parallel world jar classloader
#3303	Update to cudf conditional join change that removes null equality argument
#3408	Add leafNodeDefaultParallelism support
#3426	Correct grammar in qualification tool doc
#3423	Fix hash_aggregate tests that leaked configs
#3412	Restore AST conditional join tests
#3403	Fix canonicalization regression with Spark 3.2
#3394	Orc read map
#3392	Support transforming BinaryType between Row and Columnar
#3393	Fill with null columns for the names exist only in read schema in ORC reader
#3399	Fix collect_list test so it covers nested types properly
#3410	Specify number of RDD slices for ID tests
#3363	Add AST support for null literals
#3396	Throw exception on parse error in ANSI mode when casting String to Date
#3315	Add in reporting of time taken to transition plan to GPU
#3409	Use devel cuda image for premerge CI
#3405	Qualification tool: Filter empty strings from Read Schema
#3387	Fallback to the CPU for IGNORE NULLS on lead and lag
#3398	Fix NPE on string repeat when there is no data buffer
#3366	Fix input_file_xxx issue when FileScan is running on CPU
#3397	Add tests for GpuInSet
#3395	Fix UDF native example build
#3389	Bring back setRapidsShuffleManager in the driver side
#3263	Qualification tool: Report write data format and nested types
#3378	Make Dockerfile.cuda consistent with getting-started-kubernetes.md
#3359	UnionExec array and nested array support
#3342	Profiling tool add CSV output option and add new combined mode
#3365	fix databricks builds
#3323	Enable optional Spark 3.2.0 shim build
#3361	Fix databricks 3.1.1 arrow dependency version
#3354	Support HashAggregate on struct and nested struct
#3341	ArrayMax and ArrayMin support plus map_entries, map_keys, map_values
#3356	Support Databricks 3.0.1 with new build profiles
#3344	Move classes out of Apache Spark packages
#3345	Add job commit time to task tracker stats
#3357	Avoid RAT checks on any CSV file
#3355	Add new authorized user to blossom-ci whitelist [skip ci]
#3340	xfail AST nested loop join tests until cudf empty left table bug is fixed
#3276	Use child type in some places to make it more clear
#3346	Mark more tests as premerge_ci_1
#3353	Fix automerge conflict 3349 [skip ci]
#3335	Support Databricks 3.1.1 in new build profiles
#3317	Adds in support for the transform_values SQL function
#3299	Insert buffer converters for TypedImperativeAggregate
#3325	Fix spark version classifier being applied properly
#3288	Use cudf to compute exact hash join output row sizes
#3318	Fix LeftAnti nested loop join missing condition case
#3316	Fix GpuProjectAstExec when projecting only literals
#3262	Re-enable the struct support for the ORC reader.
#3312	Fix inconsistent function name and add backward compatibility support for premerge job [skip ci]
#3319	Temporarily disable cache test except for spark 3.1.1
#3308	Branch 21.10 FAQ update forward compatibility, update Spark and CUDA versions
#3309	Prepare Spark 3.2.0 related changes
#3289	Support for ArrayTransform
#3307	Fix generation of null scalars in tests
#3306	Update guava to be 30.0-jre
#3304	Fix nested cast type checks
#3302	Fix shim aggregator dependencies when snapshot-shims profile provided
#3291	Bump guava from 28.0-jre to 29.0-jre in /tests
#3292	Bump guava from 28.0-jre to 29.0-jre in /integration_tests
#3293	Bump guava from 28.0-jre to 29.0-jre in /udf-compiler
#3294	Update Qualification and Profiling tool documentation for gh-pages
#3282	Test for `current_date`, `current_timestamp` and `now`
#3298	Minor parent pom fixes
#3296	Support map type in case when expression
#3295	Rename pytest 'slow_test' tag as 'premerge_ci_1' to avoid confusion
#3274	Add m2 cache to fast premerge build
#3283	Fix ClassCastException for unsupported TypedImperativeAggregate functions
#3251	CreateMap support for multiple key-value pairs
#3234	Parquet support for MapType
#3277	Build changes for Spark 3.0.3, 3.0.4, 3.1.1, 3.1.2, 3.1.3, 3.1.1cdh and 3.0.1emr
#3275	Improve over-estimating for ORC coalescing reading
#3280	Update project URL to the public doc website
#3285	Qualification tool: Check for metadata being null
#3281	Decrease parallelism for pre-merge pod to avoid potential OOM kill
#3264	Add parallel support to nightly spark standalone tests
#3257	Add maven compile/package plugin executions for Spark302 and Spark301
#3272	Fix Databricks shim build
#3270	Remove reference to old maven-scala-plugin
#3259	Generate docs for AST from checks
#3164	Support Union on Map types
#3261	Fix some typos[skip ci]
#3242	Support for LeftOuter/BuildRight and RightOuter/BuildLeft nested loop joins
#3239	Support decimal type in orc reader
#3258	Add ExecChecks to Databricks shims for RunningWindowFunctionExec
#3230	Initial support for CreateMap on GPU
#3252	Update to new cudf AST API
#3249	Fix typo in Spark311dbShims
#3183	Add TypeSig checks for join keys and other special cases
#3246	Disable test_broadcast_nested_loop_join_condition_missing_count on Databricks
#3241	Split pytest by 'slow_test' tag and run from different k8s pods to reduce premerge job duration
#3184	Support broadcast nested loop join for LeftSemi and LeftAnti
#3236	Fix Scaladoc warnings in GpuScalaUDF and BufferSendState
#2846	default rmm alloc fraction to the max to avoid unnecessary fragmentation
#3231	Fix some resource leaks in GpuCast and RapidsShuffleServerSuite
#3179	Support GpuFirst/GpuLast on more data types
#3228	Fix unreachable code warnings in GpuCast
#3200	Enable a smoke test for UCX in pre-merge
#3203	Fix Parquet test_round_trip to avoid CPU write exception
#3220	Use LongRangeGen instead of IntegerGen
#3218	Add UCX 1.11.0 to the pre-merge Docker image
#3204	Decrease parallelism for pre-merge integration tests
#3212	Fix merge conflict 3211 [skip ci]
#3188	Exclude slf4j classes from the spark-rapids jar
#3189	Disable snapshot shims by default
#3178	Fix hash_aggregate test failures due to TypedImperativeAggregate
#3190	Update GpuInSet for SPARK-35422 changes
#3193	Append res-life to blossom-ci whitelist [skip ci]
#3175	Add in support for explode on maps
#3171	Refine upload log stage naming in workflow file [skip ci]
#3173	Profile tool: Fix reporting app contains Dataset
#3165	Add optional projection via AST expression evaluation
#3113	Fix order of operations when using mkString in typeConversionInfo
#3161	Rework Profile tool to not require Spark to run and process files faster
#3169	Fix auto-merge conflict 3167 [skip ci]
#3162	Add in more generalized support for casting nested types
#3158	Enable joins on nested structs
#3099	Decimal_128 type checks
#3155	Simple nested additions v2
#2728	Support string `repeat` SQL
#3148	Updated RunningWindow to support extended types too
#3112	Qualification tool: Add conjunction and disjunction filters
#3117	First pass at enabling structs, arrays, and maps for more parts of the plan
#3109	Cudf agg type changes
#2971	Support GpuCollectList and GpuCollectSet as TypedImperativeAggregate
#3107	Add setting to enable/disable RAPIDS Shuffle Manager dynamically
#3105	Add filter in query plan for conditional nested loop and cartesian joins
#3096	add spark311db GpuSortMergeJoinExec conditional joins filter
#3086	Fix Support of MapType in joins on Databricks
#3089	Add filter node in the query plan for conditional joins
#3074	Partial support for time windows
#3061	Support Union on Struct of Map
#3034	Support Sort on nested struct
#3011	Support MapType in joins
#3031	add doc for PR status checks [skip ci]
#3028	Enable parallel build for pre-merge job to reduce overall duration [skip ci]
#3025	Qualification tool: Add regex and username filters.
#2980	Init version 21.10.0
#3000	Merge branch-21.08 to branch-21.10

Release 21.08.1

Bugs Fixed


#3350	[BUG] Qualification tool: check for metadata being null

PRs


#3351	Update changelog for tools v21.08.1 release [skip CI]
#3348	Change tool version to 21.08.1 [skip ci]
#3343	Qualification tool backport: Check for metadata being null (#3285)

Release 21.08

Features


#1584	[FEA] Support rank as window function
#1859	[FEA] Optimize row_number/rank for memory usage
#2976	[FEA] support for arrays in BroadcastNestedLoopJoinExec and CartesianProductExec
#2398	[FEA] `GpuIf` and `GpuCoalesce` supports `ArrayType`
#2445	[FEA] Support literal arrays in case/when statements
#2757	[FEA] Profiling tool display input data types
#2860	[FEA] Minimal support for LEGACY timeParserPolicy
#2693	[FEA] Profiling Tool: Print GDS + UCX related parameters
#2334	[FEA] Record GPU time and Fetch time separately, instead of recording Total Time
#2685	[FEA] Profiling compare mode for table SQL Duration and Executor CPU Time Percent
#2742	[FEA] include App Name from profiling tool output
#2712	[FEA] Display job and stage info in the dot graph for profiling tool
#2562	[FEA] Implement KnownNotNull on the GPU
#2557	[FEA] support sort_array on GPU
#2307	[FEA] Enable Parquet writing for arrays
#1856	[FEA] Create a batch chunking iterator and integrate it with GpuWindowExec

Performance


#866	[FEA] combine window operations into single call
#2800	[FEA] Support ORC small files coalescing reading
#737	[FEA] handle peer timeouts in shuffle
#1590	Rapids Shuffle - UcpListener
#2275	[FEA] UCP error callback deal with cleanup
#2799	[FEA] Support ORC multi-file cloud reading

Bugs Fixed


#3135	[BUG] Regression seen in `concatenate` in NDS with RAPIDS Shuffle Manager enabled
#3017	[BUG] orc_write_test failed in databricks runtime
#3060	[BUG] ORC read can corrupt data when specified schema does not match file schema ordering
#3065	[BUG] window exec tries to do too much on the GPU
#3066	[BUG] Profiling tool generate dot file fails to convert
#3038	[BUG] leak in `getDeviceMemoryBuffer` for the unspill case
#3007	[BUG] data mess up reading from ORC
#3029	[BUG] udf_test failed in ucx standalone env
#2723	[BUG] test failures in CI build (observed in UCX job) after starting to use 21.08
#3016	[BUG] databricks script failed to return correct exit code
#3002	[BUG] writing parquet with partitionBy() loses sort order
#2959	[BUG] Resolve common code source incompatibility with supported Spark versions
#2589	[BUG] RapidsShuffleHeartbeatManager needs to remove executors that are stale
#2964	[BUG] IGNORE ORDER, WITH DECIMALS: [Window] [MIXED WINDOW SPECS] FAILED in spark 3.0.3+
#2942	[BUG] Cache of Array using ParquetCachedBatchSerializer failed with "DATA ACCESS MUST BE ON A HOST VECTOR"
#2965	[BUG] test_round_robin_sort_fallback failed with ValueError: 'a_1' is not in list
#2891	[BUG] Discrepancy in getting count before and after caching
#2972	[BUG] When using timeout option(-t) of qualification tool, it does not print anything in output after timeout.
#2958	[BUG] When AQE=on, SMJ with a Map in SELECTed list fails with "key not found: numPartitions"
#2929	[BUG] No validation of format strings when formatting dates in legacy timeParserPolicy mode
#2900	[BUG] CAST string to float/double produces incorrect results in some cases
#2957	[BUG] Builds failing due to breaking changes in SPARK-36034
#2901	[BUG] `GpuCompressedColumnVector` cannot be cast to `GpuColumnVector` with AQE
#2899	[BUG] CAST string to integer produces incorrect results in some cases
#2937	[BUG] Fix more edge cases when parsing dates in legacy timeParserPolicy
#2939	[BUG] Window integration tests failing with `Lead expected at least 3 but found 0`
#2912	[BUG] Profiling compare mode fails when comparing spark 2 eventlog to spark 3 event log
#2892	[BUG] UCX error `Message truncated` observed with UCX 1.11 RC in Q77 NDS
#2807	[BUG] Use UCP_AM_FLAG_WHOLE_MSG and UCP_AM_FLAG_PERSISTENT_DATA for receive handlers
#2930	[BUG] Profiling tool does not show "Potential Problems" for dataset API in section "SQL Duration and Executor CPU Time Percent"
#2902	[BUG] CAST string to bool produces incorrect results in some cases
#2850	[BUG] "java.io.InterruptedIOException: getFileStatus on s3a://xxx" for ORC reading in Databricks 8.2 runtime
#2856	[BUG] cache of struct does not work on databricks 8.2ML
#2790	[BUG] In Comparison mode health check does not show the application id
#2713	[BUG] profiling tool does not error or warn if incompatible options are given
#2477	[BUG] test_single_sort_in_part is failed in nightly UCX and AQE (no UCX) integration
#2868	[BUG] to_date produces wrong value on GPU for some corner cases
#2907	[BUG] incorrect expression to detect previously set `--master`
#2893	[BUG] TransferRequest request transactions are getting leaked
#120	[BUG] GPU InitCap supports too much white space.
#2786	[BUG][initCap function]There is an issue converting the uppercase character to lowercase on GPU.
#2754	[BUG] cudf_udf tests failed w/ 21.08
#2820	[BUG] Metrics are inconsistent for GpuRowToColumnarToExec
#2710	[BUG] dot file generation can go over the limits of dot
#2772	[BUG] new integration test failures w/ maxFailures=1
#2739	[BUG] CBO causes less efficient plan for NDS q84
#2717	[BUG] CBO forces joins back onto CPU in some cases
#2718	[BUG] CBO falls back to CPU to write to Parquet in some cases
#2692	[BUG] Profiling tool: Add error handling for comparison functions
#2711	[BUG] reused stages should not appear multiple times in dot
#2746	[BUG] test_single_nested_sort_in_part integration test failure 21.08
#2690	[BUG] Profiling tool doesn't properly read rolled log files
#2546	[BUG] Build Failure when building from source
#2750	[BUG] nightly test failed with lists: `testStringReplaceWithBackrefs`
#2644	[BUG] test event logs should be compressed
#2725	[BUG] Heartbeat from unknown executor when running with UCX shuffle in local mode
#2715	[BUG] Part of the plan is not columnar class com.databricks.sql.execution.window.RunningWindowFunc
#2521	[BUG] cudf_udf failed in all spark release intermittently
#1712	[BUG] Scala UDF compiler can decompile UDFs with RAPIDS implementation

PRs


#3216	Update changelog to include download doc update [skip ci]
#3214	Update download and databricks doc for 21.06.2 [skip ci]
#3210	Update 21.08.0 changelog to latest [skip ci]
#3197	Databricks parquetFilters api change in db 8.2 runtime
#3168	Update 21.08 changelog to latest [skip ci]
#3146	update cudf Java binding version to 21.08.2
#3080	Update docs for 21.08 release
#3136	Update tool docs to explain default filesystem [skip ci]
#3128	Fix merge conflict 3126 from branch-21.06 [skip ci]
#3124	Fix merge conflict 3122 from branch-21.06 [skip ci]
#3100	Update databricks 3.0.1 shim to new ParquetFilter api
#3083	Initial CHANGELOG.md update for 21.08
#3079	Remove the struct support in ORC reader
#3062	Fix ORC read corruption when specified schema does not match file order
#3064	Tweak scaladoc to callout the GDS+unspill case in copyBuffer
#3049	Handle mmap exception more gracefully in RapidsShuffleServer
#3067	Update to UCX 1.11.0
#3024	Check validity of any() or all() results that could be null
#3069	Fall back to the CPU on window partition by struct or array
#3068	Profiling tool generate dot file fails on unescaped html characters
#3048	Apply unique committer job ID fix from SPARK-33230
#3050	Updates for google analytics [skip ci]
#3015	Fix ORC read error when read schema reorders file schema columns
#3053	cherry-pick #3028 [skip ci]
#2887	ORC reader supports struct
#3032	Add disorder read schema test case for Parquet
#3022	Add in docs to describe window performance
#3018	[BUG] fix db script hides error issue
#2953	Add in support for rank and dense_rank
#3009	Propagate child output ordering in GpuCoalesceBatches
#2989	Re-enable Array support in Cartesian Joins, Broadcast Nested Loop Joins
#2999	Remove unused configuration setting spark.rapids.sql.castStringToInteger.enabled
#2967	Resolve hidden source incompatibility between Spark30x and Spark31x Shims
#2982	Add FAQ entry for timezone error
#2839	GpuIf and GpuCoalesce support array and struct types
#2987	Update documentation for unsupported edge cases when casting from string to timestamp
#2977	Expire executors from the RAPIDS shuffle heartbeat manager on timeout
#2985	Move tools README to docs/additional-functionality/qualification-profiling-tools.md with some modification
#2992	Remove commented/redundant window-function tests.
#2994	Tweak RAPIDS Shuffle Manager configs for 21.08
#2984	Avoid comparing window range canonicalized plans on Spark 3.0.x
#2970	Put the GPU data back on host before processing cache on CPU
#2986	Avoid struct aliasing in test_round_robin_sort_fallback
#2935	Read the complete batch before returning when selectedAttributes is empty
#2826	CaseWhen supports scalar of list and struct
#2978	enable auto-merge from branch 21.08 to 21.10 [skip ci]
#2946	ORC reader supports list
#2947	Qualification tool: Filter based on timestamp in event logs
#2973	Assert that CPU and GPU row fields match when present
#2974	Qualification tool: fix performance regression
#2948	Remove unnecessary copies of ParquetCachedBatchSerializer
#2968	Fix AQE CustomShuffleReaderExec not seeing ShuffleQueryStageExec
#2969	Make the dir for spark301 shuffle shim match package name
#2933	Improve CAST string to float implementation to handle more edge cases
#2963	Add override getParquetFilters for shim 304
#2956	Profile Tool: make order consistent between runs
#2924	Fix bug when collecting directly from a GPU shuffle query stage with AQE on
#2950	Fix shutdown bugs in the RAPIDS Shuffle Manager
#2922	Improve UCX assertion to show the failed assertion
#2961	Fix ParquetFilters issue
#2951	Qualification tool: Allow app start and app name filtering and test with filesystem filters
#2941	Make test event log compression codec configurable
#2919	Fix bugs in CAST string to integer
#2944	Fix childExprs list for GpuWindowExpression, for Spark 3.1.x.
#2917	Refine GpuHashAggregateExec.setupReference
#2909	Support orc coalescing reading
#2938	Qualification tool: Add negation filter
#2940	qualification tool: add filtering by app start time
#2928	Qualification tool support recognizing decimal operations
#2934	Qualification tool: Add filter based on appName
#2904	Qualification and Profiling tool handle Read formats and datatypes
#2927	Restore aggregation sorted data hint
#2932	Profiling tool: Fix comparing spark2 and spark3 event logs
#2926	GPU Active Messages for all buffer types
#2888	Type check with the information from RapidsMeta
#2903	Fix cast string to bool
#2895	Add in running window optimization using scan
#2859	Add spillable batch caching and sort fallback to hash aggregation
#2898	Add fuzz tests for cast from string to other types
#2881	fix orc readers leak issue for ORC PERFILE type
#2842	Support STRUCT/STRING for LEAD()/LAG()
#2880	Added ParquetCachedBatchSerializer support for Databricks
#2911	Add in ID as sort for Job + Stage level aggregated task metrics
#2914	Profiling tool: add app index to tables that don't have it
#2906	Fix compiler warning
#2890	Fix cast to date bug
#2908	Fixes bad string contains in run_pyspark_from_build
#2886	Use UCP Listener for UCX connections and enable peer error handling
#2875	Add support for timeParserPolicy=LEGACY
#2894	Fixes a JVM leak for UCX TransactionRequests
#2854	Qualification Tool to output only the 'k' highest-ranked or 'k' lowest-ranked applications
#2873	Fix infinite loop in MultiFileCloudPartitionReaderBase
#2838	Replace `toTitle` with `capitalize` for GpuInitCap
#2870	Avoid readers acquiring GPU on next batch query if not first batch
#2882	Refactor window operations to do them in the exec
#2874	Update audit script to clone branch-3.2 instead of master
#2843	Qualification/Profiling tool add tests for Spark2 event logs
#2828	add cloud reading for orc
#2721	Check-list for corner cases in testing.
#2675	Support for Decimals with negative scale for Parquet Cached Batch Serializer
#2849	Update release notes to include qualification and profiling tool
#2852	Fix hash aggregate tests leaking configs into other tests
#2845	Split window exec into multiple stages if needed
#2853	Tag last batch when coalescing
#2851	Fix build failure - update ucx profiling test to fix parameter type to getEventLogInfo
#2785	Profiling tool: Print UCX and GDS parameters
#2840	Fix Gpu -> GPU
#2844	Document Qualification tool Spark requirements
#2787	Add metrics definition link to tool README.md[skip ci]
#2841	Add a threadpool to Qualification tool to process logs in parallel
#2833	Stop running so many versions of Spark unit tests for premerge
#2837	Append new authorized user to blossom-ci whitelist [skip ci]
#2822	Rewrite Qualification tool for better performance
#2823	Add semaphoreWaitTime and gpuOpTime for GpuRowToColumnarExec
#2829	Fix filtering directories on compression extension match
#2720	Add metrics documentation to the tuning guide
#2816	Improve some existing collectTime handling
#2821	Truncate long plan labels and refer to "print-plans"
#2827	Update cmake to build udf native [skip ci]
#2793	Report equivilant stages/sql ids as a part of compare
#2810	Use SecureRandom for UCPListener TCP port choice
#2798	Mirror apache repos to urm
#2788	Update the type signatures for some expressions
#2792	Automatically set spark.task.maxFailures and local[*, maxFailures]
#2805	Revert "Use UCX Active Messages for all shuffle transfers (#2735)"
#2796	show disk bytes spilled when GDS spill is enabled
#2801	Update pre-merge to use reserved_pool [skip ci]
#2795	Improve CBO debug logging
#2794	Prevent integer overflow when estimating data sizes in cost-based optimizer
#2784	Make spark303 shim version w/o snapshot and add shim layer for spark304
#2744	Cost-based optimizer: Implement simple cost model that demonstrates benefits with NDS queries
#2762	Profiling tool: Update comparison mode output format and add error handling
#2761	Update dot graph to include stages and remove some duplication
#2760	Add in application timeline to profiling tool
#2735	Use UCX Active Messages for all shuffle transfers
#2732	qualification and profiling tool support rolled and compressed event logs for CSPs and Apache Spark
#2768	Make window function test results deterministic.
#2769	Add developer documentation for Adaptive Query Execution
#2532	date_format should not suggest enabling incompatibleDateFormats for formats we cannot support
#2743	Disable dynamicAllocation and set maxFailures to 1 in integration tests
#2749	Revert "Add in support for lists in some joins (#2702)"
#2181	abstract the parquet coalescing reading
#2753	Merge branch-21.06 to branch-21.08 [skip ci]
#2751	remove invalid blossom-ci users [skip ci]
#2707	Support `KnownNotNull` running on GPU
#2747	Fix num_slices for test_single_nested_sort_in_part
#2729	fix 301db-shim typecheck typo
#2726	Fix local mode starting RAPIDS shuffle heartbeats
#2722	Support aggregation on NullType in RunningWindowExec
#2719	Avoid executing child plan twice in CoalesceExec
#2586	Update metrics use in GpuUnionExec and GpuCoalesceExec
#2716	Add file size check to pre-merge CI
#2554	Upload build failure log to Github for external contributors access
#2596	Initial running window memory optimization
#2702	Add in support for arrays in BroadcastNestedLoopJoinExec and CartesianProductExec
#2699	Add a pre-commit hook to reject large files
#2700	Set numSlices and use parallelize to build dataframe for partition-se…
#2548	support collect_set in rolling window
#2661	Make tools inherit common dependency versions from parent pom
#2668	Remove CUDA 10.x from getting started guide [skip ci]
#2676	Profiling tool: Print Job Information in compare mode
#2679	Merge branch-21.06 to branch-21.08 [skip ci]
#2677	Add pre-merge independent stage timeout [skip ci]
#2616	support GpuSortArray
#2582	support parquet write arrays
#2609	Fix automerge failure from branch-21.06 to branch-21.08
#2570	Added nested structs to UnionExec
#2581	Fix merge conflict 2580 [skip ci]
#2458	Split batch by key for window operations
#2565	Merge branch-21.06 into branch-21.08
#2563	Document: git commit twice when copyright year updated by hook
#2561	Fixing the merge of 21.06 to 21.08 for comment changes in Profiling tool
#2558	Fix cdh shim version in 21.08 [skip ci]
#2543	Init branch-21.08

Release 21.06.2

Bugs Fixed


#3191	[BUG] Databricks parquetFilters build failure in db 8.2 runtime

PRs


#3209	Update 21.06.2 changelog [skip ci]
#3208	Update rapids plugin version to 21.06.2 [skip ci]
#3207	Disable auto-merge from 21.06 to 21.08 [skip ci]
#3205	Branch 21.06 databricks update [skip ci]
#3198	Databricks parquetFilters api change in db 8.2 runtime

Release 21.06.1

Bugs Fixed


#3098	[BUG] Databricks parquetFilters build failure

PRs


#3127	Update CHANGELOG for the release v21.06.1 [skip ci]
#3123	Update rapids plugin version to 21.06.1 [skip ci]
#3118	Fix databricks 3.0.1 for ParquetFilters api change
#3119	Branch 21.06 databricks update [skip ci]

Release 21.06

Features


#2483	[FEA] Profiling and qualification tool
#951	[FEA] Create Cloudera shim layer
#2481	[FEA] Support Spark 3.1.2
#2530	[FEA] Add support for Struct columns in CoalesceExec
#2512	[FEA] Report gpuOpTime not totalTime for expand, generate, and range execs
#63	[FEA] support ConcatWs sql function
#2501	[FEA] Add support for scalar structs to named_struct
#2286	[FEA] update UCX documentation for branch 21.06
#2436	[FEA] Support nested types in CreateNamedStruct
#2461	[FEA] Report gpuOpTime instead of totalTime for project, filter, window, limit
#2465	[FEA] GpuFilterExec should report gpuOpTime not totalTime
#2013	[FEA] Support concatenating ArrayType columns
#2425	[FEA] Support for casting array of floats to array of doubles
#2012	[FEA] Support Window functions(lead & lag) for ArrayType
#2011	[FEA] Support creation of 2D array type
#1582	[FEA] Allow StructType as input and output type to InMemoryTableScan and InMemoryRelation
#216	[FEA] Range window-functions must support non-timestamp order-by expressions
#2390	[FEA] CI/CD for databricks 8.2 runtime
#2273	[FEA] Enable struct type columns for GpuHashAggregateExec
#20	[FEA] Support out of core joins
#2160	[FEA] Support Databricks 8.2 ML Runtime
#2330	[FEA] Enable hash partitioning with arrays
#1103	[FEA] Support date_format on GPU
#1125	[FEA] explode() can take expressions that generate arrays
#1605	[FEA] Support sorting on struct type keys

Performance


#1445	[FEA] GDS Integration
#1588	Rapids shuffle - UCX active messages
#2367	[FEA] CBO: Implement costs for memory access and launching kernels
#2431	[FEA] CBO should show benefits with q24b with decimals enabled

Bugs Fixed


#2652	[BUG] No Job Found. Exiting.
#2659	[FEA] Group profiling tool "Potential Problems"
#2680	[BUG] cast can throw NPE
#2628	[BUG] failed to build plugin in databricks runtime 8.2
#2605	[BUG] test_pandas_map_udf_nested_type failed in Yarn integration
#2622	[BUG] compressed event logs are not processed
#2478	[BUG] When tasks complete, cancel pending UCX requests
#1953	[BUG] Could not allocate native memory when running DLRM ETL with --output_ordering input on A100
#2495	[BUG] scaladoc warning GpuParquetScan.scala:727 "discarding unmoored doc comment"
#2368	[BUG] Mismatched number of columns while performing `GpuSort`
#2407	[BUG] test_round_robin_sort_fallback failed
#2497	[BUG] GpuExec failed to find metric totalTime in databricks env
#2473	[BUG] enable test_window_aggs_for_rows_lead_lag_on_arrays and make the order unambiguous
#2489	[BUG] Queries with window expressions fail when cost-based optimizer is enabled
#2457	[BUG] test_window_aggs_for_rows_lead_lag_on_arrays failed
#2371	[BUG] Performance regression for crossjoin on 0.6 comparing to 0.5
#2372	[BUG] FAILED ../../src/main/python/udf_cudf_test.py::test_window
#2404	[BUG] test_hash_pivot_groupby_nan_fallback failed on Dataproc
#2474	[BUG] when ucp listener enabled we bind 16 times always
#2427	[BUG] test_union_struct_missing_children[(Struct(not_null) failed in databricks310 and spark 311
#2455	[BUG] CaseWhen crashes on literal arrays
#2421	[BUG] NPE when running mapInPandas Pandas UDF in 0.5GA
#2428	[BUG] Intermittent ValueError in test_struct_groupby_count
#1628	[BUG] TPC-DS-like query 24a and 24b at scale=3TB fails with OOM
#2276	[BUG] SPARK-33386 - ansi-mode changed ElementAt/Elt/GetArray behavior in Spark 3.1.1 - fallback to cpu
#2309	[BUG] legacy cast of a struct column to string with a single nested null column yields null instead of '[]'
#2315	[BUG] legacy struct cast to string crashes on a two field struct
#2406	[BUG] test_struct_groupby_count failed
#2378	[BUG] java.lang.ClassCastException: GpuCompressedColumnVector cannot be cast to GpuColumnVector
#2355	[BUG] convertDecimal64ToDecimal32Wrapper leaks ColumnView instances
#2346	[BUG] segfault when using `UcpListener` in TCP-only setup
#2364	[BUG] qa_nightly_select_test.py::test_select integration test fails
#2302	[BUG] Int96 are not being written as expected
#2359	[BUG] Alias is different in spark 3.1.0 but our canonicalization code doesn't handle
#2277	[BUG] spark.sql.legacy.parquet.datetimeRebaseModeInRead=CORRECTED or LEGACY still fails to read LEGACY date from parquet
#2320	[BUG] TypeChecks diagnostics outputs column ids instead of unsupported types
#2238	[BUG] Unnecessary to cache the batches that will be sent to Python in `FlatMapGroupInPandas`.
#1811	[BUG] window_function_test.py::test_multi_types_window_aggs_for_rows_lead_lag[partBy failed

PRs


#2817	Update changelog for v21.06.0 release [skip ci]
#2806	Noted testing for A10, noted that min driver ver is HW specific
#2797	Update documentation for InitCap incompatibility
#2774	Update changelog for 21.06 release [skip ci]
#2770	[Doc] add more for Alluxio page [skip ci]
#2745	Add link to Mellanox RoCE documentation and mention --without-ucx installation option
#2740	Update cudf Java bindings to 21.06.1
#2664	Update changelog for 21.06 release [skip ci]
#2697	fix GDS spill bug when copying from the batch write buffer
#2691	Update properties to check if table there
#2687	Remove CUDA 10.x from getting started guide (#2668)
#2686	Profiling tool: Print Job Information in compare mode
#2657	Print CPU and GPU output when _assert_equal fails to help debug given…
#2681	Avoid NPE when casting empty strings to ints
#2669	Fix multiple problems reported and improve error handling
#2666	[DOC]Update custom image guide in GCP dataproc to reduce cluster startup time
#2665	Update docs to move RAPIDS Shuffle out of beta [skip ci]
#2671	Clean profiling&qualification tool README
#2673	Profiling tool: Enable tests and update compressed event log
#2672	Update cudfjni dependency version to 21.06.0
#2663	Qualification tool - add in estimating the App end time when the event log missing application end event
#2600	Accelerate `RunningWindow` queries on GPU
#2651	Profiling tool - fix reporting contains dataset when sql time 0
#2623	Fixed minor mistakes in documentation
#2631	Update docs for Databricks 8.2 ML
#2638	Add an init script for databricks 7.3ML with CUDA11.0 installed
#2643	Profiling tool: Health check follow on
#2640	Add physical plan to the dot file as the graph label
#2637	Fix databricks for 3.1.1
#2577	Update download.md and FAQ.md for 21.06.0
#2636	Profiling tool - Fix file writer for generating dot graphs, supporting writing sql plans to a file, change output to subdirectory
#2625	Exclude failed jobs/queries from Qualification tool output
#2626	Enable processing of compressed Spark event logs
#2632	Profiling tool: Add support for health check.
#2627	Ignore order for map udf test
#2620	Change aggregation of executor CPU and run time for Qualification tool to speed up query
#2618	Correct an issue for README for tools and also correct s3 solution in Args.scala
#2612	Profiling tool, add in job to stage, duration, executor cpu time, fix writing to HDFS
#2614	change rapids-4-spark-tools directory to tools in deploy script [skip ci]
#2611	Revert "disable cudf_udf tests for #2521"
#2604	Profile/qualification tool error handling improvements and support spark < 3.1.1
#2598	Rename rapids-4-spark-tools directory to tools
#2576	Add filter support for qualification and profiling tool.
#2603	Add the doc for -g option of the profiling tool.
#2594	Change the README of the qualification and profiling tool to match the current version.
#2591	Implement test for qualification tool sql metric aggregates
#2590	Profiling tool support for collection and analysis
#2587	Handle UCX connection timeouts from heartbeats more gracefully
#2588	Fix package name
#2574	Add Qualification tool support
#2571	Change test_single_sort_in_part to print source data frame on failure
#2569	Remove -SNAPSHOT in documentation in preparation for release
#2429	Change RMM_ALLOC_FRACTION to represent percentage of available memory, rather than total memory, for initial allocation
#2553	Cancel requests that are queued for a client/handler on error
#2566	expose unspill config option
#2460	align GDS reads/writes to 4 KiB
#2515	Remove fetchTime and standardize on collectTime
#2523	Not compile RapidsUDF when udf compiler is enabled
#2538	Fixed code indentation in ParquetCachedBatchSerializer
#2559	Release profiling tool jar to maven central
#2423	Add cloudera shim layer
#2520	Add event logs for integration tests
#2525	support interval.microseconds for range window TimeStampType
#2536	Don't do an extra shuffle in some TopN cases
#2508	Refactor the code for conditional expressions
#2542	enable auto-merge from 21.06 to 21.08 [skip ci]
#2540	Update spark 312 shim, and Add spark 313-SNAPSHOT shim
#2539	disable cudf_udf tests for #2521
#2514	Add Struct support for ParquetWriter
#2534	Remove scaladoc on an internal method to avoid warning during build
#2537	Add CentOS documentation and improve dockerfiles for UCX
#2531	Add nested types and decimals to CoalesceExec
#2513	Report opTime not totalTime for expand, range, and generate execs
#2533	Fix concat_ws test specifying only a separator for databricks
#2528	Make GenerateDot test more robust
#2529	Change Databricks 310 shim to be 311 to match reported spark.version
#2479	Support concat with separator on GPU
#2507	Improve test coverage for sorting structs
#2526	Improve debug print to include addresses and null counts
#2463	Add EMR 6.3 documentation
#2516	Avoid listener race collecting wrong plan in assert_gpu_fallback_collect
#2505	Qualification tool updates for datasets, udf, and misc fixes
#2509	Added in basic support for scalar structs to named_struct
#2449	Add code for generating dot file visualizations
#2475	Update shuffle documentation for branch-21.06 and UCX 1.10.1
#2500	Update Dockerfile for native UDF
#2506	Support creating Scalars/ColumnVectors from utf8 strings directly.
#2502	Remove work around for nulls in semi-anti joins
#2503	Remove temporary logging and adjust test column names
#2499	Fix regression in TOTAL_TIME metrics for Databricks
#2498	Add in basic support for scalar maps and allow nesting in named_struct
#2496	Add comments for lazy binding in WindowInPandas
#2493	improve window agg test for range numeric types
#2491	Fix regression in cost-based optimizer when calculating cost for Window operations
#2482	Window tests with smaller batches
#2490	Add temporary logging for Dataproc round robin fallback issue
#2486	Remove the null replacement in `computePredicate`
#2469	Adding additional functionalities to profiling tool
#2462	Report gpuOpTime instead of totalTime for project, filter, limit, and window
#2484	Fix the failing test `test_window` on Databricks
#2472	Fix hash_aggregate_test
#2476	Fix for UCP Listener created spark.port.maxRetries times
#2471	skip test_window_aggs_for_rows_lead_lag_on_arrays
#2446	Update plugin version to 21.06.0
#2409	Change shuffle metadata messages to use UCX Active Messages
#2397	Include memory access costs in cost models (cost-based optimizer)
#2442	fix GpuCreateNamedStruct not serializable issue
#2379	support GpuConcat on ArrayType
#2456	Fall back to the CPU for literal array values on case/when
#2447	Filter out the nulls after slicing the batches.
#2426	Implement cast of nested arrays
#2299	support creating array of array
#2451	Update tuning docs to add batch size recommendations.
#2435	support lead/lag on arrays
#2448	support creating list ColumnVector for Literal(ArrayType(NullType))
#2402	Add profiling tool
#2313	Supports `GpuLiteral` of array type

Older Releases

Changelog of older releases can be found at docs/archives

Files

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

Change log

Release 22.02

Features

Performance

Bugs Fixed

PRs

Release 21.12

Features

Performance

Bugs Fixed

PRs

Release 21.10

Features

Performance

Bugs Fixed

PRs

Release 21.08.1

Bugs Fixed

PRs

Release 21.08

Features

Performance

Bugs Fixed

PRs

Release 21.06.2

Bugs Fixed

PRs

Release 21.06.1

Bugs Fixed

PRs

Release 21.06

Features

Performance

Bugs Fixed

PRs

Older Releases