[TEST] Compatibility tests for data formats #8666

mythrocks · 2023-07-06T18:29:19Z

Here is a list of tests to confirm data format compatibility with Apache Spark, in the Spark RAPIDS plugin. This list is a work in progress:

Orc:

Parquet:

Tasks

Give feedback

Schema evolution: Test ORC reads after table schema changes #8692

task test
Add test for the timestamp error case described in SPARK-10177 #8693

task test
Add test for Parquet schema interpretation problem described in SPARK-16344 #8694

task test
Add an xfail test for Parquet reads for LIST<STRUCT<int, string>> #8708

task test
CPU fallback for user-defined types in ORC read/write: OrcQuerySuite.scala#L108 #8730

task test
ORC reads at scale with all null values: Like OrcQuerySuite.scala#L173, but with large number of rows. #8731

test
Test predicate pushdown (PPD) with timestamps, decimals, booleans, etc. Refer to OrcQuerySuite.scala#L464. #8823

test
Leverage Apache ORC example files in ORC tests #9215

test
Options

The text was updated successfully, but these errors were encountered:

mythrocks · 2023-07-14T17:42:46Z

Argh. Ignore the previous comment. The task descriptions are rendered unreadable if not logged in through Github Enterprise.

Sorry for the confusion. There isn't a problem here.

mythrocks added test Only impacts tests task Work required that improves the product but is not user facing labels Jul 6, 2023

jlowe mentioned this issue Jul 13, 2023

Add tests for column names with dots #8704

Merged

mythrocks self-assigned this Jul 14, 2023

mythrocks mentioned this issue Jul 18, 2023

Add test for selecting a single complex field array and its parent struct array [databricks] #8744

Merged

mattahrens added the reliability Features to improve reliability or bugs that severly impact the reliability of the plugin label Jul 26, 2023

mythrocks mentioned this issue Aug 1, 2023

[BUG] Possible failure to push down pruned read schema to ORC reader #8906

Closed

jlowe mentioned this issue Sep 11, 2023

Leverage Apache ORC example files in ORC tests #9215

Open

res-life mentioned this issue Sep 28, 2023

Test compatibility between pyarrow and GPU #9288

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TEST] Compatibility tests for data formats #8666

[TEST] Compatibility tests for data formats #8666

mythrocks commented Jul 6, 2023 •

edited by res-life

Loading

Tasks

mythrocks commented Jul 14, 2023 •

edited

Loading

[TEST] Compatibility tests for data formats #8666

[TEST] Compatibility tests for data formats #8666

Comments

mythrocks commented Jul 6, 2023 • edited by res-life Loading

Orc:

Parquet:

Tasks

mythrocks commented Jul 14, 2023 • edited Loading

mythrocks commented Jul 6, 2023 •

edited by res-life

Loading

mythrocks commented Jul 14, 2023 •

edited

Loading