forked from ClickHouse/ClickHouse
-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #558 from Altinity/backports/24.8.8/parquet-page-h…
…eader-v2-native-reader 24.8.8 Backport of ClickHouse#70807 parquet page header v2 native reader
- Loading branch information
Showing
4 changed files
with
175 additions
and
43 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
10 changes: 10 additions & 0 deletions
10
tests/queries/0_stateless/03251_parquet_page_v2_native_reader.reference
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
abc 2 | ||
abc 2 | ||
abc 3 | ||
abc 4 | ||
\N 5 | ||
abc 2 | ||
abc 2 | ||
abc 3 | ||
abc 4 | ||
\N 5 |
23 changes: 23 additions & 0 deletions
23
tests/queries/0_stateless/03251_parquet_page_v2_native_reader.sh
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
#!/usr/bin/env bash | ||
# Tags: no-ubsan, no-fasttest | ||
|
||
CUR_DIR=$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd) | ||
# shellcheck source=../shell_config.sh | ||
. "$CUR_DIR"/../shell_config.sh | ||
|
||
USER_FILES_PATH=$($CLICKHOUSE_CLIENT_BINARY --query "select _path,_file from file('nonexist.txt', 'CSV', 'val1 char')" 2>&1 | grep Exception | awk '{gsub("/nonexist.txt","",$9); print $9}') | ||
|
||
WORKING_DIR="${USER_FILES_PATH}/${CLICKHOUSE_TEST_UNIQUE_NAME}" | ||
|
||
mkdir -p "${WORKING_DIR}" | ||
|
||
DATA_FILE="${CUR_DIR}/data_parquet/datapage_v2.snappy.parquet" | ||
|
||
DATA_FILE_USER_PATH="${WORKING_DIR}/datapage_v2.snappy.parquet" | ||
|
||
cp ${DATA_FILE} ${DATA_FILE_USER_PATH} | ||
|
||
# Not reading all columns because some data types and encodings are not supported by native reader yet | ||
# TODO read all columns once implemented | ||
${CLICKHOUSE_CLIENT} --query="select a, c from file('${DATA_FILE_USER_PATH}', Parquet) order by c SETTINGS input_format_parquet_use_native_reader=false;" | ||
${CLICKHOUSE_CLIENT} --query="select a, c from file('${DATA_FILE_USER_PATH}', Parquet) order by c SETTINGS input_format_parquet_use_native_reader=true;" |