Implement Hive parquet bloomfilter prune test #15633

Merged
merged 1 commit into from
Feb 22, 2023

leetcode-1533
Contributor

Description

Implement Hive parquet bloomfilter prune test.

Additional context and related issues

This unit test helps verify that enabling Hive Parquet's parquet.use-bloom-filter option lets Trino load fewer blocks (row groups).

More details: #14428 (comment)
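
To illustrate the idea behind the test, here is a minimal sketch of Bloom filter based row-group pruning. It is not Trino's or Parquet's implementation: `ToyBloomFilter`, `filterOf`, and `rowGroupsToRead` are hypothetical names, and the hashing is a simple stand-in for Parquet's split-block Bloom filter. It only shows why an equality predicate can skip a row group whose filter reports the value as definitely absent.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

public class BloomFilterPruneSketch
{
    // Toy Bloom filter (hypothetical); Parquet uses a split-block design instead.
    static final class ToyBloomFilter
    {
        private static final int BITS = 1024;
        private final BitSet bits = new BitSet(BITS);

        void add(String value)
        {
            for (int position : positions(value)) {
                bits.set(position);
            }
        }

        // false means "definitely absent"; true means "possibly present"
        boolean mightContain(String value)
        {
            for (int position : positions(value)) {
                if (!bits.get(position)) {
                    return false;
                }
            }
            return true;
        }

        // Derive three bit positions from one hash code (double hashing).
        private static int[] positions(String value)
        {
            int h1 = value.hashCode();
            int h2 = Integer.rotateLeft(h1 * 0x9E3779B1, 15) | 1;
            return new int[] {
                    Math.floorMod(h1, BITS),
                    Math.floorMod(h1 + h2, BITS),
                    Math.floorMod(h1 + 2 * h2, BITS)};
        }
    }

    // Build the filter a writer would store for one row group's column values.
    static ToyBloomFilter filterOf(String... values)
    {
        ToyBloomFilter filter = new ToyBloomFilter();
        for (String value : values) {
            filter.add(value);
        }
        return filter;
    }

    // Return the indexes of row groups that must still be read for "column = value".
    static List<Integer> rowGroupsToRead(List<ToyBloomFilter> rowGroupFilters, String value)
    {
        List<Integer> kept = new ArrayList<>();
        for (int i = 0; i < rowGroupFilters.size(); i++) {
            if (rowGroupFilters.get(i).mightContain(value)) {
                kept.add(i);
            }
        }
        return kept;
    }

    public static void main(String[] args)
    {
        List<ToyBloomFilter> filters = List.of(filterOf("a", "b"), filterOf(), filterOf("c"));
        System.out.println("Row groups to read for 'c': " + rowGroupsToRead(filters, "c"));
    }
}
```

A row group that contains the value is always kept, because a Bloom filter has no false negatives. A row group that lacks the value may occasionally be kept anyway (a false positive), so a prune test should assert that fewer row groups are read rather than an exact minimum.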

Release notes

(x) This is not user-visible or docs only and no release notes are required.
( ) Release notes are required, please propose a release note for me.
( ) Release notes are required, with the following suggested text:

# Section
* Fix some things. ({issue}`issuenumber`)

@cla-bot cla-bot bot added the cla-signed label Jan 6, 2023
@leetcode-1533 leetcode-1533 marked this pull request as draft January 7, 2023 20:58
@leetcode-1533 leetcode-1533 marked this pull request as ready for review January 7, 2023 20:59
@leetcode-1533
Contributor Author

@findinpath


JobConf jobConf = new JobConf(newEmptyConfiguration());
jobConf.setEnum(WRITER_VERSION, PARQUET_1_0);
jobConf.setBoolean(BLOOM_FILTER_ENABLED, true);
Contributor

nit: Let's enable the bloom filter only for columnName, not for all columns.
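
A sketch of what the suggestion could look like, under one assumption that should be checked against the parquet-hadoop version in use: a writer property can be scoped to a single column by appending `#` and the column path to the property name. The `forColumn` and `writerProperties` helpers are hypothetical, and a plain `Map` stands in for the `JobConf` so that the example stays self-contained.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class PerColumnBloomFilterConfigSketch
{
    // Property name behind parquet-hadoop's BLOOM_FILTER_ENABLED constant
    static final String BLOOM_FILTER_ENABLED = "parquet.bloom.filter.enabled";

    // Assumption: "<property>#<column path>" scopes the property to one column.
    static String forColumn(String property, String columnPath)
    {
        return property + "#" + columnPath;
    }

    // Stand-in for JobConf: enable the Bloom filter for a single column only.
    static Map<String, String> writerProperties(String columnName)
    {
        Map<String, String> properties = new LinkedHashMap<>();
        properties.put(forColumn(BLOOM_FILTER_ENABLED, columnName), "true");
        return properties;
    }

    public static void main(String[] args)
    {
        System.out.println(writerProperties("columnName"));
    }
}
```

With a real `JobConf`, the equivalent would be roughly `jobConf.setBoolean(forColumn(BLOOM_FILTER_ENABLED, columnName), true)` in place of the global `jobConf.setBoolean(BLOOM_FILTER_ENABLED, true)`.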

jobConf.setEnum(WRITER_VERSION, PARQUET_1_0);
jobConf.setBoolean(BLOOM_FILTER_ENABLED, true);

ParquetTester.writeParquetColumn(
Contributor

Use a static import for writeParquetColumn.

@github-actions

This pull request has gone a while without any activity. Tagging the Trino developer relations team: @bitsondatadev @colebow @mosabua

@github-actions github-actions bot added the stale label Feb 21, 2023
@bitsondatadev
Member

@leetcode-1533, are you still working on this PR? If not we can close it for now and reopen it later. Let me know! Thanks!


4 participants