Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trino Parquet Column Encryption #9383

Open
shangxinli opened this issue Sep 25, 2021 · 8 comments
Open

Trino Parquet Column Encryption #9383

shangxinli opened this issue Sep 25, 2021 · 8 comments

Comments

@shangxinli
Copy link
Member

shangxinli commented Sep 25, 2021

Parquet modular encryption provides a column encryption feature. It would be great to have that in Presto. Due to the slowness of the PresteoDB, I would like to start the feature in TrinoDB. The original issue in PrestoDB can be found here.

It is a large feature and it can be divided into three parts: 1) Common code path 2) Decryption 3) Encryption. The command code path change and decryption change draft can be found here.

@xiacongling
Copy link
Contributor

Hi, @shangxinli . I noticed that the previous PR has been closed, is there any progress on Parquet column encryption?

@findepi findepi changed the title Presto Parquet Column Encryption Trino Parquet Column Encryption Sep 16, 2022
@amoghmargoor
Copy link
Member

amoghmargoor commented Jun 8, 2023

We at Apple have been running PME for a while now and would like to contribute the decryption support in Trino for it. We will be raising the draft PR soon for this. cc @vgankidi

@amoghmargoor
Copy link
Member

@my7ym
Copy link

my7ym commented Nov 30, 2023

@amoghmargoor Do you folks have any timeline to deliver the PME support for Trino? Thanks!

@hashhar
Copy link
Member

hashhar commented Sep 3, 2024

It seems this is no longer being actively worked on.

Feel free to re-open if needed.

@sopel39
Copy link
Member

sopel39 commented Sep 4, 2024

I thin we should reopen it. cc @amoghmargoor

@sopel39 sopel39 reopened this Sep 4, 2024
@amoghmargoor
Copy link
Member

I have just opened a new PR for this rebased, PR is WIP as we are trying to add test cases for the encryption. #23583. This will have many newer files compared to existing PRs due to parquet-hadoop module being decoupled.

@dfangs
Copy link

dfangs commented Jan 17, 2025

Hey folks, I just noticed that there has been some progress on Parquet Modular Encryption support for Trino since our last inquiry in 2023.

We actually ended up continuing the work in #20069 internally as our company had a goal to enable PME on Trino last year (decryption only, same as Presto), and it has been in production for around half a year.

I wonder what is the best way for us to contribute. Notably, I discovered some bugs/missing parts in the original Presto implementation (including a potential fix to prestodb/presto#22703) and did some refactoring on the encryption-related classes (e.g., InternalFileDecryptor) to make it much more readable.

We would like to push these changes to the upstream too if possible to keep our fork minimal (i.e. only limited to custom KMS implementation). I saw that there is a WIP in #24517; I'm happy to open a new PR too if that works better.

cc @sopel39 if you have some thoughts on collaboration, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

7 participants