Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add segment partitioning #23386

Closed
wants to merge 1 commit into from
Closed

Add segment partitioning #23386

wants to merge 1 commit into from

Conversation

wendigo
Copy link
Contributor

@wendigo wendigo commented Sep 12, 2024

No description provided.

@cla-bot cla-bot bot added the cla-signed label Sep 12, 2024
@wendigo wendigo requested a review from losipiuk September 12, 2024 15:13
Comment on lines +159 to +161
@ConfigDescription("Number of storage partitions used")
@Config("fs.segment.partitions")
public FileSystemSpoolingConfig setPartitions(int partitions)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think naming this partition is a bit misleading. It is introduced to bump number of "partitions" AWS S3 uses for spooled files, but it is not 1-1 mapping, also all of that bases on undocumented AWS S3 behavior, which we cannot validate.

Two suggestions:

  1. I would make this S3 specific. As we have no data (at least I do not have) to believe that it benefits other filesystems. And even if we need similar mechanism for other providers it may require different implemenation

  2. Maybe let's be explicit in what this is doing, instead pretending we know that it will bump number of S3 partitions to some exact value. Maybe better name for property would be something arund "spooling.file.prefix.entropy" ?

Copy link

github-actions bot commented Oct 3, 2024

This pull request has gone a while without any activity. Tagging the Trino developer relations team: @bitsondatadev @colebow @mosabua

@github-actions github-actions bot added the stale label Oct 3, 2024
@wendigo wendigo closed this Oct 3, 2024
@wendigo wendigo deleted the serafin/segment-partitions branch November 11, 2024 12:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

Successfully merging this pull request may close these issues.

2 participants