-
Notifications
You must be signed in to change notification settings - Fork 350
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add some random prefix to split ids. #4824
Labels
enhancement
New feature or request
Comments
fulmicoton
added a commit
that referenced
this issue
Oct 17, 2024
…key. The point is to workaround S3 rate limiting. Since it is based on keys, our ULID naming scheme can lead to hotspot in the keyspace. This solution has a downside. External scripts listing files will have a their job multiplied. For this reason, the prefix cardinality is configurable. Closes #4824
fulmicoton
added a commit
that referenced
this issue
Oct 17, 2024
…key. The point is to workaround S3 rate limiting. Since it is based on keys, our ULID naming scheme can lead to hotspot in the keyspace. This solution has a downside. External scripts listing files will have a their job multiplied. For this reason, the prefix cardinality is configurable. Closes #4824
fulmicoton
added a commit
that referenced
this issue
Oct 17, 2024
…key. The point is to workaround S3 rate limiting. Since it is based on keys, our ULID naming scheme can lead to hotspot in the keyspace. This solution has a downside. External scripts listing files will have a their job multiplied. For this reason, the prefix cardinality is configurable. Closes #4824
fulmicoton
added a commit
that referenced
this issue
Oct 17, 2024
…key. The point is to workaround S3 rate limiting. Since it is based on keys, our ULID naming scheme can lead to hotspot in the keyspace. This solution has a downside. External scripts listing files will have a their job multiplied. For this reason, the prefix cardinality is configurable. Closes #4824
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
S3 creates partitions based on a prefix of the object keys.
Because we use ULID, splits coming from the same day end up on the same servers.
On the search side, most of the queries tend to target the last few days, and end up hitting the same servers.
For this reason, in some case, we can hit the per-prefix rate limiting.
The text was updated successfully, but these errors were encountered: