-
Notifications
You must be signed in to change notification settings - Fork 24.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
testAbortedSnapshotDuringInitDoesNotStart fails with ClassCastException #38226
Comments
Pinging @elastic/es-distributed |
Nasty ... I'll be able to get to this tomorrow afternoon. Should be an easy fix though. |
* The response type here is not empty and was always wrong but this only became visible now that 0a604e3 was introduced * As a result of 0a604e3 we started actually handling the response of this request and logging/handling exceptions before that we simply dropped the classcast exception here quietly using the empty response handler * Closes elastic#38226
* Fix Incorrect Transport Response Handler Type * The response type here is not empty and was always wrong but this only became visible now that 0a604e3 was introduced * As a result of 0a604e3 we started actually handling the response of this request and logging/handling exceptions before that we simply dropped the classcast exception here quietly using the empty response handler * fix busy assert not handling `Exception` * Closes #38226 * Closes #38256
Reopening since this is still being reported in #38264 (comment) |
Muted again on master with 15510da. |
I tracked this down now, this is a real bug. The fix here is to do a refactoring similar to https://github.com/elastic/elasticsearch/compare/master...ywelsch:snapshot-refactored?expand=1#diff-a0853be4492c052f24917b5c1464003dR975 and remove the duplicate spots where we call |
Muted by 715e581 on master |
…38368) * The problem in #38226 is that in some corner cases multiple calls to `endSnapshot` were made concurrently, leading to non-deterministic behavior (`beginSnapshot` was triggering a repository finalization while one that was triggered by a `deleteSnapshot` was already in progress) * Fixed by: * Making all `endSnapshot` calls originate from the cluster state being in a "completed" state (apart from on short-circuit on initializing an empty snapshot). This forced putting the failure string into `SnapshotsInProgress.Entry`. * Adding deduplication logic to `endSnapshot` * Also: * Streamlined the init behavior to work the same way (keep state on the `SnapshotsService` to decide which snapshot entries are stale) * closes #38226
This test is failing with ClassCastException.
I can't reproduce this locally but this test failed 4 times today.
CI:
The text was updated successfully, but these errors were encountered: