Multiple CCR tests with uncaught exception on windows #44610
Comments
Pinging @elastic/es-distributed
Fails for others as well (in the same run as in the description).
I have multiple runs on Windows on my local CI, and this failure only seems to happen for various CCR tests.
The problem here is that a transport service is not getting fully started:
=> we get stuck on the latch waiting for it to properly start up. @cbuescher and I already investigated a similar issue in #41745, so @cbuescher, it seems this problem might have more to it, since we're now seeing it in an IT as well.
* We shouldn't just swallow the interrupt here quietly and keep going on the IO thread
* Currently interrupt continues here just the same way an invocation of `acceptIncomingRequests` would have made things continue
* Relates elastic#44610
I opened #44622, which might surface some hidden issue here. It seems to me we're handling requests on integ-test nodes that never fully started and are swallowing interrupts, which would explain the situation here (the IO loop never stops cleanly on a stopped, then interrupted, node).
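To illustrate the difference being discussed, here is a minimal sketch in plain Java. It is not the Elasticsearch code: `StartupGate`, `markStarted`, `awaitSwallowing`, and `awaitOrFail` are hypothetical names, with `markStarted` standing in for whatever releases the startup latch (such as `acceptIncomingRequests`). It only shows why a swallowed interrupt looks the same to the caller as a released latch, and what propagating it could look like instead.

```java
import java.util.concurrent.CountDownLatch;

// Hypothetical stand-in for a service that blocks request handling until startup completes.
final class StartupGate {
    private final CountDownLatch started = new CountDownLatch(1);

    // Hypothetical counterpart of releasing the startup latch.
    void markStarted() {
        started.countDown();
    }

    boolean isStarted() {
        return started.getCount() == 0;
    }

    // Swallows the interrupt: the method returns normally, so the caller
    // carries on exactly as if the latch had been released.
    void awaitSwallowing() {
        try {
            started.await();
        } catch (InterruptedException e) {
            // Interrupt dropped; await() has already cleared the thread's interrupt flag.
        }
    }

    // Propagates the interrupt: restores the flag and fails, so the caller
    // cannot mistake an interrupted wait for a completed startup.
    void awaitOrFail() {
        try {
            started.await();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting for startup", e);
        }
    }
}
```

With the interrupt flag set on the calling thread and `markStarted` never invoked, `awaitSwallowing()` returns normally even though `isStarted()` is still false, while `awaitOrFail()` throws and leaves the interrupt flag set for the surrounding loop to observe.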
* Mute failing test tracked in #44552
* mute EvilSecurityTests tracking in #44558
* Fix line endings in ESJsonLayoutTests
* Mute failing ForecastIT test on windows Tracking in #44609
* mute AutoFollowIT.testConflictingPatterns tracking in #44610
* mute BasicRenormalizationIT.testDefaultRenormalization tracked in #44613
* Revert "mute AutoFollowIT.testConflictingPatterns" This reverts commit 012de08.
* mute x-pack internal cluster test windows tracking #44610
* Mute failure unconfigured node name
* fix mute testDefaultRenormalization
* Increase busyWait timeout windows is slow
* Mute JvmErgonomicsTests on windows Tracking #44669
* mute SharedClusterSnapshotRestoreIT testParallelRestoreOperationsFromSingleSnapshot Tracking #44671
* Mute NodeTests on Windows Tracking #44256
* Mute failing test tracked in #44552
* mute EvilSecurityTests tracking in #44558
* Mute failing ForecastIT test on windows Tracking in #44609
* mute BasicRenormalizationIT.testDefaultRenormalization tracked in #44613
* Disable testing conventions on Windows (#43532) (#44506) Tests are disabled on Windows. Conventions also need to be disabled.
* mute x-pack internal cluster test windows tracking #44610
* Mute failure unconfigured node name
* fix mute testDefaultRenormalization
* Increase busyWait timeout windows is slow
* Disable task for mute
* Mute JvmErgonomicsTests on windows Tracking #44669
* mute SharedClusterSnapshotRestoreIT testParallelRestoreOperationsFromSingleSnapshot Tracking #44671
* Mute NodeTests on Windows Tracking #44256
This could be fixed by #44805 as well.
AFAICS this did not affect 6.8 and the tests have not been disabled there, which is why I've removed the label here. #44805 looks to fix the issue. It's only backported to 7.4, however, as it is based on other changes to 7.4.
https://scans.gradle.com/s/vl3yktrv4xey4/tests/htwk6wzdfugzg-ntvlkoitzt6ms?openStackTraces=WzIsMSwwXQ