pauseonstart test failures #38847
Looking at the history of this test, I think the first failures happened on Friday last week, which suggests that a recent change caused them. Let me look into recent changes and see if anything stands out.
I'll be tracking my research into this failure in this issue. I've got a repro of this under the debugger locally. It looks like the polling loop occasionally gets into a bad state on Windows x86 where the first two entries in the runtime/src/coreclr/src/debug/debug-pal/win/diagnosticsipc.cpp Lines 313 to 318 in c5b39f3
The reverse connection's handle (
I'm not sure how this could happen, though. I'm going to add some additional logging and see if I can pinpoint what happened.
It failed again in outerloop:
I think we should turn this test off for Windows x86 until I can diagnose the issue.
Only x86? I see a very recent Windows x64 timeout failure here:
Well, darn... That looks to be the same symptom. I hadn't seen it on x64 yet. It definitely seems more prevalent on x86. I'll turn the test off for all Windows legs, then spin up a draft PR with the tests turned back on to do further CI triaging.
I've been investigating this some more today and I've got a hypothesis of what's going on. There is a missing runtime/src/coreclr/src/debug/debug-pal/win/diagnosticsipc.cpp Lines 258 to 318 in 9c3f017
I have observed it happening a couple of times locally, and that appears to be what is going on. The reason I think so is that I collected the following stress log:
In the hung state, the debugger says that the first element of
The fix would be to just put the else statement back in.
tracing\eventpipe\pauseonstart\pauseonstart\pauseonstart.cmd has failed in multiple CI runs:
R2R Windows_NT x64 Checked forcerelocs @ Windows.10.Amd64.Open
https://dev.azure.com/dnceng/public/_build/results?buildId=716943&view=ms.vss-test-web.build-test-results-tab&runId=22218608&paneView=dotnet-dnceng.dnceng-build-release-tasks.helix-test-information-tab&resultId=110533
Also: all the Windows x86 legs of runtime-coreclr jitstress-isas-x86: https://dev.azure.com/dnceng/public/_build/results?buildId=716582&view=ms.vss-test-web.build-test-results-tab&runId=22207570&resultId=110525&paneView=debug
@josalem