Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Task executes but step seems to hang for 3 minutes before it completes #105

Closed
komish opened this issue Jul 8, 2024 · 3 comments · Fixed by #106
Closed

[BUG] Task executes but step seems to hang for 3 minutes before it completes #105

komish opened this issue Jul 8, 2024 · 3 comments · Fixed by #106
Labels
bug Something isn't working

Comments

@komish
Copy link
Contributor

komish commented Jul 8, 2024

Version

Download action repository 'redhat-actions/openshift-tools-installer@v1' (SHA:2de9a80cf012ad0601021515481d433b91ef8fd5)

Describe the bug

Around 2024 July 04, I started seeing a consistent 3 minute runtime for openshift-tools-installer. Two separate calls, installing separate tools. We were previously seeing this take anywhere from 1-15 seconds, but now we're consistently seeing it take 3 minutes.

From the logs, I can see that all of the actual "work" being executed by the task completes almost immediately (within the 1-15 seconds), but the actions step just ... hangs until the 3m timer completes. Then it halts (green check, successful).

This is causing issues with some of our e2e workloads causing them to hit GitHub Actions timeouts.

Steps to reproduce, workflow links, screenshots

Here's a test repo demonstrating the problem.
https://github.com/komish/actions-workflow-call-test/actions/runs/9847203226

It installs oc and chart-verifier. In both cases, it's skipping the cache. oc comes the mirror, and chart-verifier comes from github.

Here's the same happening against all active Ubuntu images available in GHA. Ignore the failed one (running on 20.04). It's unrelated.

https://github.com/komish/actions-workflow-call-test/actions/runs/9847235069

Here's the last time we saw a short runtime. Both under 15s (respectively).

https://github.com/openshift-helm-charts/sandbox/actions/runs/9788116992/job/27025670858#step:14:1 (oc)
https://github.com/openshift-helm-charts/sandbox/actions/runs/9788116992/job/27025670858#step:12:1 (chart-verifier)

@komish komish added the bug Something isn't working label Jul 8, 2024
@komish
Copy link
Contributor Author

komish commented Jul 8, 2024

cc @divyansh42 let me know what information I can provide. I don't think I see any changes in this repo in that time frame, but I'm having trouble debugging further.

I see some kind of image update for ubuntu22.04 (which is what I use) in https://github.com/actions/runner-images/releases/tag/ubuntu22%2F20240630.1 - but I also tested against other ubuntu images to observe the same result. Their release time frame is close, but not exact - they released on 2024-07-02 and we saw a successful run on 2024-07-03. Could be close enough to attribute this issue to the image. Hard to tell.

The only remaining thing I can think may be in scope (other than this repo) is the action code itself https://github.com/actions/runner/releases/tag/v2.317.0 - but it hasn't release in this time frame either.

@komish
Copy link
Contributor Author

komish commented Jul 10, 2024

Opened an issue against actions/runner-images to see if their recent releases have some kind of issue causing this: actions/runner-images#10211

@komish
Copy link
Contributor Author

komish commented Jul 18, 2024

Folks, I think the underlying problem is here: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

I don't see this problem on Node16, just Node20. GitHub Actions are replacing Node16 with Node20 by default now.

I tried to debug, and could consistently reproduce this with Node20 on my workstation running your dev-test script. I applied a couple of workarounds (e.g. adding in a .finally block that exits the process), but I didn't really feel like that's the right way to go about this.

I'm guessing there's some promise not being resolved properly somewhere.

mgoerens pushed a commit to mgoerens/development that referenced this issue Nov 19, 2024
The redhat-actions/openshift-tools-installer is experiencing slowness
when installing oc. This steps can take up to 3 minutes. More info:
redhat-actions/openshift-tools-installer#105

Using a custom script reduces the installation of oc to less than 10s.

Signed-off-by: Matthias Goerens <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant