Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Workflow hangs after clicking stop and terminate #10491

Closed
2 of 3 tasks
JPZ13 opened this issue Feb 8, 2023 · 3 comments · Fixed by #10639
Closed
2 of 3 tasks

Workflow hangs after clicking stop and terminate #10491

JPZ13 opened this issue Feb 8, 2023 · 3 comments · Fixed by #10639
Assignees
Labels
area/templates/container-set P1 High priority. All bugs with >=5 thumbs up that aren’t P0, plus: Any other bugs deemed high priority type/bug

Comments

@JPZ13
Copy link
Member

JPZ13 commented Feb 8, 2023

Pre-requisites

  • I have double-checked my configuration
  • I can confirm the issues exists when I tested with :latest
  • I'd like to contribute the fix myself (see contributing guide)

What happened/what you expected to happen?

When clicking stop or terminate within the UI, we are noticing that workflows are left hanging in a running state. The only way to get them to actually stop is to delete the workflow. This causes a problem for users who want to stop the workflow but keep the record in their archives

Here's a list of some other issues that I think might be related. I will keep this list up to date as more arise:

Version

latest

Paste a small workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflows that uses private images.

Unfortunately, this happens intermittently. I will update this issue with some videos and see if I can find some workflows that consistently trigger it using only public images

Logs from the workflow controller

see above

Logs from in your workflow's wait container

See above
@JPZ13 JPZ13 added the type/bug label Feb 8, 2023
@caelan-io caelan-io added the P1 High priority. All bugs with >=5 thumbs up that aren’t P0, plus: Any other bugs deemed high priority label Feb 16, 2023
@JPZ13
Copy link
Member Author

JPZ13 commented Feb 23, 2023

Grain.Highlight.7vm9f15b0ukV93KlO59k6kH8m8eTMiPiwxysYezG.mp4

The above video demonstrates what we're seeing along with the controller logs. It doesn't look like there's any logging output unfortunately

@JPZ13
Copy link
Member Author

JPZ13 commented Feb 23, 2023

On a call with @juliev0 - we need the Argo Server logs to get more clarity into what's happening @tico24 or any others, if you see this happen again, can you post the Argo Server logs as well?

Also, if we're seeing any logs being sent from the main or wait container that can inform us whether the SIGINT or SIGTERM signals are going out, that would be super helpful as well

@tico24
Copy link
Member

tico24 commented Feb 24, 2023

Naturally, every workflow I've tried to stop today just stops gracefully. I'll add logs as soon as I can recreate.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
area/templates/container-set P1 High priority. All bugs with >=5 thumbs up that aren’t P0, plus: Any other bugs deemed high priority type/bug
Projects
None yet
5 participants