-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wait container failing: level=info msg="stopping progress monitor (context done)" error="context canceled"
#10490
Comments
This is actually not related to the repo artifact, but it seems to be related to artifact passing. I am also getting this on other stuff I am passing through workflows. How did such regression come from a new update? |
I think we should pay more attention to the
|
@sxllwx i don't have any "unable to resolve delta" messages |
For your case, you can provide the following information for troubleshooting:
|
@sxllwx Thanks for the response! Briefly updated the original post. The "unable to resolve reference delta" was in Argo v3.4.4, hence why I updated to 3.4.5. As a current workaround, I have setup a custom template where I implement the cloning myself. As soon as I did set this up, my pipeline continued to crash signalling the exit 2 above. Some additional observations:
I ended up downgrading to v.3.4.4. and the issue was no longer present. The only log notifications that resembles an error is the following:
Follow-ups:
|
Thank you for the information Can you provide the manifest of the Pod with Error (exit code 2)? |
@danajp perhaps #10039 is the cause? #9179 (comment) comment caught my attention "Most users will never need tty. Replacing os.Exec with pty is risky (reasons should be obvious). I think benefit outweights risk." there is also creack/pty#127 |
@tooptoop4 Seems unlikely. |
…rgoproj#10033 argoproj#10490 Signed-off-by: Alex Collins <[email protected]>
…rgoproj#10033 argoproj#10490 Signed-off-by: Alex Collins <[email protected]>
The following facts have been confirmed so far:
|
Update: Here is a summary:
Some things we need to clarify (will involve a lot of code):
https://github.com/argoproj/argo-workflows/blob/v3.4.5/cmd/argoexec/commands/wait.go#L36-L46 The https://github.com/argoproj/argo-workflows/blob/v3.4.4/cmd/argoexec/commands/wait.go#L36-L46 Here
From the above analysis, we can see that whether exit(2) depends on the timing of receiving
I hope my answer can clarify your doubts (you are welcome to question again): @lvijnck @tooptoop4 @alexec |
Do you have an estimation when will this fix be released? |
) Signed-off-by: Alex Collins <[email protected]>
) Signed-off-by: Alex Collins <[email protected]>
level=info msg="stopping progress monitor (context done)" error="context canceled"
Pre-requisites
:latest
What happened/what you expected to happen?
I am trying to implement a mechanism to clone a repository and pass it downstream. This has been failing due to the "unable to resolve delta" that was happening in 3.4.4. I just updated to argo v3.4.5.
I am now getting errors on Argo's "wait" container. I performed several tests and concluded the following:
Version
v.3.4.5
Paste a small workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflows that uses private images.
Logs from the workflow controller
Logs from in your workflow's wait container
The text was updated successfully, but these errors were encountered: