You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We recently upgraded from Argo Rollouts 1.4.x to 1.6.6 to see if we could resolve a few underlying panics that seem to be happening. However, we are still seeing high number of panics whenever applications trigger an experiement, and we generally will get the following log entry:
There is a large mix of experiments that we have in play, but the one that seems to trigger this most often is a basic placeholder experiment that simply returns an exit 0 response code, and is a one-liner that just sends "quitquitquit" to the experiment. Its used during the initial development of an application before being fleshed out with a full experiment and analysis during the final stages of development.
However, its not the only case where we get those log entries, and our fully fleshed out experiments also seem to trigger this log entry.
On top of that it seems to happen far more frequently then we'd expect for a handful of deployments over the course of an hour or day, in that yesterday we had 699 log entries vs ~20 deployments.
Checklist:
I've included steps to reproduce the bug.
I've included the version of argo rollouts.
To Reproduce
Create a basic rollout that can consume the following analysis template:
This is the most barebones experiment we see that can trigger this.
Expected behavior
The experiment should just exist cleanly and not trigger a panic, but more often than not the panic is triggered.
Version
v.1.4.1
v1.6.6 (currently deployed)
Logs
# Paste the logs from the rollout controller
# Logs for the entire controller:
kubectl logs -n argo-rollouts deployment/argo-rollouts
# Logs for a specific rollout:
kubectl logs -n argo-rollouts deployment/argo-rollouts | grep rollout=<ROLLOUTNAME
Message from the maintainers:
Impacted by this bug? Give it a 👍. We prioritize the issues with the most 👍.
The text was updated successfully, but these errors were encountered:
We recently upgraded from Argo Rollouts 1.4.x to 1.6.6 to see if we could resolve a few underlying panics that seem to be happening. However, we are still seeing high number of panics whenever applications trigger an experiement, and we generally will get the following log entry:
There is a large mix of experiments that we have in play, but the one that seems to trigger this most often is a basic placeholder experiment that simply returns an exit 0 response code, and is a one-liner that just sends "quitquitquit" to the experiment. Its used during the initial development of an application before being fleshed out with a full experiment and analysis during the final stages of development.
However, its not the only case where we get those log entries, and our fully fleshed out experiments also seem to trigger this log entry.
On top of that it seems to happen far more frequently then we'd expect for a handful of deployments over the course of an hour or day, in that yesterday we had 699 log entries vs ~20 deployments.
Checklist:
To Reproduce
Create a basic rollout that can consume the following analysis template:
This is the most barebones experiment we see that can trigger this.
Expected behavior
The experiment should just exist cleanly and not trigger a panic, but more often than not the panic is triggered.
Version
v.1.4.1
v1.6.6 (currently deployed)
Logs
Message from the maintainers:
Impacted by this bug? Give it a 👍. We prioritize the issues with the most 👍.
The text was updated successfully, but these errors were encountered: