Allow retry of 500 API errors to be handled by restart policies #3073

clinta · 2017-08-22T14:24:01Z

API error 500 from docker is often recoverable, in our case we typically see this error from a volume driver after a container crashes. The volume driver must wait for a timeout from the now dead container before allowing a new container to mount it. This can take longer than 5 seconds.

Allowing these retires to be handled by the jobs configured restart policies seems to be a more intuitive solution than having a special hard-coded retry just for these errors.

dadgar · 2017-08-22T17:36:45Z

client/driver/docker.go

-			time.Sleep(1 * time.Second)
-			goto START
-		}
+		return structs.NewRecoverableError(startErr, true)


Can you change it such that we still retry quickly for 5 times and then fall back to the recoverable error if it is a 500? We want the fast retry behavior in this case since often it recovers quickly and the users restart policy may cause the task to fail.

dadgar · 2017-08-24T23:57:34Z

Thanks @clinta

github-actions · 2023-03-25T02:11:06Z

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

dadgar reviewed Aug 22, 2017

View reviewed changes

Allow retry of 500 API errors to be handled by restart policies

6b98ddf

clinta force-pushed the docker-500 branch from 1262d0f to 6b98ddf Compare August 22, 2017 18:05

dadgar merged commit ba1eecb into hashicorp:master Aug 24, 2017

github-actions bot locked as resolved and limited conversation to collaborators Mar 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow retry of 500 API errors to be handled by restart policies #3073

Allow retry of 500 API errors to be handled by restart policies #3073

clinta commented Aug 22, 2017

dadgar Aug 22, 2017

dadgar commented Aug 24, 2017

github-actions bot commented Mar 25, 2023

Allow retry of 500 API errors to be handled by restart policies #3073

Allow retry of 500 API errors to be handled by restart policies #3073

Conversation

clinta commented Aug 22, 2017

dadgar Aug 22, 2017

Choose a reason for hiding this comment

dadgar commented Aug 24, 2017

github-actions bot commented Mar 25, 2023