Internal API for retrying HTTP requests #518

hiranya911 · 2019-04-30T18:17:41Z

The existing HttpClient retries all HTTP requests once that are failing due to connection timeout and reset errors. We would like to further extend this retries support to meet the following new requirements:

Ability to retry requests on specific HTTP error codes (e.g. 503)
Ability to delay subsequent retries according to the Retry-After header sent by the server
Ability to use exponential backoff
Ability to configure retries for individual services in the SDK

In order to meet all the above requirements, this PR introduces a new internal interface called RetryConfig. The HttpClient optionally takes a RetryConfig as a constructor parameter, and retries failing requests based on that.

For the moment, we use the following as the default RetryConfig in order to retain the existing behavior:

const DEFAULT_RETRY_CONFIG: RetryConfig = {
  maxRetries: 1,
  ioErrorCodes: ['ECONNRESET', 'ETIMEDOUT'],
  maxDelayInMillis: 60 * 1000,
};

We can change this to something more sophisticated in the future (in other Admin SDKs we retry up to 4 times, and also retry on 500 and 503 HTTP errors).

bojeil-google · 2019-05-18T03:02:53Z

src/utils/api-request.ts

@@ -166,8 +166,66 @@ export class HttpError extends Error {
  }
 }

+/**
+ * Specifies how failing HTTP requests should be retried.


Please explain what the different fields mean in this interface (an example would be helpful). For example maxDelayInMillis is per retry and not in total per request, etc. backoff factor in seconds, etc.

bojeil-google · 2019-05-20T18:08:36Z

src/utils/api-request.ts

-          return this.sendWithRetry(config, attempts + 1);
+      })
+      .catch((err: LowLevelError) => {
+        const [delayMillis, canRetry] = this.getRetryDelayMillis(retryAttempts, err);


The private functions added do not provide enough descriptions on what is being computed or a summary of the underlying logic making it harder to deduce the underlying behavior. It helps to either add more comments here or descriptions of the private functions below.

I think getRetryDelayMillis() is the confusing one, although I feel that the variable names explain what's going on. In any case, I've added a jsdoc comment to the method to further explain it.

bojeil-google · 2019-05-20T19:37:03Z

test/unit/utils/api-request.spec.ts

+    }).should.eventually.be.rejectedWith(err).and.have.property('code', 'app/network-error');
+  });
+
+  it('should not retry when for error codes that are not configured', () => {


should not retry when error codes are not configured

bojeil-google · 2019-05-20T19:52:37Z

src/utils/api-request.ts

+
+  private backOffDelayMillis(retryAttempts: number): number {
+    if (retryAttempts === 0) {
+      return 0;


Why do you not wait on the first retry?
You will end up with a wait pattern: first fail, 0, 2000, 4000
Instead of: first fail, 1000, 2000, 4000
I think the latter has a better likelihood of succeeding with less time overall.

This is to retain the existing behavior, and also to align with how retries are implemented in other languages/libraries. Basically we only delay retries in the event of consecutive errors. Here's a similar implementation from Python's urllib3 package: https://github.com/urllib3/urllib3/blob/master/src/urllib3/util/retry.py#L222

In general, the existing strategy of immediately retrying on the first error has worked well for us so far. This is particularly useful in environments like GCF where low-level transient errors seem to be common.

bojeil-google · 2019-05-20T23:46:13Z

test/unit/utils/api-request.spec.ts

+      expect(resp.data).to.deep.equal(respData);
+      expect(resp.isJson()).to.be.true;
+      expect(delayStub.callCount).to.equal(1);
+      expect(delayStub.args[0][0]).to.be.gt(27 * 1000).and.to.be.lte(30 * 1000);


How was 27 * 1000 determined?

Managed to make the assertion exact by using sinon fake timers.

…-node into hkj-retry-config

hiranya911 · 2019-05-21T20:44:42Z

Thanks @bojeil-google. Made most of the suggested changes. Ready for another look.

bojeil-google · 2019-05-22T07:46:37Z

src/utils/api-request.ts

+  /** Maximum number of times to retry a given request. */
+  maxRetries: number;
+
+  /** HTTP status codes taht should be retried. */


bojeil-google · 2019-05-22T08:14:01Z

src/utils/api-request.ts

+  ioErrorCodes?: string[];
+
+  /**
+   * The multiplier for exponential back off. The retry delay is calculated using the formula `(2^n) * backOffFactor`,


Clarify that it's in seconds
The retry delay is calculated using the formula (2^n) * backOffFactor seconds,

hiranya911 added 2 commits April 29, 2019 17:06

Framework for automatic HTTP retries

9ba0091

Added docs and more tests

6dc34c7

hiranya911 requested a review from bojeil-google April 30, 2019 18:22

hiranya911 assigned bojeil-google Apr 30, 2019

This was referenced Apr 30, 2019

FR: [Messaging] Add option to retry sending or expose Retry-After header #43

Closed

Best way to determine specific errors? firebase/firebase-admin-dotnet#45

Closed

Merge branch 'master' into hkj-retry-config

91212c8

bojeil-google suggested changes May 21, 2019

View reviewed changes

bojeil-google assigned hiranya911 and unassigned bojeil-google May 21, 2019

hiranya911 added 3 commits May 21, 2019 13:03

Merge branch 'master' into hkj-retry-config

f9932af

Updated documentation; Improved a clock-based test using fake timers

5494181

Merge branch 'hkj-retry-config' of github.com:firebase/firebase-admin…

b4674df

…-node into hkj-retry-config

hiranya911 assigned bojeil-google and unassigned hiranya911 May 21, 2019

bojeil-google suggested changes May 22, 2019

View reviewed changes

bojeil-google assigned hiranya911 and unassigned bojeil-google May 22, 2019

Fixed a typo; Updated comment

d53a9ae

hiranya911 requested a review from bojeil-google May 22, 2019 18:34

hiranya911 assigned bojeil-google and unassigned hiranya911 May 22, 2019

Trigger builds

c9c836c

bojeil-google approved these changes May 22, 2019

View reviewed changes

bojeil-google assigned hiranya911 and unassigned bojeil-google May 22, 2019

hiranya911 merged commit add8656 into master May 22, 2019

hiranya911 deleted the hkj-retry-config branch May 22, 2019 21:12

snyk-bot mentioned this pull request Dec 24, 2020

[Snyk] Security upgrade @google-cloud/firestore from 0.15.2 to 0.21.0 Jeremip11/firebase-admin-node#6

Open

snyk-bot mentioned this pull request Sep 6, 2021

[Snyk] Security upgrade @google-cloud/firestore from 0.15.2 to 0.21.0 Jeremip11/firebase-admin-node#9

Open

regnaio mentioned this pull request Mar 24, 2022

[FR] Custom default RetryConfig #1615

Open

Jeremip11 mentioned this pull request Oct 27, 2023

[Snyk] Security upgrade @google-cloud/firestore from 0.15.2 to 0.21.0 Jeremip11/firebase-admin-node#16

Open

Jeremip11 mentioned this pull request Dec 14, 2023

[Snyk] Fix for 1 vulnerabilities Jeremip11/firebase-admin-node#20

Open

This was referenced Jan 1, 2024

[Snyk] Fix for 1 vulnerabilities Jeremip11/firebase-admin-node#22

Open

[Snyk] Fix for 1 vulnerabilities Jeremip11/firebase-admin-node#23

Open

Jeremip11 mentioned this pull request Mar 15, 2024

[Snyk] Fix for 1 vulnerabilities Jeremip11/firebase-admin-node#24

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Internal API for retrying HTTP requests #518

Internal API for retrying HTTP requests #518

hiranya911 commented Apr 30, 2019

bojeil-google May 18, 2019

hiranya911 May 21, 2019

bojeil-google May 20, 2019

hiranya911 May 21, 2019 •

edited

Loading

bojeil-google May 20, 2019

hiranya911 May 21, 2019

bojeil-google May 20, 2019

hiranya911 May 21, 2019

bojeil-google May 20, 2019

hiranya911 May 21, 2019

hiranya911 commented May 21, 2019

bojeil-google May 22, 2019

hiranya911 May 22, 2019

bojeil-google May 22, 2019

hiranya911 May 22, 2019

Internal API for retrying HTTP requests #518

Internal API for retrying HTTP requests #518

Conversation

hiranya911 commented Apr 30, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hiranya911 May 21, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hiranya911 commented May 21, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hiranya911 May 21, 2019 •

edited

Loading