Batch mget requests on dashboard load and cache field_stat results #10081

stacey-gammon · 2017-01-26T16:30:52Z

Took @nreese's PR (#10050) and added some additional logic to avoid multiple field_stat requests.

Here is my finding after playing around with loading times of a dashboard with 30 visualizations (note: all are from the same index, requiring only a single field_stats lookup).

Total # of runs: 10
Total # of visualizations: 30

Variation	total requests	Avg page load	lowest	highest
Current	87	5.928s	5.6s	6.65s
one mget	58	5.794s	5.37s	6.29s
one mget and one field stat	29	4.95s	4.8s	5.3s

So it appears that this implementation will shave off about a second of page load time. I suspect that number would grow for slow connection speeds.

…single _mget

…defer

trevan · 2017-01-26T16:37:07Z

Every now and then, I'll have one of the field stats request take a long time (>60 seconds). Only sending one field stats request per dashboard has been very beneficial in speed for me.

stacey-gammon · 2017-01-26T16:39:28Z

Only sending one field stats request per dashboard has been very beneficial in speed for me.

@trevan You tested this PR and found it helped? Or do you mean you have already implemented logic to send only one field_stats api request and found it helped?

trevan · 2017-01-26T16:41:37Z

@stacey-gammon, I already had extremely similar logic to what you have and found it very helpful.

spalger

Sorry I let this sit for so long, I've been heads down on #9853 for a while

spalger · 2017-02-03T22:52:41Z

src/ui/public/courier/saved_object/saved_object.js

@@ -127,7 +127,7 @@ export default function SavedObjectFactory(esAdmin, kbnIndex, Promise, Private,
     * @return {Promise}
     * @resolved {SavedObject}
     */
-    this.init = _.once(() => {
+    this.init = _.once((isDefered) => {


A few observations/questions:

_.once()'d functions should almost never accept arguments

What does isDefered mean? What is/should be defered? The Saved Object?

Calling the docSource._createRequest method from here feels like a violation of the intended public api for the docSource. Why not just change how docSource.fetch() functions?

@spalger isDefered tells init how to load the saved object. When false, the saved object is fetched immediately. When true, the saved object fetch request is placed on the request queue ... it is deferred.

The PR was just trying to keeping things simple. I did not want to re-engineer anything. Kibana already had the concept of a request queue and I figured this was a good use for it.

Yeah, I think that's a good idea. If we want to keep the current code I would probably change the variable to deferDocSourceRequest (my second comment was more about naming).

That said, I think all calls to docSource.fetch() should probably be using the request queue. I don't think there is any reason to require requesters to opt-into this style of fetching. One wrinkle in that is the interval at which the request queue is cleared. We might need to have some debounced request clearing trigger that gets triggered each time docSource.fetch() is called on any source...

Maybe something like this (pseudo code):

// DocSource class fetch() { const request = new Request({ urgent: true }); requestQueue.addNew(request) return request.defer.promise }

// in RequestQueue class addNew(req) { this._all.push(req); this._scheduleCheck() } _scheduleCheck() { if (this._pendingCheck) { clearTimeout(this._pendingCheck) } this._pendingCheck = setTimeout( () => this._checkForReadyRequest(), CLEAR_REQUEST_QUEUE_DEBOUNCE_MS ) } _checkForReadyRequests() { return this._all.filter(req => { return req.isUrgent() || req.isNaturallyReady() }) }

That said, I think all calls to docSource.fetch() should probably be using the request queue.

What about when just retrieving a single object by id? E.g. this gets called when you save a dashboard because it reloads the dashboard you just saved by id. In that case, maybe it's not worth the overhead? It looks like the looper checks for requests every 1.5 seconds, so that's adding an average .75s delay when just getting a single object.

That said, I agree about the naming, but what about batchRequest? I like batch better than defer, as it gives more reasoning behind it.

I feel like the entire batching logic could be improved to circumvent the looper delay entirely. Send in an array of savedObjects, or ids, and grab them all at once in a single batch (or split batches up to avoid any one batch from being too large if that was a potential problem). But that would be too large of a code overhaul at this point, so IMO, just doing the rename might be the best way to go at this point.

spalger · 2017-02-03T23:13:13Z

src/ui/public/courier/fetch/strategy/search.js

@@ -15,6 +15,8 @@ export default function FetchStrategyForSearch(Private, Promise, timefilter, kbn
     * @return {Promise} - a promise that is fulfilled by the request body
     */
    reqsFetchParamsToBody: function (reqsFetchParams) {
+      const indexToListMapping = {};


Just pushed updates, not really qualified to review anymore

spalger · 2017-02-08T19:14:21Z

@nreese, @stacey-gammon and I worked on integrating the batching into the courier itself, I just pushed the changes we worked on that make all calls to DataSource#fetch() batch by a tiny amount, rather than waiting for the next tick of the looper (applies to data and search sources). Mind taking a look?

@pickypg another review from you would be helpful too

stacey-gammon · 2017-02-08T19:38:26Z

src/ui/public/courier/fetch/fetch.js

-    else return fetchThese(requests);
-  }
+  const debouncedFetchThese = _.debounce(() => {
+    const requests = requestQueue.get().filter(req => req.isFetchRequested());


Did this end up not being a problem with requests being fetched more than once?

Oops, forgot to update this to the new req.isFetchRequestedAndPending() method that takes requests' started state into account.

stacey-gammon

One minor comment, otherwise looks good! Though some of this is my code, so should have another reviewer too,

stacey-gammon · 2017-02-09T19:39:38Z

src/ui/public/courier/fetch/request/request.js

+     *  @return {Boolean}
+     */
+    isFetchRequestedAndPending() {
+      return !!this._fetchRequested && !this.started;


Why the need for the double negative? Is it to avoid the undefined case? It'll still all work as expected right? Maybe just initialize this._fetchRequested to false in that case?

Just seems a little strange when the value is essentially a boolean, and the double negative requires an extra extra brain cycle or two to make the conversion. :)

Forces boolean true out of anything. I agree that it's unusual since it's part of a larger boolean statement and now I wonder if there's more to the trick than I remember or just a force of habit.

Yeah, when it was just returning the value of _fetchRequested I was a little more concerned about it always being a boolean value, but now that it's returning the && calculation it's totally unnecessary.

pickypg

LGTM

Backports PR #10081 **Commit 1:** defer loading visualization saved objects so they can be loaded in a single _mget * Original sha: 5c1344e * Authored by nreese <[email protected]> on 2017-01-24T23:41:05Z **Commit 2:** Merge branch 'defer' of https://github.com/nreese/kibana into nreese-defer * Original sha: 96af3ea * Authored by Stacey Gammon <[email protected]> on 2017-01-26T13:44:55Z **Commit 3:** Don't request field stats more than once for the same index pattern * Original sha: 9a02e5b * Authored by Stacey Gammon <[email protected]> on 2017-01-26T15:57:34Z **Commit 4:** [ui/courier] batch fetch requests for all searches and docs * Original sha: 20d55fe * Authored by spalger <[email protected]> on 2017-02-06T22:13:23Z **Commit 5:** [ui/courier] remove remaining mentions of req.isFetchRequested() * Original sha: f5bd5ca * Authored by spalger <[email protected]> on 2017-02-08T21:22:32Z **Commit 6:** [courier/fetch/request] remove unneceessary !! * Original sha: eb5446d * Authored by spalger <[email protected]> on 2017-02-09T20:48:48Z

…10273) Backports PR #10081 **Commit 1:** defer loading visualization saved objects so they can be loaded in a single _mget * Original sha: 5c1344e * Authored by nreese <[email protected]> on 2017-01-24T23:41:05Z **Commit 2:** Merge branch 'defer' of https://github.com/nreese/kibana into nreese-defer * Original sha: 96af3ea * Authored by Stacey Gammon <[email protected]> on 2017-01-26T13:44:55Z **Commit 3:** Don't request field stats more than once for the same index pattern * Original sha: 9a02e5b * Authored by Stacey Gammon <[email protected]> on 2017-01-26T15:57:34Z **Commit 4:** [ui/courier] batch fetch requests for all searches and docs * Original sha: 20d55fe * Authored by spalger <[email protected]> on 2017-02-06T22:13:23Z **Commit 5:** [ui/courier] remove remaining mentions of req.isFetchRequested() * Original sha: f5bd5ca * Authored by spalger <[email protected]> on 2017-02-08T21:22:32Z **Commit 6:** [courier/fetch/request] remove unneceessary !! * Original sha: eb5446d * Authored by spalger <[email protected]> on 2017-02-09T20:48:48Z

…10274) Backports PR #10081 **Commit 1:** defer loading visualization saved objects so they can be loaded in a single _mget * Original sha: 5c1344e * Authored by nreese <[email protected]> on 2017-01-24T23:41:05Z **Commit 2:** Merge branch 'defer' of https://github.com/nreese/kibana into nreese-defer * Original sha: 96af3ea * Authored by Stacey Gammon <[email protected]> on 2017-01-26T13:44:55Z **Commit 3:** Don't request field stats more than once for the same index pattern * Original sha: 9a02e5b * Authored by Stacey Gammon <[email protected]> on 2017-01-26T15:57:34Z **Commit 4:** [ui/courier] batch fetch requests for all searches and docs * Original sha: 20d55fe * Authored by spalger <[email protected]> on 2017-02-06T22:13:23Z **Commit 5:** [ui/courier] remove remaining mentions of req.isFetchRequested() * Original sha: f5bd5ca * Authored by spalger <[email protected]> on 2017-02-08T21:22:32Z **Commit 6:** [courier/fetch/request] remove unneceessary !! * Original sha: eb5446d * Authored by spalger <[email protected]> on 2017-02-09T20:48:48Z

nreese and others added 3 commits January 24, 2017 16:41

defer loading visualization saved objects so they can be loaded in a …

5c1344e

…single _mget

Merge branch 'defer' of https://github.com/nreese/kibana into nreese-…

96af3ea

…defer

Don't request field stats more than once for the same index pattern

9a02e5b

stacey-gammon mentioned this pull request Jan 26, 2017

request dashboard visualizations and saved searchs in single _mget request #10050

Closed

stacey-gammon requested a review from spalger January 26, 2017 16:34

stacey-gammon added Feature:Dashboard Dashboard related features :Sharing review v5.3.0 labels Jan 26, 2017

stacey-gammon assigned spalger Jan 31, 2017

stacey-gammon requested a review from pickypg February 2, 2017 17:07

stacey-gammon assigned pickypg Feb 2, 2017

spalger previously requested changes Feb 3, 2017

View reviewed changes

spalger reviewed Feb 3, 2017

View reviewed changes

[ui/courier] batch fetch requests for all searches and docs

20d55fe

spalger force-pushed the nreese-defer branch from e9faa23 to 20d55fe Compare February 8, 2017 19:11

stacey-gammon commented Feb 8, 2017

View reviewed changes

[ui/courier] remove remaining mentions of req.isFetchRequested()

f5bd5ca

stacey-gammon commented Feb 9, 2017

View reviewed changes

pickypg approved these changes Feb 9, 2017

View reviewed changes

[courier/fetch/request] remove unneceessary !!

eb5446d

spalger merged commit f2da2a3 into elastic:master Feb 9, 2017

elastic-jasper mentioned this pull request Feb 9, 2017

[5.x] Batch mget requests on dashboard load and cache field_stat results #10273

Merged

elastic-jasper mentioned this pull request Feb 9, 2017

[5.3] Batch mget requests on dashboard load and cache field_stat results #10274

Merged

spalger added v5.4.0 v6.0.0 labels Feb 9, 2017

stacey-gammon mentioned this pull request Feb 14, 2017

Slow dashboard loading - kibana requests each visualization in a separate XHR request #9662

Closed

Bargs mentioned this pull request Feb 15, 2017

Quickly switching saved searches leaves Discover in invalid state #10379

Closed

stacey-gammon mentioned this pull request Mar 28, 2017

Top hits aggregation doesn't work on dashboard if there are no documents in view #10905

Closed

stacey-gammon deleted the nreese-defer branch April 6, 2017 11:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch mget requests on dashboard load and cache field_stat results #10081

Batch mget requests on dashboard load and cache field_stat results #10081

stacey-gammon commented Jan 26, 2017 •

edited by tbragin

Loading

trevan commented Jan 26, 2017

stacey-gammon commented Jan 26, 2017

trevan commented Jan 26, 2017

spalger left a comment

spalger Feb 3, 2017

nreese Feb 3, 2017

spalger Feb 3, 2017 •

edited

Loading

spalger Feb 3, 2017

stacey-gammon Feb 6, 2017

spalger Feb 3, 2017

spalger commented Feb 8, 2017

stacey-gammon Feb 8, 2017

spalger Feb 8, 2017 •

edited

Loading

stacey-gammon left a comment

stacey-gammon Feb 9, 2017

pickypg Feb 9, 2017

spalger Feb 9, 2017

pickypg left a comment

Batch mget requests on dashboard load and cache field_stat results #10081

Batch mget requests on dashboard load and cache field_stat results #10081

Conversation

stacey-gammon commented Jan 26, 2017 • edited by tbragin Loading

trevan commented Jan 26, 2017

stacey-gammon commented Jan 26, 2017

trevan commented Jan 26, 2017

spalger left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

spalger Feb 3, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

spalger commented Feb 8, 2017

Choose a reason for hiding this comment

spalger Feb 8, 2017 • edited Loading

Choose a reason for hiding this comment

stacey-gammon left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pickypg left a comment

Choose a reason for hiding this comment

stacey-gammon commented Jan 26, 2017 •

edited by tbragin

Loading

spalger Feb 3, 2017 •

edited

Loading

spalger Feb 8, 2017 •

edited

Loading