Add function to retrieve and cache vector files #62

nickpeihl · 2021-03-31T20:33:28Z

Fixes #60. The getVectorDataOfType function takes either a geojson or topojson argument and retrieves and caches the specified vector file from EMS.

This is similar to how the vector tile stylesheets are cached. For file layers we use the memoize function rather than the once function because the argument may differ between function calls.

Note: Reviewers may want to hide whitespace changes.

thomasneirynck

This looks great. Just two comments for consideration.

thomasneirynck · 2021-04-01T02:00:55Z

src/file_layer.ts

  constructor(config: FileLayerConfig, emsClient: EMSClient, proxyPath: string) {
    super(config, emsClient, proxyPath);
    this._config = config;
  }

+  async getVectorDataOfType(


thoughts on changing this to FileLayer#getGeoJson and doing any format conversion automagically in here if necessary?

If the default format is topojson, ems-client still downloads topojson (because it is smaller over the wire), but performs the geojson conversion "automagically".

The reason for this is that if the conversion does not happen, the client (in this case Kibana), would still need to run the topojson->geojson conversion each time still. Conversion-cost is not that huge of a penalty, but nonetheless it is one. And if ems-client already introduces caching, it may as well cache the usable converted geojson result.

The reason I think we want to avoid making this cache format-dependent is how the suggestEMSTermJoin-function proposed here elastic/kibana#94969 might end up being used.

For example, imagine a client looping over all columns in an index-pattern and calling this function with a few sample-values each to "find" a joinable layer. Even though ems-client transparently caches the downloaded result (great!), the topojson->geojson conversion would still need to happen at each call, which in a tight loop will start to be noticeable.

+1 I like the idea of fully hiding the format logic behind the client, so it only outputs geojson features up. That way library users don't need to know or deal with TopoJSON and just consume a more familiar data representation.

thomasneirynck · 2021-04-01T02:11:18Z

src/file_layer.ts

@@ -37,11 +40,31 @@ type EMSFormats = EmsFileLayerFormatGeoJson | EmsFileLayerFormatTopoJson;
 export class FileLayer extends AbstractEmsService {
  protected readonly _config: FileLayerConfig;

+  private _getVectorDataOfType = _.memoize(


I wonder if we should push the caching into the instance of the EMSClient instead, rather than have a memoized result on each individual FileLayer object. The reason for this is that this cache, in theory, can grow with however many layers EMS-exposes. E.g. if a client just loops over all the FileLayer objects and gets the geojson, this will keep 60+ files in memory.

I'd consider using a single LRU-cache (https://www.npmjs.com/package/lru-cache), maybe with a size of 10, and expose it on the EMSClient.

e.g.

EMSClient#getGeoJsonFromCache(layerId)

and

EMSClient#cacheGeoJson(layerId)

FileLayer#getGeoJson can then just use these methods.

This cache is cleared with EMSClient#_invalidateSettings.

Good idea. I was wondering about that, too.

Maybe this is over engineering, but what about using local storage for caching, so the state is saved between EMS client life cycles? A simple expiration logic would be enough since our assets don't change much.

I believe there are localstorage polyfills for node users and our own tests, but never used them though.

Maybe this is over engineering, but what about using local storage for caching, so the state is saved between EMS client life cycles? A simple expiration logic would be enough since our assets don't change much.

I believe there are localstorage polyfills for node users and our own tests, but never used them though.

I think it's better to retain a limited cache in memory rather than in the local storage. We may need to fix data in the service and I don't think we can invalidate user's locally stored data easily.

If we really wanted to have persistent caching, I think we would use Service Workers in the browser. There we can set TTLs and have native cross-browser support. But I don't think we use Service Workers anywhere in Kibana now and I don't want to add them without having a larger discussion with the team.

Fair enough 👍

thomasneirynck

thx, this is great, and going to be a nice little win for any maps user.

thomasneirynck · 2021-04-12T14:37:32Z

test/ems_client.test.ts


-describe('ems_client', () => {


Add function to retrieve and cache vector files

8b668d8

nickpeihl requested review from jsanz and thomasneirynck March 31, 2021 20:33

nickpeihl added the v7.13.0 label Mar 31, 2021

thomasneirynck reviewed Apr 1, 2021

View reviewed changes

Fetch and cache geojson

a7d6c83

nickpeihl requested a review from thomasneirynck April 5, 2021 23:58

thomasneirynck approved these changes Apr 12, 2021

View reviewed changes

test/ems_client.test.ts

describe('ems_client', () => {

Copy link

Contributor

thomasneirynck Apr 12, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:)

nickpeihl merged commit 2d1b7b7 into elastic:master Apr 12, 2021

nickpeihl mentioned this pull request Jun 7, 2021

EMS Boundaries do not render in Maps OR in dashboards, ONLY when edited from dashboard elastic/kibana#101497

Closed

jsanz mentioned this pull request Jul 10, 2024

Add script to check alignment with Kibana dependencies, update Lodash, downgrade lru-cache #242

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add function to retrieve and cache vector files #62

Add function to retrieve and cache vector files #62

nickpeihl commented Mar 31, 2021 •

edited

Loading

thomasneirynck left a comment

thomasneirynck Apr 1, 2021

jsanz Apr 1, 2021

thomasneirynck Apr 1, 2021 •

edited

Loading

nickpeihl Apr 1, 2021

jsanz Apr 2, 2021

nickpeihl Apr 2, 2021

jsanz Apr 2, 2021

thomasneirynck left a comment

thomasneirynck Apr 12, 2021

Add function to retrieve and cache vector files #62

Add function to retrieve and cache vector files #62

Conversation

nickpeihl commented Mar 31, 2021 • edited Loading

thomasneirynck left a comment

Choose a reason for hiding this comment

thomasneirynck Apr 1, 2021

Choose a reason for hiding this comment

jsanz Apr 1, 2021

Choose a reason for hiding this comment

thomasneirynck Apr 1, 2021 • edited Loading

Choose a reason for hiding this comment

nickpeihl Apr 1, 2021

Choose a reason for hiding this comment

jsanz Apr 2, 2021

Choose a reason for hiding this comment

nickpeihl Apr 2, 2021

Choose a reason for hiding this comment

jsanz Apr 2, 2021

Choose a reason for hiding this comment

thomasneirynck left a comment

Choose a reason for hiding this comment

thomasneirynck Apr 12, 2021

Choose a reason for hiding this comment

nickpeihl commented Mar 31, 2021 •

edited

Loading

thomasneirynck Apr 1, 2021 •

edited

Loading