Can we somehow make the engine's LiveVersionMap tracking optional? #19787

mikemccand · 2016-08-03T18:19:26Z

I'm opening this to discuss possible options:

I've been scrutinizing ES indexing performance on the NYC taxi data set (1.2 B taxi rides, numerics heavy: http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml).

These documents are small (24 fields, though a bit sparse with ~23% of cells missing) and are almost entirely numbers (indexed as points + doc values).

As a "ceiling" for indexing performance I also indexed the same data set using Lucene's "thin wrapper" demo server (http://github.com/mikemccand/luceneserver), indexing the same documents as efficiently as I know how (see indexTaxis.py).

The demo Lucene server has many differences vs. ES: it has no transaction log (does not periodically fsync), uses addDocuments not updateDocument, can index from a more compact document source (190 GB CSV file, vs 512 GB JSON file for ES), does not add a costly _uid field (nor _version, _type), uses a streaming bulk API, etc. I disabled _all and _source in ES, but net/net ES is substantially slower than the demo Lucene server.

So, one big thing I noticed that is maybe a lowish hanging fruit is that ES loses a lot of its indexing buffer to LiveVersionMap: if I give ES 1 GB indexing buffer, and index into only 1 shard, and disable refresh, the version map is taking ~2/3 of that buffer, leaving only ~1/3 for Lucene's IndexWriter:

node0: [2016-08-03 09:39:07,557][DEBUG][index.engine             ] [node0] [taxis][0] use refresh to write indexing buffer (heap size=[313.7mb]), to also clear version map (heap size=[730.3mb])

This also means ES is necessarily doing periodic refresh when I didn't ask it to.
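
As a quick check of the split reported in that log line, here is a small arithmetic sketch in Python. The two heap sizes are copied from the log; the variable names are only illustrative.

```python
# Heap sizes copied from the DEBUG log line above, in MB.
indexing_buffer_mb = 313.7  # IndexWriter's indexing buffer
version_map_mb = 730.3      # LiveVersionMap

total_mb = indexing_buffer_mb + version_map_mb
version_map_share = version_map_mb / total_mb

print(f"total tracked heap: {total_mb:.1f} MB")
print(f"version map share:  {version_map_share:.1%}")
print(f"IndexWriter share:  {1 - version_map_share:.1%}")
```

The two sizes add up to roughly the 1 GB buffer, with about 70% of it held by the version map, which is in line with the ~2/3 estimate.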

This is quite frustrating because I don't need optimistic concurrency here, nor real-time gets, nor refreshes. However, I fear the version map might be required during recovery, to ensure when playing back indexing operations from the transaction log that they do not incorrectly overwrite newer indexing operations? But then, this use case is also append-only, so maybe during recovery we could safely skip that, if the user turns on this new setting.

The version map makes an entry in a HashMap for each document indexed, and the entry stores non-trivial information, creating at least 4 new objects, holding longs/ints, etc. If we can't make it turn-off-able maybe we should instead try to reduce its per-indexing-op overhead...

s1monw · 2016-08-03T19:37:20Z

I think we can disable the live-version map since it has basically 2 purposes:

caching version lookups, which is not important if you do append only, or even if you don't use updates at all
persisting tombstones for a certain amount of time (GC deletes), which is also not needed if you don't delete.

I think we can make it an option to have a no-op version map that simply doesn't do anything.
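
A rough sketch of what such a pluggable no-op map could look like, written in Python for brevity. The class and method names (`HashVersionMap`, `NoOpVersionMap`, `put`, `get`, `size`) are made up for illustration and are not the engine's actual API.

```python
from typing import Dict, Optional


class HashVersionMap:
    """Toy stand-in for a live version map: one entry per indexed doc."""

    def __init__(self) -> None:
        self._versions: Dict[str, int] = {}

    def put(self, uid: str, version: int) -> None:
        self._versions[uid] = version

    def get(self, uid: str) -> Optional[int]:
        return self._versions.get(uid)

    def size(self) -> int:
        return len(self._versions)


class NoOpVersionMap:
    """Same interface, but remembers nothing, so it holds no per-doc entries."""

    def put(self, uid: str, version: int) -> None:
        pass  # deliberately drop the entry

    def get(self, uid: str) -> Optional[int]:
        return None  # the caller has to fall back to the index

    def size(self) -> int:
        return 0


def index_docs(version_map, uids) -> int:
    """Append-only indexing loop: every doc is new and gets version 1."""
    for uid in uids:
        version_map.put(uid, 1)
    return version_map.size()


uids = [f"ride-{i}" for i in range(1000)]
tracked = index_docs(HashVersionMap(), uids)
untracked = index_docs(NoOpVersionMap(), uids)
print(tracked, untracked)
```

In this toy model the hash-backed map ends up with one entry per document, while the no-op variant stays empty regardless of how many documents are indexed.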

dakrone · 2016-08-03T20:08:13Z

Even in the append only use case, I was under the impression that versions were still necessary due to replication having a situation where a document could be sent more than once during network trouble, and _version being the only way to prevent duplicate documents in that case?

s1monw · 2016-08-03T20:26:05Z

> Even in the append only use case, I was under the impression that versions were still necessary due to replication having a situation where a document could be sent more than once during network trouble, and _version being the only way to prevent duplicate documents in that case?

The version will always be loaded from the index in that case, so that should be fine. The only problem is if you are using deletes here. If you do that, we need to use the version map.

dakrone · 2016-08-03T20:27:34Z

> the version will always be loaded from the index in that case that should be fine.

Okay, thanks for explaining!

s1monw · 2016-08-03T20:35:16Z

One option would be to keep only the deletes in the map and therefore use less memory, or no memory at all in the append-only case. I think we can make this work if realtime GET is disabled somehow; then it should not make any difference.
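
The deletes-only idea can be sketched roughly as follows, in Python. `DeletesOnlyVersionMap` and its methods are hypothetical names, and the rule that an index op clears an earlier tombstone is an assumption made for this sketch, not something taken from the engine.

```python
from typing import Dict, Optional


class DeletesOnlyVersionMap:
    """Hypothetical map that tracks tombstones but ignores plain index ops."""

    def __init__(self) -> None:
        self._tombstones: Dict[str, int] = {}

    def on_index(self, uid: str, version: int) -> None:
        # Assumption for this sketch: re-indexing a uid drops its tombstone.
        self._tombstones.pop(uid, None)

    def on_delete(self, uid: str, version: int) -> None:
        self._tombstones[uid] = version

    def tombstone_version(self, uid: str) -> Optional[int]:
        return self._tombstones.get(uid)

    def size(self) -> int:
        return len(self._tombstones)


# Append-only workload: nothing is ever stored.
append_only = DeletesOnlyVersionMap()
for i in range(1000):
    append_only.on_index(f"ride-{i}", 1)

# Same workload followed by ten deletes: only the deletes are stored.
with_deletes = DeletesOnlyVersionMap()
for i in range(1000):
    with_deletes.on_index(f"ride-{i}", 1)
for i in range(10):
    with_deletes.on_delete(f"ride-{i}", 2)

print(append_only.size(), with_deletes.size())
```

Memory use in this model scales with the number of deletes rather than the number of indexed documents.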

mikemccand · 2016-08-03T20:42:18Z

Maybe we should simply remove real-time get? Is near-real-time get really not good enough? We default refresh to every 1s. Or users can use "wait for refresh" (#17986), coming in 5.0.0.

dakrone · 2016-08-03T20:45:36Z

> Maybe we should simply remove real-time get?

What about making it a per-index setting that defaults to disabled? I can think of some use cases for our plugins that make/made use of realtime get

s1monw · 2016-08-03T21:11:53Z

I guess what we all missed here is that documents that are not yet refreshed are held in the version map, since we can't load their versions from the index yet. I am sorry, but it won't be that simple. :(
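
A toy model of that problem, in Python. `ToyEngine` and all of its fields are hypothetical and far simpler than the real engine; the only point is that an index lookup sees refreshed documents only, so without the map a second operation on the same id cannot see the first one until a refresh happens.

```python
class ToyEngine:
    """Hypothetical, heavily simplified engine used only for illustration."""

    def __init__(self, use_version_map: bool) -> None:
        self.use_version_map = use_version_map
        self.version_map = {}  # uid -> version for ops not yet refreshed
        self.pending = {}      # indexed, but not yet visible to lookups
        self.searchable = {}   # visible after a refresh

    def current_version(self, uid):
        if self.use_version_map and uid in self.version_map:
            return self.version_map[uid]
        # An index lookup can only see refreshed documents.
        return self.searchable.get(uid)

    def index(self, uid):
        current = self.current_version(uid)
        new_version = 1 if current is None else current + 1
        self.pending[uid] = new_version
        if self.use_version_map:
            self.version_map[uid] = new_version
        return new_version

    def refresh(self):
        self.searchable.update(self.pending)
        self.pending.clear()
        self.version_map.clear()


with_map = ToyEngine(use_version_map=True)
versions_with_map = [with_map.index("a"), with_map.index("a")]

without_map = ToyEngine(use_version_map=False)
versions_without_map = [without_map.index("a"), without_map.index("a")]

print(versions_with_map, versions_without_map)
```

With the map, the second operation gets version 2. Without it, both operations are assigned version 1 because the first one is not visible in the index yet.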

bleskes · 2016-08-04T21:19:38Z

Simple it won't be indeed. I wrote up the results of previous discussions in a new meta issue: #19813

I think we can close this one in favor of that, but I'll let @mikemccand make that call :)

mikemccand · 2016-08-05T08:12:52Z

OK good I'll close this issue; thanks @bleskes.

mikemccand · 2016-08-12T19:31:19Z

Actually, I think we should have separate issues to track the individual improvements: getting ES back to the indexing performance of raw Lucene is going to be a big project, with many separate improvements. We can use the meta issue #19813 to track overall progress, but I think we should keep separate issues like this one and #19913 open to track progress of each small step.

Today we do a lot of accounting inside the engine to maintain locations of documents inside the transaction log. This is only needed to ensure we can return the document's source from the engine if it hasn't been refreshed. Aside from the added complexity of being able to read from the currently writing translog and of maintaining pointers into the translog, this also caused inconsistencies, like different values of the `_ttl` field depending on whether it was read from the translog or not. TermVectors are totally different if the document is fetched from the translog, since copy fields are ignored, etc. This change will simply call `refresh` if the document's latest version is not in the index. This streamlines the semantics of the `_get` API and allows for more optimizations inside the engine and on the transaction log. Note: `_refresh` is only called iff the requested document is not refreshed yet but has recently been updated or added. Relates to #19787
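
The refresh-on-get behaviour described above can be sketched like this, in Python. `ToyShard` and its members are hypothetical names for illustration; the real engine decides whether a document is unrefreshed differently, and this sketch just keeps a dict of recent operations.

```python
class ToyShard:
    """Hypothetical shard model: a realtime get refreshes only when needed."""

    def __init__(self) -> None:
        self.unrefreshed = {}   # uid -> source, indexed since the last refresh
        self.searchable = {}    # uid -> source, visible after refresh
        self.refresh_count = 0

    def index(self, uid, source) -> None:
        self.unrefreshed[uid] = source

    def refresh(self) -> None:
        self.searchable.update(self.unrefreshed)
        self.unrefreshed.clear()
        self.refresh_count += 1

    def realtime_get(self, uid):
        # Refresh only if this document was added or updated since the
        # last refresh; otherwise serve it straight from the index.
        if uid in self.unrefreshed:
            self.refresh()
        return self.searchable.get(uid)


shard = ToyShard()
shard.index("a", {"fare": 12.5})

first = shard.realtime_get("a")       # triggers one refresh
second = shard.realtime_get("a")      # already refreshed, no extra refresh
missing = shard.realtime_get("zzz")   # unknown id, no refresh either

print(first, second, missing, shard.refresh_count)
```

Only the first get pays for a refresh in this model; later gets for the same document, and gets for ids with no recent operation, read directly from the refreshed view.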

dnhatn · 2018-04-03T01:10:15Z

I am going to close this. The LiveVersionMap should have been optional since #27752. Thanks all!

mikemccand added >enhancement discuss :Engine labels Aug 3, 2016

mikemccand closed this as completed Aug 5, 2016

mikemccand mentioned this issue Aug 10, 2016

Optimize the case when _version is never specified / always 0 #19913

Closed

mikemccand reopened this Aug 12, 2016

mikemccand mentioned this issue Aug 22, 2016

Use _refresh instead of reading from Translog in the RT GET case #20102

Merged

jpountz mentioned this issue Aug 23, 2016

Fix RAM usage estimation of LiveVersionMap. #20123

Merged

dnhatn closed this as completed Apr 3, 2018
