Snapshot deletion and creation slow down as number of snapshots in repository grows #8958
Comments
Just wanted to chime in: this issue has affected us a great deal as well. It made "sense" once I thought through how ES snapshotting works, but it was an unpleasant surprise.
…th large number of snapshots Each shard repository consists of a snapshot file for each snapshot; this file contains a map between each original physical file that is snapshotted and its representation in the repository. This data includes the original filename, checksum, and length. When a new snapshot is created, Elasticsearch needs to read all these snapshot files to figure out which files are already present in the repository and which files still have to be copied there. This change adds a new index file that combines all this information into a single file. So, if a repository has 1000 snapshots with 1000 shards, Elasticsearch will only need to read 1000 blobs (one per shard) instead of 1,000,000 to delete a snapshot. This change should also improve snapshot creation speed on repositories with a large number of snapshots and high latency. Fixes elastic#8958
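For illustration only, here is a minimal Java sketch of the data layout that commit describes. The names (`ShardSnapshotIndex`, `FileInfo`, `isReferenced`) are hypothetical, not the actual Elasticsearch types; the point is that a single per-shard index blob carries the snapshot-to-file map, so deciding which files to copy or clean needs one blob read per shard instead of one per snapshot per shard.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of the consolidation described above: instead of reading
// one snapshot file per (snapshot, shard) pair to learn which physical files
// already exist in the repository, one per-shard "index" blob holds the union.
class ShardSnapshotIndex {

    /** Metadata for one physical file stored in the repository. */
    record FileInfo(String originalName, String checksum, long length) {}

    /** snapshotId -> files that snapshot references (the combined index blob). */
    private final Map<String, List<FileInfo>> snapshots = new HashMap<>();

    void addSnapshot(String snapshotId, List<FileInfo> files) {
        snapshots.put(snapshotId, files);
    }

    void removeSnapshot(String snapshotId) {
        snapshots.remove(snapshotId);
    }

    /** A file may be cleaned up only if no remaining snapshot references it. */
    boolean isReferenced(String checksum) {
        return snapshots.values().stream()
                .flatMap(List::stream)
                .anyMatch(f -> f.checksum().equals(checksum));
    }
}
```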
I seem to be seeing this behavior with Azure blob storage after upgrading to 1.7.5.
@niemyjski It was fixed in #8969 in 2.0.0 and above. The fix wasn't backported to 1.7.5.
And if you've read this far and were wondering whether the fix for this might ever get backported to 1.x, the answer is apparently not; see imotov's comment on #8969.
In order to create a new snapshot or delete an existing snapshot, Elasticsearch has to load all existing shard-level snapshots to figure out which files need to be copied and which files can be cleaned. The number of files to be checked is equal to `number_of_shards * number_of_snapshots`, which on large clusters with frequent snapshots can lead to very long operation times, especially with non-filesystem repositories. See elastic/elasticsearch-cloud-aws#150 and this group post for examples of issues that this behavior is causing.
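A worked version of that `number_of_shards * number_of_snapshots` cost, using the 1000 × 1000 example from the commit message above. This is back-of-the-envelope arithmetic, not Elasticsearch code or a benchmark:

```java
// Read amplification before vs. after the fix, per the example numbers above.
public class SnapshotReadCost {
    public static void main(String[] args) {
        long shards = 1_000;
        long snapshots = 1_000;

        // Before: one shard-level snapshot file per (shard, snapshot) pair
        // must be read to decide which files to copy or clean.
        long before = shards * snapshots;   // 1,000,000 blob reads

        // After: one combined index blob per shard.
        long after = shards;                // 1,000 blob reads

        System.out.printf("before=%d after=%d%n", before, after);
    }
}
```

On a high-latency repository (S3, Azure blob storage) the difference is dominated by per-blob round trips, which is why the issue calls out non-filesystem repositories specifically.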