Archived settings prevent updating other settings #28026
Comments
We discussed this during Fix-it-Friday and agreed that we should not archive unknown and broken cluster settings. Instead, we should fail to recover the cluster state. The solution for users in an upgrade case would be to roll back to the previous version, address the settings that would be unknown or broken in the next major version, and then proceed with the upgrade.
The solution does not seem to apply to transient settings. I'm getting an acknowledgement from ES, but the invalid setting stays (in my case ...).
@otrosien how were you able to keep transient settings between versions? Did you do a rolling upgrade from 5.6 to 6.x?
@otrosien's teammate here. @mayya-sharipova Yes, we did a rolling upgrade of Elasticsearch. After the upgrade, the transient settings remained, but trying to either remove the unsupported setting or change any other setting in the transient set throws an error.
For us the problem is not "archival" of bad settings, but the complete inability to edit transient settings now that they contain one unsupported setting. We can update any persistent settings because those were empty before the upgrade, but for the settings that exist in our transient settings, the transient versions take precedence according to the documentation: https://www.elastic.co/guide/en/elasticsearch/reference/6.1/cluster-update-settings.html#_precedence_of_settings We would expect a bugfix release of Elasticsearch that allows this cleanup without requiring a full cluster restart. At this point, the only option we have is to create a new cluster in parallel, index to it, and change DNS settings. This is extremely expensive, because our cluster is large(ish), with hundreds of data nodes. Service disruption by way of a full-cluster restart is not an option for us.
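For illustration, a request of roughly this shape is what gets rejected; the setting name below is only a stand-in, not the actual setting from the cluster described above:

```
PUT _cluster/settings
{
  "transient": {
    "cluster.routing.allocation.enable": "all"
  }
}
```

The response is an error complaining that the leftover transient setting is not a valid setting, even though that setting is not the one being changed.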
@mayya-sharipova we tried all variations of removing that setting. Apparently it was not moved to the archived.* namespace.
Having the same issue in #28524. We were unable to roll back, so a force-reset solution would be nice. Since it's a production cluster, we also don't want to shut it down for this...
If the official solution is what @jasontedor said, this should really make it into the documentation on the rolling upgrade procedure.
This should not be the official solution for this. We get a hell of a lot of errors when downgrading / rolling back.
There is a misunderstanding here. The comment that is being referred to as the "official solution" is not a solution. It is a proposal for how we should change Elasticsearch so that users cannot end up in the situation that is causing so many problems here. It requires code changes to implement that solution and a new release carrying that solution.
Thanks @jasontedor for the clarification.
@faxm0dem The current workaround is to remove the archived settings by setting archived.* to null in a cluster settings update.
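For reference, a minimal sketch of that workaround, as confirmed later in this thread and in the cluster update settings documentation; clearing both scopes covers whichever one holds the archived entries:

```
PUT _cluster/settings
{
  "persistent": {
    "archived.*": null
  },
  "transient": {
    "archived.*": null
  }
}
```

As the comments below note, this only helps once the broken settings have actually been archived, and it has to be done before updating any other settings.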
If you have dedicated master nodes, we were able to work around this by downgrading them to a previous version (5.6.1 in our case), then removing the offending settings, then re-upgrading.
Oh, very cool, thanks! @scratchy can you try this?
I work with @waltrinehart. We were able to apply the downgrade-master workaround for ... The only real solution that we see right now is a software patch allowing us to remove this setting and move forward.
We found that shutting down all of our master nodes simultaneously and starting them back up was sufficient to clear the offending setting. The cluster still required initializing all the shards even though the data nodes stayed up. This isn't possible for everyone though, so I think an alternative path without such disruption is still needed. Following cluster recovery we saw the setting was properly archived and could be removed. This confirms that it is an issue that crops up during rolling upgrades.
We integrated a change (#28888) that will automatically archive any unknown or invalid settings on any settings update. This prevents their presence from failing the request, and once archived they can be deleted.
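A sketch of the flow after that change, assuming a cluster whose state still carries an unknown setting; the setting being updated below is just a stand-in:

```
# this update is no longer rejected; the unknown setting is archived as part of it
PUT _cluster/settings
{
  "transient": {
    "cluster.routing.allocation.enable": "all"
  }
}

# the archived copy can then be removed
PUT _cluster/settings
{
  "transient": {
    "archived.*": null
  }
}
```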
@jasontedor do you know when this will be released?
Currently, unknown or invalid cluster settings get archived. For a better user experience, we stop archiving broken cluster settings. Instead, we will fail to recover the cluster state. The solution for users in an upgrade case would be to roll back to the previous version, address the settings that would be unknown or invalid in the next major version, and then proceed with the upgrade. Closes elastic#28026
I'm no expert, but I'm suffering from this bug/situation right now and, if you're looking for QA feedback: this has put our production deployment in a very precarious state.
I am running ES 6.3.0 and I executed: ... and restarted the full cluster. That did it for me.
The situation described in the OP is still true today (e.g. for upgrades from snapshots built from ...). Do we still consider this a bug? We could say that if you upgrade your cluster without addressing all the deprecation warnings first, then there is a risk that some things may not work for you. In this case it's ...
We discussed this today and agreed that we are happy with the behaviour as it stands, so this can be closed.
Hey team, sorry to dig up an old issue, but we just hit this during a cloud-observability upgrade (from 6.8 to 7.8). Some of our clusters have a setting ... which is apparently not supported in 7.x and hence got archived. When the upgrade succeeds, those settings leave the cluster basically unusable (at least on Elastic Cloud).
@chingis-elastic that this was not caught ahead of the upgrade sounds like it might be a bug somewhere in the deprecation or upgrade assistance areas. Would you open a new issue for it to make sure that gets investigated? Closed issues like this don't normally see any further activity.
It looks like if you: ... then you get an error back about the archived setting not being a valid setting. You can clear the archived setting with:
```
PUT _cluster/settings
{
  "persistent": {
    "archived.*": null
  }
}
```
but you must do this before updating any other settings. It feels like you should be able to deal with the archived settings at your leisure. I put together a test that reproduces this by adding a test case to FullClusterRestartIT.
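A console-level sketch of that reproduction (independent of the Java test); the removed setting's name below is a placeholder for any setting that was valid on the old version but no longer exists on the new one:

```
# on the old cluster: persist a setting that the new version will not recognise
PUT _cluster/settings
{
  "persistent": {
    "some.removed.setting": "value"
  }
}

# ... full cluster restart onto the new version, which archives the setting ...

# on the new cluster: updating any other setting now fails with an error
# saying the archived setting is not a valid setting
PUT _cluster/settings
{
  "persistent": {
    "indices.recovery.max_bytes_per_sec": "50mb"
  }
}

# clearing the archived entries first lets subsequent updates go through
PUT _cluster/settings
{
  "persistent": {
    "archived.*": null
  }
}
```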