Compatibility with segment replication #1023

dreamer-89 · 2023-06-29T02:19:17Z

Summary

The 2.9.0 release brings many enhancements to the segment replication feature [1][2], which went GA in 2.7.0, so we need to ensure that the different plugins are compatible with its current state. Previously, we ran tests on plugin repos to verify this compatibility, but we want plugin owners to be aware of these changes so that any required updates can be made. With the 2.10.0 release, the remote store feature goes GA. Remote store internally uses only the SEGMENT replication strategy, i.e. it forces all indices to use SEGMENT replication. It is therefore important to validate that plugins are compatible with segment replication.

What changed

1. Refresh policy behavior

RefreshPolicy.IMMEDIATE immediately refreshes only primary shards, not replica shards. After the refresh, the primary starts a round of segment replication to update the replica shard copies, which leads to eventual consistency.
RefreshPolicy.WAIT_UNTIL normally ensures that the indexing operation is searchable in your cluster, i.e. it gives a read-after-write (RAW) guarantee. With segment replication, this guarantee no longer holds, because replica shards are updated with a delay by asynchronous background refreshes.
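To make the visibility gap concrete, here is a minimal toy model in Python. It is not OpenSearch code: `ToyIndex` and its methods are hypothetical names invented for illustration, and the model only captures the ordering described above (the primary refreshes first, replicas catch up after a replication round).

```python
class ToyIndex:
    """Toy model (hypothetical, not an OpenSearch API) of one primary and its replicas."""

    def __init__(self, replica_count=1):
        self.primary = set()  # doc ids searchable on the primary
        self.replicas = [set() for _ in range(replica_count)]

    def index_immediate(self, doc_id):
        # IMMEDIATE-style refresh under segment replication:
        # only the primary is refreshed as part of the write.
        self.primary.add(doc_id)

    def replicate_segments(self):
        # One asynchronous replication round: replicas copy the primary's state.
        self.replicas = [set(self.primary) for _ in self.replicas]

    def search(self, doc_id, on_primary):
        shard = self.primary if on_primary else self.replicas[0]
        return doc_id in shard


index = ToyIndex()
index.index_immediate("doc-1")

visible_on_primary = index.search("doc-1", on_primary=True)
visible_on_replica_before = index.search("doc-1", on_primary=False)

index.replicate_segments()
visible_on_replica_after = index.search("doc-1", on_primary=False)
```

In this sketch the document is visible on the primary right after the write, but on the replica only after `replicate_segments()` has run, which is the window in which a plugin reading from a replica can see stale data.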

2. Refresh lag on replicas

With segment replication, there is an inherent delay before documents become searchable on replica shard copies, because replicas copy data (segment) files over from the primary. Thus, compared to document replication, replica shards will on average take longer to become consistent with primaries.
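Plugin tests that assert on search results right after a write may need to tolerate this lag. One possible approach, sketched below in Python, is to poll until the document becomes visible or a timeout expires. The helper name `wait_until_visible` and its parameters are assumptions made for illustration; `is_visible` stands in for whatever search call the test actually makes.

```python
import time


def wait_until_visible(is_visible, timeout_s=10.0, interval_s=0.2):
    """Poll is_visible() until it returns True or timeout_s elapses.

    Returns True if the condition was met, False on timeout.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        if is_visible():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)


# Usage with a fake search that only "sees" the document on the third attempt,
# standing in for a replica that has not yet received the new segments.
attempts = {"count": 0}


def fake_replica_search():
    attempts["count"] += 1
    return attempts["count"] >= 3


found = wait_until_visible(fake_replica_search, timeout_s=1.0, interval_s=0.0)
```

Polling keeps the test independent of which shard copy serves the request, at the cost of a bounded wait. It does not give a strong-read guarantee to production code paths.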

3. System/hidden indices support

With opensearch-project/OpenSearch#8200, system and hidden indices are now supported with the SEGMENT replication strategy. We need to ensure there are no bottlenecks that prevent system/hidden indices from working with segment replication.
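For local validation, a plugin owner might create a hidden test index with segment replication enabled. The Python sketch below only builds the JSON body for a create-index request. The helper name `segrep_index_body` is hypothetical, and the settings `index.replication.type` and `index.hidden` are used here as I understand them from the OpenSearch documentation. System indices are registered by plugins rather than created through settings, so they are not covered by this sketch.

```python
import json


def segrep_index_body(hidden=False, replicas=1):
    """Build a create-index request body that opts into SEGMENT replication."""
    index_settings = {
        "replication.type": "SEGMENT",
        "number_of_replicas": replicas,
    }
    if hidden:
        index_settings["hidden"] = True
    return {"settings": {"index": index_settings}}


body = segrep_index_body(hidden=True)
print(json.dumps(body, indent=2))
```

The resulting body would be sent with a `PUT /<index-name>` request. At least one replica is needed for the replica lag to be observable at all.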

Next steps

With segment replication, strong reads are not guaranteed. Thus, if the plugin needs strong read guarantees, particularly to compensate for the changed refresh policy behavior and the refresh lag on replicas (points 1 and 2 above), we need to update its search requests to target primary shards only. With opensearch-project/OpenSearch#7375, core now supports searching primary shards only. Please follow the documentation for examples and details.
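As a rough illustration of a primary-only search, the Python sketch below builds a search path that routes the request through the `preference` query parameter. The helper name `search_path` and the index name are made up for the example, and the use of `preference` with the values `_primary` and `_primary_first` is my understanding of the core change, so please confirm it against the documentation.

```python
from urllib.parse import urlencode

# Preference values assumed to be supported by core for primary-targeted search.
PRIMARY_PREFERENCES = ("_primary", "_primary_first")


def search_path(index, preference="_primary"):
    """Return a _search path that asks core to prefer primary shards."""
    if preference not in PRIMARY_PREFERENCES:
        raise ValueError(f"unsupported preference: {preference}")
    return f"/{index}/_search?" + urlencode({"preference": preference})


strict_path = search_path("my-plugin-index")
available_path = search_path("my-plugin-index", preference="_primary_first")
```

Routing every search to primaries concentrates read load on them, so it is probably best reserved for the requests that really need read-after-write behavior.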

Open questions

If you have any questions or issues, please post them in the core issue.

Reference

[1] Design

[2] Documentation

dreamer-89 · 2023-06-29T19:52:58Z

Requesting owners to add the v2.9.0 label to this issue.

dreamer-89 · 2023-07-10T23:35:28Z

Hi Plugin Owners,
Gentle reminder to look into this issue, as the code freeze date for the 2.9.0 release, July 11th, is near.

dreamer-89 · 2023-07-13T19:43:38Z

Gentle reminder for plugin owners, as we are past the code freeze date for the 2.9.0 release.

ylwu-amzn · 2023-08-16T04:16:13Z

2.9 has been released and we don't see any issues related to this. Closing now.

dreamer-89 · 2023-08-31T18:44:07Z

@ylwu-amzn : Thanks for working on this issue. I just wanted to call out: please verify that for strong reads (if any), your plugin relies only on the get/mget APIs. I am asking because, with document replication, it is still possible to get strong read guarantees via the write path by using the IMMEDIATE/WAIT_UNTIL refresh policy, which ensures replica shard copies are refreshed with the indexing request, so any follow-up data retrieval request receives the latest data. With SEGMENT, neither IMMEDIATE nor WAIT_UNTIL guarantees replica shard refreshes. So for strong reads, on any follow-up data retrieval query (other than the get/mget APIs) the client would need to provide either _primary, which hits the primary shard only but provides strong consistency, or _primary_first, which hits the primary first (as the name suggests) and, if the primary is not available, routes the request to replica copies, giving better availability (though data read from a replica could be stale).

dreamer-89 added the enhancement (New feature or request) and untriaged labels Jun 29, 2023

dreamer-89 mentioned this issue Jun 29, 2023

[Meta] Validate plugins compatibility with segment replication opensearch-project/OpenSearch#8211

Closed

37 tasks

wanglam mentioned this issue Jul 3, 2023

Compatibility with segment replication opensearch-project/ml-commons-dashboards#226

Closed

gaiksaya added the v2.9.0 label Jul 3, 2023

ylwu-amzn closed this as completed Aug 16, 2023

kaituo mentioned this issue Aug 31, 2023

Compatibility with segment replication opensearch-project/anomaly-detection#989

Closed
