Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adds latency metrics for ClusterState Appliers and Listeners #12333

Merged
merged 10 commits into from
May 20, 2024

Conversation

gargharsh3134
Copy link
Contributor

@gargharsh3134 gargharsh3134 commented Feb 15, 2024

Description

Publish Otel Histogram type metrics to track latencies of:

  1. ClusterState Appliers: cluster.state.appliers.latency in ms with dimension Operation as the invoking applier.
  2. ClusterState Listeners: cluster.state.listeners.latency in ms with dimension Operation as the invoking listener.
  3. Reroute: allocation.reroute.latency in ms.
  4. ClusterState Computation: cluster.state.new.compute.latency in ms with dimension Operation as the invoking taskExecutor.
  5. ClusterState Publishing: cluster.state.publish.success.latency in ms.

Related Issues

Resolves #[12332]

Check List

  • New functionality includes testing.
    • All tests pass
  • New functionality has been documented.
    • New functionality has javadoc added
  • Failing checks are inspected and point to the corresponding known issue(s) (See: Troubleshooting Failing Builds)
  • Commits are signed per the DCO using --signoff
  • Commit changes are listed out in CHANGELOG.md file (See: Changelog)
  • Public documentation issue/PR created

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Copy link
Contributor

github-actions bot commented Feb 15, 2024

Compatibility status:

Checks if related components are compatible with change 985b048

Incompatible components

Skipped components

Compatible components

Compatible components: [https://github.com/opensearch-project/custom-codecs.git, https://github.com/opensearch-project/observability.git, https://github.com/opensearch-project/neural-search.git, https://github.com/opensearch-project/flow-framework.git, https://github.com/opensearch-project/security-analytics.git, https://github.com/opensearch-project/job-scheduler.git, https://github.com/opensearch-project/cross-cluster-replication.git, https://github.com/opensearch-project/opensearch-oci-object-storage.git, https://github.com/opensearch-project/k-nn.git, https://github.com/opensearch-project/notifications.git, https://github.com/opensearch-project/geospatial.git, https://github.com/opensearch-project/asynchronous-search.git, https://github.com/opensearch-project/reporting.git, https://github.com/opensearch-project/sql.git, https://github.com/opensearch-project/ml-commons.git, https://github.com/opensearch-project/common-utils.git, https://github.com/opensearch-project/index-management.git, https://github.com/opensearch-project/anomaly-detection.git, https://github.com/opensearch-project/performance-analyzer-rca.git, https://github.com/opensearch-project/security.git, https://github.com/opensearch-project/alerting.git, https://github.com/opensearch-project/performance-analyzer.git]

Copy link
Contributor

✅ Gradle check result for d29b1cd: SUCCESS

Copy link

codecov bot commented Feb 15, 2024

Codecov Report

Attention: Patch coverage is 92.85714% with 3 lines in your changes are missing coverage. Please review.

Project coverage is 71.39%. Comparing base (b15cb0c) to head (985b048).
Report is 291 commits behind head on main.

Current head 985b048 differs from pull request most recent head d1fd5c6

Please upload reports for the commit d1fd5c6 to get more accurate results.

Files Patch % Lines
server/src/main/java/org/opensearch/node/Node.java 72.72% 2 Missing and 1 partial ⚠️
Additional details and impacted files
@@             Coverage Diff              @@
##               main   #12333      +/-   ##
============================================
- Coverage     71.42%   71.39%   -0.03%     
- Complexity    59978    60284     +306     
============================================
  Files          4985     5011      +26     
  Lines        282275   283693    +1418     
  Branches      40946    41117     +171     
============================================
+ Hits         201603   202543     +940     
- Misses        63999    64373     +374     
- Partials      16673    16777     +104     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@gargharsh3134 gargharsh3134 force-pushed the csaLatencies branch 2 times, most recently from ac76c80 to edc7ff8 Compare March 1, 2024 05:12
Copy link
Contributor

github-actions bot commented Mar 1, 2024

❌ Gradle check result for ac76c80: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Copy link
Contributor

github-actions bot commented Mar 1, 2024

❌ Gradle check result for edc7ff8: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@amkhar
Copy link
Contributor

amkhar commented Mar 1, 2024

@gargharsh3134 - please add testing details.
And see if any unit tests needs to be added to assert that those telemetry data is being recorded properly or not ?

@Gaganjuneja
Copy link
Contributor

Let's finish this PR. So that we can pick this change as well in the performance testing. Thanks!

Copy link
Contributor

❌ Gradle check result for 9e01686: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Copy link
Contributor

❌ Gradle check result for 3f8acdd: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Copy link
Contributor

❌ Gradle check result for d1fd5c6: null

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Copy link
Contributor

@rajiv-kv rajiv-kv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@sachinpkale
Copy link
Member

Re-triggering the gradle build

Copy link
Contributor

✅ Gradle check result for d1fd5c6: SUCCESS

@sachinpkale sachinpkale added the backport 2.x Backport to 2.x branch label May 20, 2024
@sachinpkale
Copy link
Member

@gargharsh3134 Can you check why these Gradle Assemble tasks are failing?

@sachinpkale sachinpkale merged commit a254aa9 into opensearch-project:main May 20, 2024
29 of 36 checks passed
opensearch-trigger-bot bot pushed a commit that referenced this pull request May 20, 2024
* Adds latency metrics for ClusterState Appliers and Listeners

Signed-off-by: Harsh Garg <[email protected]>
(cherry picked from commit a254aa9)
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
gargharsh3134 pushed a commit to gargharsh3134/OpenSearch that referenced this pull request Jun 3, 2024
sachinpkale pushed a commit that referenced this pull request Jun 9, 2024
…art of #12333 (#13926)

* Retaining the old constructors for classes marked as API changed as part of #12333
---------
Signed-off-by: Harsh Garg <[email protected]>
Co-authored-by: Harsh Garg <[email protected]>
gargharsh3134 added a commit to gargharsh3134/OpenSearch that referenced this pull request Jun 10, 2024
…rch-project#12333)

* Adds latency metrics for ClusterState Appliers and Listeners

Signed-off-by: Harsh Garg <[email protected]>
gargharsh3134 added a commit to gargharsh3134/OpenSearch that referenced this pull request Jun 10, 2024
…art of opensearch-project#12333 (opensearch-project#13926)

* Retaining the old constructors for classes marked as API changed as part of opensearch-project#12333
---------
Signed-off-by: Harsh Garg <[email protected]>
Co-authored-by: Harsh Garg <[email protected]>
gargharsh3134 added a commit to gargharsh3134/OpenSearch that referenced this pull request Jun 10, 2024
…rch-project#12333)

* Adds latency metrics for ClusterState Appliers and Listeners

Signed-off-by: Harsh Garg <[email protected]>
gargharsh3134 added a commit to gargharsh3134/OpenSearch that referenced this pull request Jun 10, 2024
…art of opensearch-project#12333 (opensearch-project#13926)

* Retaining the old constructors for classes marked as API changed as part of opensearch-project#12333
---------
Signed-off-by: Harsh Garg <[email protected]>
Co-authored-by: Harsh Garg <[email protected]>
parv0201 pushed a commit to parv0201/OpenSearch that referenced this pull request Jun 10, 2024
…rch-project#12333)

* Adds latency metrics for ClusterState Appliers and Listeners

Signed-off-by: Harsh Garg <[email protected]>
parv0201 pushed a commit to parv0201/OpenSearch that referenced this pull request Jun 10, 2024
…art of opensearch-project#12333 (opensearch-project#13926)

* Retaining the old constructors for classes marked as API changed as part of opensearch-project#12333
---------
Signed-off-by: Harsh Garg <[email protected]>
Co-authored-by: Harsh Garg <[email protected]>
wdongyu pushed a commit to wdongyu/OpenSearch that referenced this pull request Aug 22, 2024
…art of opensearch-project#12333 (opensearch-project#13926)

* Retaining the old constructors for classes marked as API changed as part of opensearch-project#12333
---------
Signed-off-by: Harsh Garg <[email protected]>
Co-authored-by: Harsh Garg <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport 2.x Backport to 2.x branch
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants