DAOS-16693 telemetry: Avoid race between init/read (#15306) #15322

Merged 2 commits on Oct 22, 2024

@mjmac mjmac commented Oct 15, 2024

In rare cases, a reader may attempt to access a telemetry
node after it has been added to the tree, but before it
has been fully initialized. Use an atomic to prevent
reads before the initialization has completed. Unlucky
readers will get a -DER_AGAIN instead of crashing.

Signed-off-by: Michael MacDonald [email protected]
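
For illustration only, the guard described above could look roughly like the following C11 sketch. It is not the code from this PR: `struct sketch_node`, `sketch_node_init()`, `sketch_node_read()`, and `SKETCH_ERR_AGAIN` are hypothetical names standing in for the real telemetry node type, its setup path, the `d_tm_get_*` readers, and `DER_AGAIN`. The writer publishes the node with a release store once setup is complete, and the reader checks the flag with an acquire load before touching the payload.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for DER_AGAIN; the real value is not reproduced here. */
#define SKETCH_ERR_AGAIN 1

/* Hypothetical node layout, not the actual telemetry node structure. */
struct sketch_node {
	uint64_t    value;       /* payload filled in during initialization */
	atomic_bool initialized; /* false until the writer has finished setup */
};

/* Writer: finish all setup first, then publish with a release store. */
static void
sketch_node_init(struct sketch_node *node, uint64_t value)
{
	assert(node != NULL);
	node->value = value;
	atomic_store_explicit(&node->initialized, true, memory_order_release);
}

/* Reader: do not touch the payload until the flag is visible. */
static int
sketch_node_read(struct sketch_node *node, uint64_t *out)
{
	assert(node != NULL && out != NULL);
	if (!atomic_load_explicit(&node->initialized, memory_order_acquire))
		return -SKETCH_ERR_AGAIN; /* node is in the tree but not ready yet */
	*out = node->value;
	return 0;
}
```

In this sketch a reader that loses the race gets a negative "try again" code and can retry later, rather than dereferencing a half-initialized node.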

@mjmac mjmac requested review from a team as code owners October 15, 2024 17:49

github-actions bot commented Oct 15, 2024

Ticket title is 'Race between add_metric() and d_tm_get_* causes segfaults'
Status is 'Awaiting backport'
Job should run at elevated priority (1)
Errors are Unknown component
https://daosio.atlassian.net/browse/DAOS-16693

@kjacque kjacque left a comment

LGTM. Is this a clean cherry-pick?

@mjmac mjmac added the bug, clean-cherry-pick (Cherry-pick from another branch that did not require additional edits), and approved-to-merge (PR has received release branch merge approval) labels Oct 16, 2024

mjmac commented Oct 16, 2024

> LGTM. Is this a clean cherry-pick?

Yep, just added the label.

@github-actions github-actions bot added the priority (Ticket has high priority, automatically managed) label Oct 18, 2024
@daosbuild1 (Collaborator) commented

Test stage NLT on EL 8.8 completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-15322/2/testReport/

@mjmac mjmac force-pushed the mjmac/DAOS-16693-2.6 branch 2 times, most recently from a351d85 to 6e760d1 Compare October 18, 2024 13:15
Skip-nlt: true
Allow-unstable-test: true
Required-githooks: true
Change-Id: I65b221ee67f200c26e975021d91fa9c6750dfb9b
@mjmac mjmac force-pushed the mjmac/DAOS-16693-2.6 branch from 6e760d1 to b30eb74 Compare October 18, 2024 20:34
@daosbuild1 (Collaborator) commented

Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15322/5/execution/node/1351/log

mjmac commented Oct 21, 2024

Skipped NLT due to known problems with it on the branch. There was one hw-medium test failure that appears to be due to hardware issues:

Only 1 of 2 PMEM devices seen.

This already passed tests on master; I don't see a whole lot of point in grinding away to get it to pass on this branch. Tagging @daos-stack/daos-gatekeeper for forced landing.

@mjmac mjmac requested a review from a team October 21, 2024 14:29
@mjmac mjmac added the forced-landing (The PR has known failures or has intentionally reduced testing, but should still be landed) label Oct 21, 2024
Labels: approved-to-merge, bug, clean-cherry-pick, forced-landing, priority
4 participants