non-deterministic sparse sum failure #3115

Closed
stress-tess opened this issue Apr 23, 2024 · 0 comments · Fixed by #3117
Assignees
Labels
bug Something isn't working

Comments

@stress-tess
Member

The sparse sum helper failed the nightly with the following:

num locales: 2
Failure with seeds:
8086790153783974714,
2380734683647922470,
2906987507681887800,
4967208342496478642

Even with runtime checks enabled, I wasn't able to reproduce the failure until I made an AddressSanitizer (ASan) build. Even then it is non-deterministic and sometimes passes, which leads me to believe there is a race condition.

@stress-tess stress-tess added the bug Something isn't working label Apr 23, 2024
@stress-tess stress-tess self-assigned this Apr 23, 2024
@stress-tess stress-tess changed the title from "sparse_sum_helper race condition" to "non-deterministic sparse sum failure" Apr 24, 2024
stress-tess added a commit to stress-tess/arkouda that referenced this issue Apr 24, 2024
This PR (closes Bears-R-Us#3115) fixes the non-deterministic `sparse_sum_helper` failure. I was seeing runs where both the `if` and the `else if` would execute. Since the fetch conditions are mutually exclusive, we can use an `if` for both. I didn't realize that `on` statements were non-blocking, so there's potential for them to execute concurrently on different locales. Since my logic requires the previous locale to finish before the next begins, I've added an atomic helper to enforce this (thanks Jeremiah for helping with that!)
github-merge-queue bot pushed a commit that referenced this issue Apr 24, 2024
This PR (closes #3115) fixes the non-deterministic `sparse_sum_helper` failure. I was seeing runs where both the `if` and the `else if` would execute. Since the fetch conditions are mutually exclusive, we can use an `if` for both. I didn't realize that `on` statements were non-blocking, so there's potential for them to execute concurrently on different locales. Since my logic requires the previous locale to finish before the next begins, I've added an atomic helper to enforce this (thanks Jeremiah for helping with that!)

Co-authored-by: Tess Hayes <[email protected]>
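
For readers who want to see the ordering idea in isolation, here is a minimal sketch in Python, not the Chapel code from the PR. It assumes the fix amounts to "worker i may only touch the shared running total after workers 0..i-1 have finished". The names `TurnGate` and `ordered_prefix_sums` are hypothetical, and a `threading.Condition` stands in for the atomic helper mentioned in the commit message.

```python
import threading


class TurnGate:
    """Hypothetical helper: lets workers pass a checkpoint strictly in index order."""

    def __init__(self):
        self._turn = 0
        self._cond = threading.Condition()

    def wait_for(self, index):
        # Block until every worker with a smaller index has called advance().
        with self._cond:
            self._cond.wait_for(lambda: self._turn == index)

    def advance(self):
        # Hand the turn to the next worker and wake all waiters.
        with self._cond:
            self._turn += 1
            self._cond.notify_all()


def ordered_prefix_sums(chunks):
    """Return each chunk's starting offset in the running total, plus the total."""
    gate = TurnGate()
    running = [0]                      # shared state, only touched while holding the turn
    offsets = [None] * len(chunks)

    def worker(i):
        local_total = sum(chunks[i])   # independent work, safe to run concurrently
        gate.wait_for(i)               # order-dependent step: wait for workers 0..i-1
        offsets[i] = running[0]
        running[0] += local_total
        gate.advance()

    # Start the workers in reverse order to show that the gate, not the
    # start order, decides who updates the running total first.
    threads = [threading.Thread(target=worker, args=(i,))
               for i in reversed(range(len(chunks)))]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return offsets, running[0]
```

Without the gate, two workers could read `running[0]` before either writes it back, which is the kind of intermittent failure described above. The gate keeps the per-chunk work concurrent and serializes only the step that depends on the previous worker's result.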