128 warning for duplicate named tasks #135
Conversation
src/python/parla/common/spawn.py
Outdated
@@ -82,6 +80,13 @@ def spawn(task=None,
        idx = task

        task = taskspace[idx]
        lock = threading.Lock()
This won't protect against the race condition, as each invocation of spawn will have a separate lock instance.
The lock would have to be a field of the shared Task instance. (That said, I think a lock solution may have too big a performance hit; a C++ function with a CAS state check/set is likely the fastest, but worth measuring.)
Testing the race will be hard though, since reproducing it relies on the GIL owner switching between the bytecode instructions for the state check and the set. This will be very unlikely with the standard switch interval (5ms) until the optional-GIL PEP goes through 🤞
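To make the point concrete, here is a minimal sketch (hypothetical names, not the actual Parla code) of the fix being suggested: the lock is stored on the shared Task instance, so every spawn call for the same task contends on the same lock, and at most one caller can transition the task to "SPAWNED".

```python
import threading

class Task:
    # Sketch: the lock lives on the shared Task instance, so all
    # spawn calls for this task synchronize on the same lock object.
    def __init__(self):
        self._lock = threading.Lock()
        self.py_state = "CREATED"

    def try_mark_spawned(self):
        # Check-and-set under the task's own lock: exactly one caller wins.
        with self._lock:
            if self.py_state == "SPAWNED":
                return False
            self.py_state = "SPAWNED"
            return True

task = Task()
results = []
threads = [
    threading.Thread(target=lambda: results.append(task.try_mark_spawned()))
    for _ in range(8)
]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(results.count(True))  # exactly one thread wins the check-and-set
```

A per-call `threading.Lock()` created inside spawn, as in the diff above, would give each thread its own lock and provide no mutual exclusion at all.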
Maybe something like what the state function was for a bit (with >=):

parla-experimental/src/c/backend/task.cpp, line 327 in bfdb9a6:

    return status;
Edit: err, whoops, that's not linking correctly to the right commit hash (c70f9fc):
Task::State InnerTask::set_state(int state) {
  Task::State new_state = static_cast<Task::State>(state);
  Task::State old_state;
  bool success = true;
  do {
    old_state = this->state.load();
    if (old_state > new_state) {
      success = false;
    }
  } while (!this->state.compare_exchange_weak(old_state, new_state));
  if (!success) {
    throw std::runtime_error("Task States must always be increasing.");
  }
  return old_state;
}
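As a rough Python analogue of the monotone invariant above (a sketch, not the project's code): the C++ version enforces "states may only increase" with an atomic CAS loop; here a lock stands in for the CAS, and, unlike the C++ version, the regression check happens before the store rather than after.

```python
import threading

class InnerTask:
    # Sketch: a lock plays the role of the C++ atomic CAS loop.
    def __init__(self):
        self._lock = threading.Lock()
        self._state = 0

    def set_state(self, new_state: int) -> int:
        with self._lock:
            old_state = self._state
            if old_state > new_state:
                # Equal states are allowed, matching the C++ `>` check.
                raise RuntimeError("Task States must always be increasing.")
            self._state = new_state
            return old_state

t = InnerTask()
t.set_state(1)      # old state 0 -> 1
t.set_state(2)      # old state 1 -> 2
try:
    t.set_state(1)  # regression: rejected
except RuntimeError as e:
    print(e)
```

Note one behavioral difference worth being aware of in the C++ snippet: the CAS store still completes even when `success` is false, so the state is overwritten before the exception is thrown.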
This is not thread-safe, right?
Edit: ohh, it's using compare_exchange_weak, got it!
I was reading through this - https://www.codeproject.com/Articles/808305/Understand-std-atomic-compare-exchange-weak-in-Cpl - and it states the following about using a while loop with compare_exchange_weak:

"Note that we generally cannot use this pattern to implement a mutex. Otherwise, multiple threads may be inside the critical section at the same time."

Would compare_exchange_strong suit better?
I think either is okay, because the loop rejects failures when it is not the only thread inside the critical region, i.e. when this->state.load() changes between when it is read and when it would be modified, and it retries until it is the only one inside the region.
Oh! I'm actually surprised that terminates without tagging the Cython wrapper of task::set_state as except +!
Does this interrupt and shut down the runtime, or does the program hang?
It shuts down the runtime
Although, I would guess it doesn't hit the "Python layer" terminate path so the logs are not dumped?
    self.release()
Also, maybe more importantly, it is possible the new set_state function leads to errors on await and the spawned continuation tasks.
> Although, I would guess it doesn't hit the "Python layer" terminate path so the logs are not dumped?

Yes.
src/python/parla/common/spawn.py
Outdated
if(task.py_state != "SPAWNED"):
    task.py_state = "SPAWNED"
else:
    raise Exception("Duplicate task ID spawned. This will cause runtime to hang. Aborting...")
Maybe change the exception type to RuntimeError? It might also be better to print the duplicate ID.
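A minimal sketch of the revision being suggested (the `_Task` class and its `idx` attribute are hypothetical stand-ins, not the real Parla Task): raise RuntimeError instead of a bare Exception, and include the offending ID in the message.

```python
class _Task:
    # Hypothetical minimal stand-in for the real Task object.
    def __init__(self, idx):
        self.idx = idx
        self.py_state = "CREATED"

def mark_spawned(task):
    # Per the review: RuntimeError, and name the duplicate ID.
    if task.py_state == "SPAWNED":
        raise RuntimeError(
            f"Duplicate task ID spawned: {task.idx!r}. "
            "This would cause the runtime to hang."
        )
    task.py_state = "SPAWNED"

t = _Task("T[0]")
mark_spawned(t)  # first spawn succeeds
```

Including the ID turns a generic crash into an error the user can act on directly.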
src/python/parla/common/spawn.py
Outdated
try:
    scheduler.spawn_task(task)
except RuntimeError:
    raise RuntimeError("Task IDs can only be increasing. Possibly duplicate task ID present: " + str(task))
Recommend changing this to say "Conflicting task state while spawning task. Possible duplicate TaskID...", but otherwise LGTM!
updated!
PR adding an exception shown to users for duplicate task names.