Introduce lock to prevent parallel task execution #9858

tsmaeder · 2021-08-06T12:07:53Z

Signed-off-by: Thomas Mäder [email protected]

What it does

The idea is to have a lock that prevents the check for whether a task is already running from being interleaved with the actual starting of the task.
The approach is to delay later executions of a task instead of ignoring them, because runTask() is used in various places that treat a failure to start a task as an error.
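
As a rough illustration of the idea described above, here is a minimal sketch of a promise-based lock guarding a check-then-start sequence. `SimpleLock`, `DemoTaskService`, and `sleep` are hypothetical names invented for this example; this is not the code from the PR and not Theia's `TaskService`.

```typescript
// Minimal promise-based lock: callers queue up and run one at a time.
class SimpleLock {
    private tail: Promise<void> = Promise.resolve();

    // Runs `fn` once every earlier caller has finished and returns its result.
    runExclusive<T>(fn: () => Promise<T>): Promise<T> {
        const result = this.tail.then(fn);
        // Keep the queue usable even if `fn` rejects.
        this.tail = result.then(() => undefined, () => undefined);
        return result;
    }
}

function sleep(ms: number): Promise<void> {
    return new Promise<void>(resolve => setTimeout(resolve, ms));
}

// Hypothetical stand-in for a task service (assumption, not Theia's API).
class DemoTaskService {
    readonly started: string[] = [];
    private readonly running = new Set<string>();
    private readonly lock = new SimpleLock();

    // Resolves to true if the task was started, false if it was already running.
    runTask(label: string): Promise<boolean> {
        // The check and the start happen under the lock, so a second call is
        // delayed (not rejected) and then sees the task as already running.
        return this.lock.runExclusive(async () => {
            if (await this.isRunning(label)) {
                return false;
            }
            await this.start(label);
            return true;
        });
    }

    private async isRunning(label: string): Promise<boolean> {
        await sleep(5); // simulated asynchronous round trip
        return this.running.has(label);
    }

    private async start(label: string): Promise<void> {
        await sleep(5); // simulated asynchronous start
        this.running.add(label);
        this.started.push(label);
    }
}
```

Without the lock, two calls to `runTask('build')` issued back to back could both pass the `isRunning` check before either one records the task as running.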

How to test

To reproduce the issue first:

Install the extension from the issue: https://github.com/eclipse-theia/theia/files/6922291/prefs-explorer-0.0.1.vsix.zip
Change the Theia source as described here: TaskService.runTask(...) not safe for parallel Execution #9806 (comment)

You should be able to start multiple instances of the same task by clicking on the tasks view.

To verify it's fixed, keep the above changes, but check out the PR branch.

You should not be able to create multiple instances of the same task.

Review checklist

as an author, I have thoroughly tested my changes and carefully followed the review guidelines

Reminder for reviewers

as a reviewer, I agree to behave in accordance with the review guidelines

tsmaeder · 2021-08-06T13:24:52Z

Grrrr...changed code to use injection and didn't rerun the tests. Stand by...

msujew

I can confirm that the issue exists on master and is resolved by these changes. Starting a task multiple times is no longer possible.

msujew · 2021-08-09T14:10:08Z

packages/core/src/common/promise-util.ts

+ *
+ * const stringValue = await myPromise.then(delay(600)).then(value => value.toString());
+ *
+ * @param ms the number of millisecond os dealy


Suggested change

* @param ms the number of millisecond os dealy

* @param ms the number of milliseconds to delay

msujew · 2021-08-09T14:13:08Z

packages/core/src/common/promise-util.ts

+ * A function to allow a promise resolution to be delayed by a number of milliseconds. Usage is like so:
+ *
+ * const stringValue = await myPromise.then(delay(600)).then(value => value.toString());


Suggested change

* A function to allow a promise resolution to be delayed by a number of milliseconds. Usage is like so:
*
* const stringValue = await myPromise.then(delay(600)).then(value => value.toString());

* A function to allow a promise resolution to be delayed by a number of milliseconds. Usage is as follows:
*
* `const stringValue = await myPromise.then(delay(600)).then(value => value.toString());`
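
For readers unfamiliar with the pattern in the doc comment above, a `delay` helper of this shape could look roughly like the following. This is a sketch written for illustration, assuming the helper returns a pass-through function; it is not copied from `promise-util.ts`, and `delayDemo` is a hypothetical name.

```typescript
// Returns a function that passes its input through unchanged after `ms` milliseconds.
function delay<T>(ms: number): (value: T) => Promise<T> {
    return value => new Promise<T>(resolve => setTimeout(() => resolve(value), ms));
}

// Usage: the value 42 arrives about 50 ms later and is then converted to a string.
// The type argument is spelled out here so the example does not rely on inference.
async function delayDemo(): Promise<string> {
    const myPromise = Promise.resolve(42);
    return myPromise.then(delay<number>(50)).then(value => value.toString());
}
```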

packages/task/src/browser/task-service.ts

packages/core/src/common/lock.ts

paul-marechal · 2021-08-09T15:13:34Z

packages/core/src/common/lock.spec.ts

+    beforeEach(() => {
+
+    });


Not useful.

packages/task/src/browser/task-service.ts

colin-grant-work · 2021-08-10T21:51:20Z

Although in general I'm not a fan of the question 'why not use an existing library?' I do think it deserves to be posed. There seem to be a few libraries that implement lock-like functionality with enough downloads to make it plausible that they're useful:
async-mutex
async-lock

paul-marechal · 2021-08-10T22:13:37Z

@colin-grant-work async-mutex looks great.

paul-marechal · 2021-08-12T17:52:01Z

@tsmaeder what do you think about async-mutex? In your case you might be interested in the Mutex class. I am interested in the Semaphore class for one of my PRs (in order to limit concurrency).

tsmaeder · 2021-08-17T13:57:23Z

@paul-marechal I'm not sure: I'm relying on the fact that I can multi-release the same Lock multiple times without it causing an error. The doc of async-mutex does not mention that. Out of curiosity, how does a semaphore (aka n parallel tasks) make sense in the single-threaded environment?

paul-marechal · 2021-08-17T17:13:27Z

I'm relying on the fact that I can multi-release the same Lock multiple times without it causing an error.

I don't see that being relied upon within the code? Perhaps I am missing it? But from what I understand, calling the release function obtained from async-mutex's acquire multiple times only releases once. We can always open a PR to add that missing documentation upstream.
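
To make the "safe multiple release" property under discussion concrete, here is a minimal sketch of a lock whose release function is idempotent. `ReleasableLock` and `Releaser` are hypothetical names; this is neither the lock from this PR nor the implementation inside any library.

```typescript
type Releaser = () => void;

class ReleasableLock {
    private locked = false;
    private readonly waiters: Array<() => void> = [];

    // Resolves with a release function once the caller holds the lock.
    async acquire(): Promise<Releaser> {
        if (this.locked) {
            // Wait until a previous holder hands the lock over.
            await new Promise<void>(resolve => this.waiters.push(resolve));
        } else {
            this.locked = true;
        }
        let released = false;
        return () => {
            if (released) {
                return; // second and later calls are no-ops
            }
            released = true;
            const next = this.waiters.shift();
            if (next) {
                next(); // hand the lock directly to the next waiter
            } else {
                this.locked = false;
            }
        };
    }

    isLocked(): boolean {
        return this.locked;
    }
}
```

If the release function were not guarded by the `released` flag, calling it twice would wake two waiters and let both hold the lock at the same time.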

[...] how does a semaphore (aka n parallel tasks) make sense in the single-threaded environment?

Semaphores apply to any concurrent system. Note that concurrency can be achieved without parallelism. Node's async tasks are very much concurrent, that's the whole selling point of Node :) See this or that.

In my case I would spawn X amount of upload promises, and the semaphore would limit the amount uploaded by those concurrent tasks so that only a fewer amount Y is actually uploading at any given time. The semaphore would guard the "upload budget count" shared resource.
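
The throttling described above can be sketched as follows: all jobs are created up front, but a counting semaphore lets only a fixed number run at once. `SimpleSemaphore` and `runLimited` are hypothetical helpers written for this illustration; they do not reproduce the API of async-mutex.

```typescript
class SimpleSemaphore {
    private permits: number;
    private readonly waiters: Array<() => void> = [];

    constructor(permits: number) {
        this.permits = permits;
    }

    // Resolves once a permit is available for the caller.
    async acquire(): Promise<void> {
        if (this.permits > 0) {
            this.permits--;
            return;
        }
        await new Promise<void>(resolve => this.waiters.push(resolve));
    }

    release(): void {
        const next = this.waiters.shift();
        if (next) {
            next(); // pass the permit straight to a waiter
        } else {
            this.permits++;
        }
    }
}

// Runs all jobs, but at most `limit` of them at any one time.
async function runLimited<T>(jobs: Array<() => Promise<T>>, limit: number): Promise<T[]> {
    const semaphore = new SimpleSemaphore(limit);
    return Promise.all(jobs.map(async job => {
        await semaphore.acquire();
        try {
            return await job();
        } finally {
            semaphore.release();
        }
    }));
}
```

With `limit` set to 1 the same helper behaves like a mutex, since only one job can hold the single permit at a time.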

tsmaeder · 2021-08-18T08:27:25Z

In my case I would spawn X amount of upload promises, and the semaphore would limit the amount uploaded by those concurrent tasks so that only a fewer amount Y is actually uploading at any given time.

But why: doesn't the I/O of the upload block the promises from proceeding just as effectively as waiting on the semaphore?

paul-marechal · 2021-08-18T15:21:46Z

The I/O is happening in parallel in my case, so the semaphore would help throttle the logic. Right now I also had to implement my own class to handle only X tasks at a given time, but semaphores from async-mutex might be more adequate than re-implementing something equivalent in Theia.

A few lines of code might be worth a thousand words:

I invite you to open a new empty tab, open the dev tools, open the network tab and/or the console and paste the following snippets. They each do 16 requests to some random API, but one is doing the requests in parallel and the other not.

// parallel
(async function() {
    console.log('start');
    const start = Date.now();
    const promises = [];
    for (let i = 0; i < 16; i++) {
        promises.push((async () => {
            const response = await fetch('https://v2.jokeapi.dev/joke/Any?type=single');
            const { joke = 'something went wrong' } = await response.json();
            return joke;
        })());
    }
    const jokes = await Promise.all(promises);
    const end = Date.now();
    console.log('jokes:', jokes);
    console.log(`end (took: ${end - start}ms)`);
})();

// sequential
(async function() {
    console.log('start');
    const start = Date.now();
    const jokes = [];
    for (let i = 0; i < 16; i++) {
        const response = await fetch('https://v2.jokeapi.dev/joke/Any?type=single');
        const { joke = 'something went wrong' } = await response.json();
        jokes.push(joke);
    }
    const end = Date.now();
    console.log('jokes:', jokes);
    console.log(`end (took: ${end - start}ms)`);
})();

Promises are handles to some underlying operation. That operation happens "outside of JS", meaning it can happen in parallel. JS code attached to a promise won't run in parallel with other JS handler code, but we still benefit from having the underlying process do things in the background in parallel.

tsmaeder · 2021-08-19T12:20:49Z

@paul-marechal but why throttle uploads? And if we are throttling, what is the resource we're trying to conserve? I'm not questioning Semaphores in general, I'm asking about your specific case.

tsmaeder · 2021-08-19T12:23:29Z

@colin-grant-work I'm kinda split down the middle about using a library: async-lock is a no-go for me since it's not being maintained. async-mutex seems weird: it has two interface classes that basically do the same thing (a mutex is really just a semaphore with n=1). I'm just not sure there is enough meat there to not write the utility ourselves.

colin-grant-work · 2021-08-19T14:15:22Z

@tsmaeder, I'm certainly not wedded to either of those, and this is a simple-enough utility that I don't think we need to worry too much about missing subtleties. It's one I've been tempted to write in the past to handle synchronous calls to update preferences, and I think your implementation handles that case, as well, and with the added feature of safe multiple-release.

paul-marechal · 2021-08-20T16:48:00Z

@tsmaeder in my case I noticed that not throttling uploads in my environment takes more time than throttling them. But from what I've seen, it's a common thing to do so as not to hammer servers with too many concurrent requests.

Signed-off-by: Thomas Mäder <[email protected]>

tsmaeder · 2021-08-23T13:58:08Z

I've changed the code to use async-mutex.

paul-marechal

I confirm that I am unable to start the same task multiple times anymore, following the "How to test" instructions.

Code LGTM.

packages/core/src/common/promise-util.ts

Signed-off-by: Thomas Mäder <[email protected]>

tsmaeder requested review from RomanNikitenko and paul-marechal August 6, 2021 12:07

msujew reviewed Aug 9, 2021

paul-marechal reviewed Aug 9, 2021

vince-fugnitto added the tasks issues related to the task system label Aug 9, 2021

This was referenced Aug 12, 2021

Tasks are not ended properly eclipse-che/che#19821

Closed

Plugins Sprint 206 eclipse-che/che#20290

Closed

tsmaeder added 4 commits August 23, 2021 13:34

Introduce lock to prevent parallel task execution

6cdd283

Signed-off-by: Thomas Mäder <[email protected]>

Use console instead of injected logger

06fe2d4

Signed-off-by: Thomas Mäder <[email protected]>

Fix english and formatting

4fd7eff

Signed-off-by: Thomas Mäder <[email protected]>

Use async-mutey package

d7fe10d

Signed-off-by: Thomas Mäder <[email protected]>

tsmaeder force-pushed the 9806_parallel_task_runs branch from 27bd572 to d7fe10d Compare August 23, 2021 12:41

paul-marechal approved these changes Aug 23, 2021

tsmaeder merged commit ff9e050 into eclipse-theia:master Aug 24, 2021

dna2github pushed a commit to dna2fork/theia that referenced this pull request Aug 25, 2021

Introduce lock to prevent parallel task execution (eclipse-theia#9858)

8a926c8

Signed-off-by: Thomas Mäder <[email protected]>

RomanNikitenko pushed a commit that referenced this pull request Sep 16, 2021

Introduce lock to prevent parallel task execution (#9858)

70c21b5

Signed-off-by: Thomas Mäder <[email protected]>

RomanNikitenko pushed a commit that referenced this pull request Sep 16, 2021

Introduce lock to prevent parallel task execution (#9858)

8e3931d

Signed-off-by: Thomas Mäder <[email protected]>

azatsarynnyy pushed a commit to redhat-developer/eclipse-theia that referenced this pull request Sep 23, 2021

Introduce lock to prevent parallel task execution (eclipse-theia#9858)

ccb1bea

Signed-off-by: Thomas Mäder <[email protected]>
