Fix a race condition when a system is started on a different queue from its event serialising queue. #38

andersio · 2019-02-28T22:38:50Z

Resolve #37.

Why?

Because prefix lazily starts the producer being prefixed, we cannot rely on on(value:) to ignite the feedbacks with the initial state. At the time prefix(value:) calls on(value:) for the initial value, the events-reducer producer has not been started yet. Consequently, events can be dropped when the system is instantiated on a queue different from the queue used for serializing events.

Explanation

The current operator application order is prefix(value:) followed by on(value:).

Recall that prefix(value:) is essentially a concat b, where a is a producer that emits only the prefix value and b is the producer being prefixed. Since b is not started until a completes, the events -> reducer part is not started until a has finished sending out the prefix value.

When prefix(value:) sends out the initial value, on(value:) forwards it to the state signal, which in turn updates all feedback signals. But since the events-reducer producer has not started yet (this is the flatMap(.concat) semantic), any events the feedbacks generate in response can be dropped when the queue instantiating the system runs behind the queue serialising all feedback events.
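To make the ordering concrete, here is a minimal, dependency-free sketch of the old operator order. It is a toy model, not ReactiveSwift or ReactiveFeedback code: `simulateOldOrder`, `send(event:)`, `feedback(state:)` and the state/event names are all made up for illustration, and the hop through the event-serialising queue is modelled as an immediate call to represent the case where that queue runs ahead.

```swift
// Toy model only: none of these names are ReactiveSwift or ReactiveFeedback APIs.
func simulateOldOrder() -> (states: [String], dropped: [String]) {
    var states: [String] = []
    var dropped: [String] = []
    // The reducer's observer on the event stream; nil until `b` is started.
    var reducer: ((String) -> Void)? = nil

    // An event reaches the reducer only if `b` is already running.
    func send(event: String) {
        if let reducer = reducer { reducer(event) } else { dropped.append(event) }
    }

    // A feedback that answers the initial state S0 with a single event E0.
    func feedback(state: String) {
        if state == "S0" { send(event: "E0") }
    }

    // 1. `a` emits the prefix value; on(value:) forwards it to the feedbacks.
    states.append("S0")
    feedback(state: "S0")

    // 2. Only now, after `a` has completed, is `b` (events -> reducer) started.
    reducer = { event in
        let next = "S1"
        states.append(next)
        feedback(state: next)
    }

    return (states, dropped)
}

let oldOrder = simulateOldOrder()
print(oldOrder.states)   // ["S0"]
print(oldOrder.dropped)  // ["E0"]
```

In this interleaving E0 is produced before anything is listening for it, so the system never leaves S0.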

How to fix it?

That said, prefix(value:) is guaranteed to have started the prefixed producer by the end of the synchronous producer start process. So we can address the issue by applying on(started:) after prefix(value:) to ignite the system, and applying on(value:) before prefix(value:) instead, which keeps the reducer-to-feedbacks path open.
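The same kind of toy model can illustrate the reordered version. Again this is only a sketch under stated assumptions: `simulateFixedOrder` and its helpers are hypothetical names, not library APIs, and the event queue is modelled as an immediate call.

```swift
// Toy model only: none of these names are ReactiveSwift or ReactiveFeedback APIs.
func simulateFixedOrder() -> (states: [String], dropped: [String]) {
    var states: [String] = []
    var dropped: [String] = []
    // The reducer's observer on the event stream; nil until `b` is started.
    var reducer: ((String) -> Void)? = nil

    // An event reaches the reducer only if `b` is already running.
    func send(event: String) {
        if let reducer = reducer { reducer(event) } else { dropped.append(event) }
    }

    // A feedback that answers the initial state S0 with a single event E0.
    func feedback(state: String) {
        if state == "S0" { send(event: "E0") }
    }

    // 1. Synchronous start: the prefix value goes downstream only, and `b`
    //    (events -> reducer) is started before the start process returns.
    //    The on(value:) applied before the prefix feeds reducer output
    //    back into the feedbacks.
    states.append("S0")
    reducer = { event in
        let next = "S1"
        states.append(next)
        feedback(state: next)
    }

    // 2. on(started:) applied after the prefix ignites the feedbacks with S0,
    //    at a point where the reducer is already listening.
    feedback(state: "S0")

    return (states, dropped)
}

let fixedOrder = simulateFixedOrder()
print(fixedOrder.states)   // ["S0", "S1"]
print(fixedOrder.dropped)  // []
```

Because the ignition happens after the reducer is attached, E0 is consumed no matter how far ahead the event queue runs.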

RuiAAPeres · 2019-02-28T22:48:35Z

> Because prefix lazily starts the producer being prefixed, we cannot rely on on(value:) to ignite the feedbacks with the initial state. At the time prefix(value:) calls on(value:) for the initial value, the events-reducer producer has not been started yet. Consequently, events can be dropped when the system is instantiated on a queue different from the queue used for serializing events.

I really would like to understand this fix, since we extensively use ReactiveFeedback in our codebase, but I can't even understand the first paragraph. 😢

Do you think you could create a small diagram (like a timeline), so I can have a mental image of the events?

andersio · 2019-02-28T23:17:16Z

The race condition

Scenario A

[timeline diagram not preserved]

Scenario B

[timeline diagram not preserved]

The fix

[timeline diagram not preserved]

RuiAAPeres · 2019-02-28T23:43:40Z

(beautiful diagrams, thanks! ❤️ )

(I am assuming S0 and S1 refer to State0 and State1?)

Looking at Scenario B: does it mean that the prefix S0 (the initial state) is never handled by the reducer?

andersio · 2019-03-01T08:38:10Z

@RuiAAPeres The reducer is constructed with the scan(initial:_:) operator, and has been fed with the initial state directly.

The reducer doesn't process states directly; it only processes events produced by feedbacks in response to the latest state. By processing an event, it computes a new state, which is then fed back into the feedbacks.
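As a rough illustration of that scan-style reducer, the sketch below folds a sequence of events into successive states. The `scan` helper on `Sequence`, the `CounterEvent` type and the counter logic are hypothetical stand-ins written for this example, not the ReactiveSwift operator itself. As in the description above, the seed is handed to the reducer directly and is not one of its outputs, which is why the real pipeline adds prefix(value:).

```swift
// Hypothetical helper mirroring scan semantics on a plain Sequence:
// one output per input element; the seed itself is not emitted.
extension Sequence {
    func scan<State>(_ initial: State, _ next: (State, Element) -> State) -> [State] {
        var state = initial
        var output: [State] = []
        for element in self {
            state = next(state, element)
            output.append(state)
        }
        return output
    }
}

// Made-up event type for the example.
enum CounterEvent {
    case increment
    case decrement
}

// The reducer only ever sees (latest state, event) and returns the next state.
func reduce(state: Int, event: CounterEvent) -> Int {
    switch event {
    case .increment: return state + 1
    case .decrement: return state - 1
    }
}

let events: [CounterEvent] = [.increment, .increment, .decrement]
let counterStates = events.scan(0, reduce)
print(counterStates)  // [1, 2, 1]
```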

sergdort · 2019-03-01T09:57:34Z

ReactiveFeedback/SignalProducer+System.swift

@@ -29,8 +29,23 @@ extension SignalProducer where Error == NoError {

            return SignalProducer<Event, NoError>(Signal.merge(events))
                .scan(initial, reduce)
-                .prefix(value: initial)


Are you saying that the prefix operator starts the source producer before you even subscribe to prefix?

andersio · 2019-03-18

No. The LHS (the reducer producer that prefix(value:) is applied to) is started only after the RHS (a producer emitting the single initial value) completes, and that is the core of the issue.

The RHS (the prefixed initial value) sends out a value, the feedbacks receive that value and produce an event, and that event may be dequeued by the feedback loop queue before the LHS (the reducer) starts on the initialising queue.

RuiAAPeres · 2019-03-01T10:39:36Z

ReactiveFeedbackTests/SystemTests.swift

+
+        let observedState: Atomic<[String]> = Atomic([])
+
+        let semaphore = DispatchSemaphore(value: 0)


Pardon my ignorance, but why do we need the aid of a semaphore for this test?

andersio · 2019-03-18

Reproducing scenarios that require a specific timing order often requires manual synchronisation, and sometimes repeated runs to lower the chance of false positives. The OS scheduler need not do you a favour when scheduling threads, so with these primitives we can only create a controlled environment on a best-effort basis.

Alternatively, this can be done by TestScheduler, but only if advance() supports draining tasks one-by-one.
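A minimal sketch of that kind of manual synchronisation, using only Dispatch and Foundation: the semaphore forces the work on a background queue to finish before the calling thread continues, which is the sort of ordering a test needs to pin down. The queue label, `EventLog` and `runOrdered` are hypothetical names for this example, and the lock-guarded log is a stand-in for the `Atomic` used in the actual test.

```swift
import Dispatch
import Foundation

// Lock-guarded log so both queues can record safely (a stand-in, by assumption).
final class EventLog: @unchecked Sendable {
    private let lock = NSLock()
    private var entries: [String] = []

    func record(_ entry: String) {
        lock.lock()
        defer { lock.unlock() }
        entries.append(entry)
    }

    var snapshot: [String] {
        lock.lock()
        defer { lock.unlock() }
        return entries
    }
}

func runOrdered() -> [String] {
    let log = EventLog()
    let eventQueue = DispatchQueue(label: "example.events")
    let semaphore = DispatchSemaphore(value: 0)

    eventQueue.async {
        log.record("event queue ran")
        semaphore.signal()  // release the waiting thread only after recording
    }

    semaphore.wait()        // block until the event queue has done its part
    log.record("starting queue continues")
    return log.snapshot
}

print(runOrdered())  // ["event queue ran", "starting queue continues"]
```

Without the semaphore, the two entries could be recorded in either order depending on how the threads are scheduled.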

RuiAAPeres · 2019-03-01T10:42:32Z

~~@andersio can you provide a case, in our codebase, where this bug manifests itself?~~ https://github.com/Babylonpartners/babylon-ios/pull/6847

ReactiveFeedback/SignalProducer+System.swift

inamiy

Just a reminder not to forget to fix Property+System.swift as well... #38 (comment)

Also, we probably need to use state <~ SignalProducer.system(...).skip(first: 2) instead, since initial will be emitted twice.

andersio · 2019-03-19T13:48:59Z

@inamiy That doesn’t sound right to me. It shouldn’t emit the initial value twice afterwards.

inamiy

@andersio #38 (comment)
My bad. It doesn't seem required, so I will approve now 👍

Add a failing test case for the race condition.

ef04747

andersio added the bug label Feb 28, 2019

andersio self-assigned this Feb 28, 2019

andersio changed the title ~~Anders/fix async start race~~ Fix a race condition when a system is started on a different queue from its event serialising queue. Feb 28, 2019

andersio force-pushed the anders/fix-async-start-race branch 2 times, most recently from 6fcc41f to b29d979 Compare February 28, 2019 22:47

Fix the race condition.

75523c5

andersio force-pushed the anders/fix-async-start-race branch from b29d979 to 75523c5 Compare February 28, 2019 22:48

sergdort reviewed Mar 1, 2019

RuiAAPeres reviewed Mar 1, 2019

sergdort approved these changes Mar 18, 2019

inamiy reviewed Mar 18, 2019

ReactiveFeedback/SignalProducer+System.swift (outdated, resolved)

Combine the two occasions of on into one.

6f96dbd

inamiy suggested changes Mar 19, 2019

inamiy approved these changes Mar 19, 2019

RuiAAPeres merged commit 42656a3 into develop Mar 20, 2019

RuiAAPeres deleted the anders/fix-async-start-race branch March 20, 2019 10:50

inamiy mentioned this pull request Apr 15, 2019

[Proposal] Mealy-Machine ReactiveFeedback babylonhealth/ios-playbook#98

Closed

andersio mentioned this pull request Jan 14, 2020

Rename predicate: to occurrencesPassing:. Update CocoaPods. #50

Closed
