Add code tour #21

glennmoy · 2023-04-26T19:21:12Z

Closes #20

To see the specific tasks where the Asana app for GitHub is being used, see below:
- https://app.asana.com/0/0/1204480108842965

examples/tour.jl

glennmoy · 2023-04-26T19:22:20Z

examples/tour.jl

+
+# Now let's start the batcher - but given we already used the RandomBatcher previously, we
+# need to provide the correct new_state
+# XXX: why does it work if we use state=RNG above? why do we need to do this twice?


?

not really sure tbh

beacuse the batcher itself does not store the state anywhere; we used to do that but it was too easy for it to get out of sync and then you lose teh guarantees about reproducibility (that's also why the take!(::Batcher, state) interface requires the state, so that the batcher itself can remain stateless)

ah I see...

To me then it seems that allowing Batcher(...;start=true, state=RNG) would contradict that...because in that case it not only seems like the Batcher does know something about the state (because it's passed as an argument) but that it also controls the iteration state by being able to start from construction.

(tangentially, the fact that "state is ignored if start=false" just confuses the matter because now you have conditional application of kwargs)

Overall I feel the separation of concerns has been blurred which partly contributes to my confusion

glennmoy · 2023-04-26T19:23:34Z

examples/tour.jl

+
+# A more convenient way to do this is to just take! from the batcher.channel, which obviates
+# the need for passing around the new_state
+((X4, Y4), new_state), old_state = take!(batcher.channel)


It's not documented that it can/should be used this way but I've seen this used on beacon-internal code? is it sanctioned? seems safer & more convenient than carrying around a state?

yeah this is no good 🙅 for the reasons above: it's the responsibility of the caller to handle iteration state. the other reason is that take!(::Batcher, state) handles error propagation, avoiding deadlock, etc. all that good stuff (which is surprisingly tricky to get right).

if a user wants to just have a channel of batches, they are absolutely free to call start_batching with the workers they wanna use; the idea of the Batcher is to encapsulate all that trickiness of running this stuff in a distributed way

gotcha! I might show you where I saw this pattern internally and you might shed light on what I might be misunderstanding

examples/tour.jl

glennmoy · 2023-04-26T19:30:56Z

examples/tour.jl

+# https://github.com/beacon-biosignals/Legolas.jl/blob/main/examples/tour.jl
+# https://github.com/beacon-biosignals/Onda.jl/blob/main/examples/tour.jl
+#
+# Why OndaBatches?


to the best of my understanding - this is why it exists

kleinschmidt

this is a great start and really awesome to have!

kleinschmidt · 2023-04-27T14:45:20Z

examples/tour.jl

+s1, l1 = load_labeled_signal(labeled_signals[1, :])
+
+@test s1 isa Samples
+@test l1 isa Samples
+


might be worthwhile to explain a bit about how label_span relates to the Signal's span field (they're both relative to recording start), and maybe to give an example of using sub_label_span to select a sub-span (using the correct alignment). woudl be easier to see the nuances here if span doesn't start at 0. so maybe this should be a separate section somewhere...

added, though I'm confused about one part - highlighted below

examples/tour.jl

kleinschmidt · 2023-04-27T14:55:27Z

examples/tour.jl

+
+# Now let's start the batcher - but given we already used the RandomBatcher previously, we
+# need to provide the correct new_state
+# XXX: why does it work if we use state=RNG above? why do we need to do this twice?


beacuse the batcher itself does not store the state anywhere; we used to do that but it was too easy for it to get out of sync and then you lose teh guarantees about reproducibility (that's also why the take!(::Batcher, state) interface requires the state, so that the batcher itself can remain stateless)

examples/tour.jl

kleinschmidt · 2023-04-27T15:01:28Z

examples/tour.jl

+
+# A more convenient way to do this is to just take! from the batcher.channel, which obviates
+# the need for passing around the new_state
+((X4, Y4), new_state), old_state = take!(batcher.channel)


yeah this is no good 🙅 for the reasons above: it's the responsibility of the caller to handle iteration state. the other reason is that take!(::Batcher, state) handles error propagation, avoiding deadlock, etc. all that good stuff (which is surprisingly tricky to get right).

if a user wants to just have a channel of batches, they are absolutely free to call start_batching with the workers they wanna use; the idea of the Batcher is to encapsulate all that trickiness of running this stuff in a distributed way

examples/tour.jl

kleinschmidt

still haven't really given this a super thorough review but it's been open so long we might as well merge it.

glennmoy added 3 commits April 26, 2023 14:04

add tour

3dfcef8

se testdata

dfc1200

add distributed example

dc6e0e5

glennmoy marked this pull request as draft April 26, 2023 19:21

glennmoy commented Apr 26, 2023

View reviewed changes

glennmoy changed the title ~~Add code tour~~ RFC: Add code tour Apr 26, 2023

glennmoy commented Apr 26, 2023

View reviewed changes

kleinschmidt reviewed Apr 27, 2023

View reviewed changes

glennmoy added 7 commits April 28, 2023 13:37

Don't hardcode RNG

20041a9

update test

ad2e2c1

delete bad example

24f6e5d

update comments

ecb4489

fix tests

786d557

Improve Batcher example

b51ca9a

add realistic example

565e645

glennmoy force-pushed the gm/codetour branch from d877b31 to 565e645 Compare April 28, 2023 15:27

glennmoy added 2 commits April 28, 2023 15:58

add section on signal spans

aa0d693

remove comment

50aa715

glennmoy changed the title ~~RFC: Add code tour~~ Add code tour May 2, 2023

glennmoy marked this pull request as ready for review May 2, 2023 09:55

Update README

99d1190

glennmoy requested a review from kleinschmidt May 3, 2023 18:49

kleinschmidt reviewed Mar 20, 2024

View reviewed changes

examples/tour.jl Outdated Show resolved Hide resolved

kleinschmidt added 2 commits March 20, 2024 14:41

Merge remote-tracking branch 'origin/main' into gm/codetour

44e0166

Update examples/tour.jl

0750559

kleinschmidt approved these changes Mar 20, 2024

View reviewed changes

kleinschmidt merged commit 7b807bf into main Mar 20, 2024
2 checks passed

kleinschmidt deleted the gm/codetour branch March 20, 2024 18:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add code tour #21

Add code tour #21

glennmoy commented Apr 26, 2023 •

edited by kleinschmidt

Loading

glennmoy Apr 26, 2023

kleinschmidt Apr 27, 2023

glennmoy Apr 28, 2023

glennmoy Apr 26, 2023

kleinschmidt Apr 27, 2023

glennmoy Apr 28, 2023 •

edited

Loading

glennmoy Apr 26, 2023

kleinschmidt left a comment

kleinschmidt Apr 27, 2023

glennmoy Apr 28, 2023

kleinschmidt Apr 27, 2023

kleinschmidt Apr 27, 2023

kleinschmidt left a comment

Add code tour #21

Add code tour #21

Conversation

glennmoy commented Apr 26, 2023 • edited by kleinschmidt Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

glennmoy Apr 28, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kleinschmidt left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kleinschmidt left a comment

Choose a reason for hiding this comment

glennmoy commented Apr 26, 2023 •

edited by kleinschmidt

Loading

glennmoy Apr 28, 2023 •

edited

Loading