[WasmFS] Async proxied JS backend #16229

kripken · 2022-02-09T16:20:49Z

The key file src/library_wasmfs_fetch.js here shows a simple
async JS backend that depends on pthreads proxying. That is, the main code
looks sync as usual, and we proxy to a dedicated thread which does the async
operation, here, a network fetch().

This is a "hello world" backend, the minimal one I can think of that is async. But
it may still be useful - we used to have a LazyFile option in the old FS, which this
is very close to, that is, on the first read of the data we fetch it from the network,
and then it is cached like a normal JS file.

To implement this, add a new ProxiedAsyncJSImplFile in C++. This combines
the matters of proxying and the target being async. In theory we could add two
layers, here, first a C++ File that is async (perhaps using C++ futures?), but I
think that might be over-engineering, since we don't really want the async
aspect for C++ - it's very specific to JS. However, those are internal details, and
we could refactor the code later to add such laying if we wanted.

The JS side is the important part here. Basically each JS backend would define
a bunch of JS hooks that return Promises, and everything else is taken care of
automatically.

kripken · 2022-02-09T19:40:03Z

Ok, this should now be mostly cleaned up and ready for review.

@rstz This now uses await as you suggested. First time I tried that, looks like it works! Nicer code too.

kripken · 2022-02-09T19:46:18Z

(top comment has been updated to give an overview)

tlively

It would be nice to be able to have async backends without them necessarily also being JS backends, but I agree that that seems complicated to put together and not too useful in practice.

tlively · 2022-02-09T21:19:01Z

src/library_wasmfs.js

+#if ASSERTIONS
+    assert(wasmFS$backends[backend]);
+#endif
+    {{{ runtimeKeepalivePush() }}}


Why is this pushing and popping necessary? Is there a way to make it less manual, like a withRuntimeKeptAlive(...) wrapper function?

@sbc100 , did we consider a wrapper function? It's not less code, and only works in some situations, but might be nice.

Another option, for places where we can use await, is to have a macro {{{ makeAwait }}} perhaps that would put the push/pop around it?

src/library_wasmfs_fetch.js

src/struct_info_internal.json

tlively · 2022-02-09T21:32:02Z

system/lib/wasmfs/fetch_backend.cpp

+namespace wasmfs {
+
+class FetchBackend : public Backend {
+  emscripten::SyncToAsync proxy;


It seems strange to have a proxying object outside of the virtual proxying backend. Could we use that virtual backend here instead?

The issue is that we need that proxy object to call the constructor. I guess we could add a layer of indirection with a function that creates it, and call that, like we did elsewhere, would you prefer that?

(But not sure what you mean by "use that virtual backend here" - maybe you have another idea?)

I guess we can't use the proxying backend here because the proxied operations are not the normal backend operations, but rather async versions of the normal operations. The proxying backend would be expecting to call the normal operations, not async operations.

Yes, exactly.

It is odd though to have a Proxy created here, though, so I refactored some now to avoid that. The FetchBackend class is now entirely gone, so creating more backends like this will have less boilerplate.

…proxy

tlively

LGTM besides these last comments.

tlively · 2022-02-10T01:18:59Z

src/library_wasmfs.js

+    {{{ runtimeKeepalivePush() }}}
+    await wasmFS$backends[backend].allocFile(file);
+    {{{ runtimeKeepalivePop() }}}
+    {{{ makeDynCall('vi', 'fptr') }}}(arg);


Will vi handle wasm64? Is there some form of vp we should use instead?

Good question. It doesn't look like we have support for that atm, so it's another limitation of wasm64. I'll add a comment here at least to make it easier to fix up later when we do.

tlively · 2022-02-10T01:26:10Z

system/lib/wasmfs/proxied_async_js_impl_backend.h

+        buf,
+        len,
+        offset,
+        [](CppCallbackState* state) { (*state->resume)(); },


I don't believe there's any need to dereference the function pointer before calling it.

Suggested change

[](CppCallbackState* state) { (*state->resume)(); },

[](CppCallbackState* state) { state->resume(); },

Maybe you're thinking of a C function pointer (where the deref is optional - kind of weird btw...), but in C++ that doesn't seem to work:

proxied_async_js_impl_backend.h:111:54: error: called object type 'emscripten::SyncToAsync::Callback' (aka 'function<void ()> *') is not a function or function pointer [](CppCallbackState* state) { (state->resume)(); },

Oh, I thought that emscripten::SyncToAsync::Callback was a function pointer, but I guess it is a pointer to std::function.

tlively · 2022-02-10T01:27:04Z

system/lib/wasmfs/proxied_async_js_impl_backend.h

+    return state.result;
+  }
+
+  void flush() override {}


Should we abort with some sort of "unimplemented" message here?

This matches js_impl_backend, but fair point, maybe we should do the same in both. OTOH, I think a default of "sync does nothing" is reasonable. I guess we should decide between those eventually, and if we do go with the latter, we should probably put the default in File and not here and in JSImpl.

Ok, leaving this for now seems fine.

tlively · 2022-02-10T01:29:19Z

system/lib/wasmfs/proxied_async_js_impl_backend.h

+        &state);
+    });
+
+    return state.offset;


Would it be worth having different callback state types for the different function signatures, or even for each individual method? On the one hand it would be more types to keep track of, but on the other hand there would be no ambiguity about which fields to use.

I think that adds complexity and I'm not sure what ambiguity is removed? offset is, by its type, the proper field to use when a call returns an offset. I guess if we had a call that returns two offsets we'd have a problem - but I don't think we have such a syscall?

Ok, we can keep it in mind as an option if we do run into any ambiguity.

system/lib/wasmfs/proxied_async_js_impl_backend.h

kripken added 30 commits February 4, 2022 12:36

start

36bb44f

work [ci skip]

91326ec

work [ci skip]

91bf512

builds [ci skip]

11f82c7

test passes [ci skip]

fabf251

fix

09aa942

fix

0df35c1

comment

5a0653a

better

17c041f

format

cb4c42f

fix

5fa4f25

comments

650efab

comment [ci skip]

c80a891

Merge remote-tracking branch 'origin/main' into wfjsbs

a49a630

Merge remote-tracking branch 'origin/main' into wfjsbs

6d9e223

rename

e3c9012

start [ci skip]

0e5429b

wip [ci skip]

dc20c8d

[ci skip]

6210afa

work [ci skip]

af004a0

cpp builds [ci skip]

19c0c9e

rename

9a4d052

Merge branch 'wfjsbs' into wfjsbs2

8575f0b

js 'compiles' [ci skip]

4718c65

node? [ci skip]

8b28d20

work [ci skip]

3ec5f1f

work [ci skip]

52bbb0e

work

4c555bd

format [ci skip]

9321fe6

c++ builds again [ci skip]

55928de

kripken added 5 commits February 9, 2022 11:28

work [ci skip]

a9f685b

work [ci skip]

8ad69f2

work [ci skip]

721acea

work [ci skip]

329c114

format

86ec770

kripken requested a review from tlively February 9, 2022 19:36

kripken marked this pull request as ready for review February 9, 2022 19:37

kripken requested a review from sbc100 February 9, 2022 19:37

tlively reviewed Feb 9, 2022

View reviewed changes

Base automatically changed from wfjsbs to main February 9, 2022 23:24

kripken added 3 commits February 9, 2022 15:27

Merge remote-tracking branch 'origin/main' into wfjsbs2

945cb46

refactor to avoid creating a FetchBackend class that creates its own …

771ad77

…proxy

format

621813a

kripken changed the title ~~[WasmFS] [DRAFT] Async proxied JS backend~~ [WasmFS] Async proxied JS backend Feb 9, 2022

docs

fa649e6

tlively approved these changes Feb 10, 2022

View reviewed changes

tlively reviewed Feb 10, 2022

View reviewed changes

system/lib/wasmfs/proxied_async_js_impl_backend.h Outdated Show resolved Hide resolved

kripken added 4 commits February 10, 2022 07:48

update test

cacb8c2

comment

cc77ca9

indent

2b031d7

use arrow functions

cd0d2ef

kripken mentioned this pull request Feb 10, 2022

[WasmFS] Add JSImplBackend #16209

Merged

tlively approved these changes Feb 10, 2022

View reviewed changes

kripken merged commit 3740492 into main Feb 10, 2022

kripken deleted the wfjsbs2 branch February 10, 2022 19:34

kripken mentioned this pull request Feb 16, 2022

Draft: Add support for read/seek on files in the native file system #16307

Open

kripken mentioned this pull request Apr 18, 2022

ASYNCFS: Allow async JS filesystem implementations #9151

Closed

bollwyvl mentioned this pull request Jun 1, 2022

Implement a custom Emscripten File System which communicates with the JupyterLab Content Manager, giving file access to pyolite jupyterlite/jupyterlite#655

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WasmFS] Async proxied JS backend #16229

[WasmFS] Async proxied JS backend #16229

kripken commented Feb 9, 2022 •

edited

Loading

kripken commented Feb 9, 2022

kripken commented Feb 9, 2022

tlively left a comment

tlively Feb 9, 2022

kripken Feb 9, 2022

tlively Feb 9, 2022

kripken Feb 9, 2022 •

edited

Loading

tlively Feb 9, 2022

kripken Feb 9, 2022

tlively left a comment

tlively Feb 10, 2022

kripken Feb 10, 2022

tlively Feb 10, 2022

kripken Feb 10, 2022

tlively Feb 10, 2022

tlively Feb 10, 2022

kripken Feb 10, 2022

tlively Feb 10, 2022

tlively Feb 10, 2022

kripken Feb 10, 2022

tlively Feb 10, 2022

	[](CppCallbackState* state) { (*state->resume)(); },
	[](CppCallbackState* state) { state->resume(); },

[WasmFS] Async proxied JS backend #16229

[WasmFS] Async proxied JS backend #16229

Conversation

kripken commented Feb 9, 2022 • edited Loading

kripken commented Feb 9, 2022

kripken commented Feb 9, 2022

tlively left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kripken Feb 9, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tlively left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kripken commented Feb 9, 2022 •

edited

Loading

kripken Feb 9, 2022 •

edited

Loading