-
Notifications
You must be signed in to change notification settings - Fork 720
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
wasm2c supporting threads and other features #1766
Comments
It is possible in principle, but would take some work:
|
As long as it is possible haha. @kripken Out of interest, what are atomics? Are they the same as the concurrency-safe variables I've come across in languages such as Java and Go? Secondly, is pthreads part of Emscripten? When looking for existing issues on the matter, I think I came across an issue mentioning pthreads. Aha, I found it: #1645 |
They are lower-level than most languages. Basically they are used to implement the language-level features.
Yes, Emscripten has stable support for pthreads: https://emscripten.org/docs/porting/pthreads.html That uses a bunch of JS for things not supported in wasm yet, like creating web workers and handling the inability to block on the main thread. It works on the web and in node, but atm not anywhere else AFAIK. |
Wasm has semantics for data races and mixed atomic/non-atomic accesses and partly-aliased accesses where C has only UB, so there's a fairly big caveat here that for an efficient and idiomatic translation, the wasm has to be well-behaved according to C shared-memory semantics to avoid UB. |
As long as it is possible. So the onus is on the WASM to prevent UB? That makes sense. But being able to compile the WASM code to C and then compile that to machine code should have a decent performance boost. |
The onus is on the Wasm producer (source language compiler + binaryen + linker + ...) to produce wasm that is of a form that will allow wasm2c to produce UB-free C code. |
@lars-t-hansen, is that more than just a theoretic possibility? The producer would need to know precisely what wasm2c spits out, which it probably cannot even rely on being the same across versions. And then safely steer around the gigantic chasms that are UB in C. Truth probably is that the idea that you could efficiently compile Wasm through C (or LLVM, for that matter) while correctly maintaining all its semantics is hopeless, at least with threads. C's (and LLVM's) UB semantics is much too weak and infectious. In practice, the semantic discrepancy may not matter, though. |
If the code was compiled to WASM from C, then would it still produce errors when decompiling to C? |
No, I don't think so.
The main problem, I expect, is that the C compiler may observe that something is UB and start rewriting the code based on that observation. How big a problem this might be is hard to quantify. It's probably at most a small problem, but one could imagine static memory addresses in wasm turning into sufficiently predictable pointer values in C, and if the wasm code does something interesting via those pointers I could believe UB might be observable. @rhobro, even if the source language is C it's possible the compiler+tools introduces optimized code that, if translated back to C, violate C's idea of undefined behavior. For example, a reasonable C compiler would assume that the integer representation is two's complement and optimize based on that, but that assumption may be invalid in C. |
So how could this be implemented? |
I think that was answered here? #1766 (comment) Those two tasks are basically the steps (which mirror most other features for wasm2c, although here the second task, the runtime, would be larger). It is very true that there is some risk of C UB that needs to be considered carefully during that process, as has been mentioned since. But it may be easier to estimate how big an issue that is after work starts, with concrete code. |
Thanks. When can this be implemented? |
There is no way to really estimate that until someone volunteers to do the work. Usually how these things go is that someone interested enough in the feature, like yourself, will decide to implement it. Once someone starts on this, I'd expect it would take around a week to get something basic working, in addition to any time to ramp up on the codebase. Implementing all the various instructions and corner cases would take somewhat longer, maybe a few more weeks. Aside from that, there are the open questions about undefined behavior as mentioned before - we might only see how serious that issue is once enough is implemented. |
I would implement it if I could 😂 but I don't understand the workings under the hood well enough. Ok. Can someone tag the issue as an enhancement or something? |
Hi there,
I was trying to convert a WASM to C when wasm2c exited saying that the WASM file used threads.
Would it be possible to enable this feature in wasm2c?
Thanks
The text was updated successfully, but these errors were encountered: