-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Where should the memory model spec live? #68
Comments
Whatever we do here, somebody is going to end up with the short end of the stick. Option 1 is bad for wasm because it makes wasm beholden to JS in a way we don't want to be. Option 2 is bad for interop with the web stack (actually with anything at all unless the wasm model is tied to some other stack specifically here). Option 3 is bad for interop with other technology stacks than the web. Option 3 has been generally assumed in discussions I've been involved in, both in TC39 and elsewhere. And it has the nice feature that there can be more than one of these: there can be a wasm-and-JS spec, and if there are other technology stacks (call them X) where wasm figures prominently there can perhaps be a different wasm-and-X spec. Both wasm-and-JS and wasm-and-X will be constrained by the needs of wasm, but it's likely manageable. |
These are fair questions, but for now they are putting the cart three kilometers before the horse. There is a lot to be resolved first. This is a much more complex discussion than a GitHub issue can swallow, but let me note just a few things. Weak memory models are a very, very hard problem, probably the hardest in language semantics so far. There has been more than a decade of research leading to the current C++ and Java models and they still have fundamental flaws. Defining a memory model for a language is not merely a matter of calling into some operations, it is intimately intertwined with the entire language, and requires a holistic definition. As far as I am aware, nobody has ever tackled a multi-language weak memory model. Such a setting makes an already difficult problem combinatorially harder, and is a completely open research problem. Developing a meaningful and practically feasible approach to this will probably be worth several PhDs on the way. We should not pretend that we have a solution to this problem. Sure, we could write up something nicely sounding that is "shared" between the two documents (in whatever place). But that would just create an illusion. Technically, it would be completely void unless we actually know what we are doing. And a meaningless semantics is not going to help anybody on either end and provides no guarantees whatsoever. A sensible order of attacking this problem is to first figure out a semantics for either language (hard enough!) and build sufficient confidence that they actually do what we think they do. Then we could start thinking about ways to integrate those into some unified model. Everything else is just pretending, and no more useful than putting in an informal note saying that we assume these languages interoperate as expected. The good news is that there already are expert researchers starting to tackle the first part of the problem. |
@lars-t-hansen For Option 1, is there more I could read about the goals of Wasm being independent from JS? Is this about JS object semantics accidentally creeping in, or because of difficulties in working with TC39, or something else? For Option 3, I don't quite understand how it would be bad for technology stacks other than the web. If JS doesn't depend on the web, and Wasm doesn't either, why would their shared memory model doc depend on it? @lars-t-hansen @rossberg-chromium The reason I'm putting the cart before the horse on where we do the memory model before we even have this work all done is because there's a bit of organizational work and lead time to choosing a standards venue, getting everyone to agree on it and reference the document produced, IP issues, boring things like that, that are independent of the actual content. If it's a third document, we should also be prepared to have differences in notation and some kind of "binding" to describe how they interact (since Wasm elected not to use JS's notation, there'll be at least one mismatch). |
@littledan, we are considering Wasm implementations with no JavaScipt whatsoever. Some people have already started building them. Enabling that was an explicit design goal from the beginning, and the reason why we carefully layered design and specs to avoid spurious dependencies. |
I guess I meant to ask more specifically, is anyone considering that for the web embedding, threads might not give an option to share the memory with JavaScript? (There's a there-exists vs for-all difference here...) |
@littledan, not sure if that's what you're asking, but a Wasm module can always choose to keep its memory private. In that case, JS has no way of accessing it. So sharing is not required, but of course it is meant to be possible on the web. |
@rossberg-chromium What you're describing makes sense if we want to convert everything over to an operational memory semantics. This is something I'd love and hope we eventually do; it seems like bleeding-edge research though and it'll take a long time though. In the short-term, I think there is a hybrid approach that still keeps the core wasm spec JS-agnostic by making the necessary interactions points with the JS memory model go through host-environment-defined hooks (kinda like how we do with JS imports). More specifically, it looks like when you, e.g., execute SetValueInBuffer in JS on a SAB, you semantically append a WriteSharedMemory event to some [[EventList]]. To plug into this from wasm, it seems like we just need the core semantics of If we did that, then:
|
@lukewagner, unfortunately, there are several problems with that. For example:
FWIW, multi-language semantics is an ongoing research problem on its own, even without weak memory making everything harder. That is regardless of the specification method. I'm highly skeptical that we would be able to come up with anything meaningful. Or that it is energy well-spent to try. |
@rossberg-chromium
Do we need to?
Is that what we want to be defining as the memory model? Or do we want to be defining a virtual ISA, the same way x86, ARM, POWER, NVIDIA hardware, all have an ISA memory model independent of the languages that execute on them. In that approach it's up to each language to figure out how it maps to the shared model. It seems to me that we can define the WebAssembly virtual ISA's memory model, and (as you say) handwave the mapping for a while, let the PhDs mint themselves. I think we'll still have something that extremely practical and useful albeit not Mathematically Pure And Satisfying. |
nobody seems to know how to properly hook up an axiomatic memory model
with an operational semantics
Do we need to?
If you don't relate it to execution, then what does it mean?
multi-language semantics
Is that what we want to be defining as the memory model? Or do we want to
be defining a virtual ISA, the same way x86, ARM, POWER, NVIDIA hardware,
all have an ISA memory model independent of the languages that execute on
them. In that approach it's up to each language to figure out how it maps
to the shared model.
That's a different scenario. We are not compiling one language into the
other. We have two languages with different concepts that interact with
each other directly. There is no common low-level execution model that
underlies both.
It seems to me that we can define the WebAssembly virtual ISA's memory
model, and (as you say) handwave the mapping for a while, let the PhDs mint
themselves. I think we'll still have something that extremely practical and
useful albeit not Mathematically Pure And Satisfying.
Well, all these memory models are fairly mathematical in nature. As such,
they either make sense or they don't. They can also model the wrong thing,
but other than that there isn't much middle ground.
|
Let's try to break this down... Do you agree that we can separately spec the WebAssembly virtual ISA's memory model? Not whether we should, but whether it is possible? IIUC your point is that if we take this approach, then the mapping of each language to that virtual ISA's model is still imprecise. Correct? Can you describe the alternate approach, where we don't do this? It seems to me like we'd be in a yet more imprecise world for longer still, without any model at all, while we wait for academia to solve a very hard problem. That seems less desirable, but you seem to think otherwise. Can you explain why? |
@rossberg-chromium
IIUC, the "atomic action" is inserting a memory event (read, write, rmw) into the [[EventList]].
What I'm suggesting above is that, until we arrive at something better, we could start by not trying to solve this problem in the core wasm spec but rather just calling "hooks" and it's only when you take JS+wasm together (or X+wasm for non-Web) that you can talk about a memory model. This is symmetric to what we've done with using workers as threads. And just like we can later pull threads "into" wasm (viz pure wasm threads), we could also pull in the memory model later (probably we'd even need to do the two at the same time). |
To be a little more concrete about a refactoring that could make sense to support Wasm more cleanly: Currently, functions like SetValueInBuffer operate on JS values like Number (and BigInt in the possible future) and type names based on TypedArrays. A Wasm binding here would have to go and fake creating those JS values. The memory model can instead talk explicitly about values of particular non-JS types for actual 32-bit and 64-bit floats and ints, and have JS convert into them. This requires a little more wording for JS to hook up to it, and has the benefit that Wasm wouldn't have to create any sort of values that don't make sense in the Wasm context.
@rossberg-chromium Would a memory model be entirely useless if a bug is found in it later? It happens pretty frequently that there are bugs in specs which implementations don't implement, and the world doesn't fall apart. Then the bug gets found and fixed, and everything's OK. I don't think implementations are going to be saying, "Wow, I can derive false from these axioms, so I'll do whatever I want!" |
This discussion is focused on memory operations, but they are not what
actually matters much in a memory model! The crux of a memory model is the
specification of *possible execution orders* between *uses* of these
operations, which is a completely non-obvious problem. Without that there
isn't much point in meddling with abstract operations, they are trivial on
their own (especially the atomic ones!).
So, the primary problem is not that we introduce bugs (although I'm sure we
gonna do that, too). It is that the "specification" will not actually
specify anything of interest. Or worse, specifies the completely wrong
thing. And that's independent of how "mathematical" it is.
And yes, to answer @jfbastien's question, that is *worse* than the
alternatives!
* If we keep the specs separate then we at least have a feasible chance of
saying something reasonable about the memory model of each language
individually. On top of that we can add (for the time being) some
handwaving pointing out that these models are intended to be compatible and
expected to interact in the "obvious" manner.
* If, on the other hand, we prematurely try to share something that is
basically meaningless, then we'll not just have failed to specify interop,
we also will have failed to specify the individual languages themselves.
That is a strictly worse outcome.
So I suggest not making naive attempts to solve a hard problem that nobody
has solved before (and introduce needless complexity and bureaucracy on the
way). Instead, stick to what we have a reasonable chance of getting right,
and leave the rest for future research to figure out.
@lukewagner, I don't think we can avoid talking about threads in the Wasm
spec either. They exist and have observable semantics, regardless of how
they are created (to make an analogy with functions: for now we just have
"host threads"). But I believe they are fairly easy to incorporate and hook
up.
|
@rossberg-chromium The JS spec specifies possible execution orders, no? So if, from a purely JS POV, the core wasm spec is just defining host builtins that can be called by JS and that emit the same memory ordering events in the same agents as JS and thus participate in computing the valid candidate executions, that seems no less well-defined than JS currently is. Right? Of course this means that you can't reason about wasm+threads in isolation, without pulling in some host specification, and I think that is your main point of contention, but my point is that this could be a temporary stage while we work out the more challenging questions of how to do it within wasm in a way that interops with JS. MVP and all that. |
@lukewagner, yes, and the definition of candidate executions relies on agents, which "comprise a set of ECMAScript execution contexts, an execution context stack, a running execution context" etc. All of that only makes sense for JavaScript (besides being rather handwavy). For WebAssembly, completely different definitions would be needed to give it meaning, and nobody knows how. And it's even less clear how to define all that in such a way that uncouples it from the individual language specs, because it interacts with all their execution rules. |
@rossberg-chromium so you're satisfied with the definition of possible execution orders (not all the details, just its general existence), but not with the definition of agents? |
@jfbastien, well, without a suitable Wasm definition of agent that connects it to Wasm execution the definition of execution order is totally vacuous. It wouldn't specify anything about the behaviour of Wasm (nor Wasm+JS). Problem is, nobody knows how to make such a connection, let alone how to make it work properly across multiple language semantics. |
I don't understand why you think this is the case. Defining possible execution orders is the harder part in my understanding, because the other agents we're talking about aren't exotic agents which operate weakly through any of SIMD, different processes, different non-coherent hardware, or different machines. We're only adding same-process agents, those are fairly straightforward in my understanding. Do you have data stating otherwise? I may have misses something in the literature and would like to catch up if that's the case. |
@rossberg-chromium Are you contending that the 2-stage "filtering" semantics in ECMAScript doesn't work correctly? Or is the point that you'd like an operational semantics to compute possible values at "step time", without the computationally intractable thing of considering the universe of all possible candidate reduction traces? I also am confused by why using the ECMAScript axiomatic model as a "filter" on possible reduction traces means it cannot say anything about Wasm. I have more worries about the other way: proving some notion of behavioral equivalence or refinement between two independent weak memory models with different machinery. I have no mental model of how I'm supposed to reason about an SAB shared between browser JS and wasm if there is no common ground somewhere. |
@syg, I see a number of issues and mismatches between the two languages.
For example, off the top of my head:
- The definition of "agent" (as well as execution context), which the
notion of execution depends on, is completely different. You can try to
take an "additive" approach to defining an agent's state, but I still have
no idea how you would describe the actual interaction between the two
languages, i.e., Wasm calls that are nested into JS execution and vice
versa, where execution contexts would have to compose somehow.
- The Ecma spec is not declarative but "algorithmic" and imperative: in
particular, events are created and appended to a list; modifications to
this list are described by spec-internal global mutable state (and there is
quite a bit of hand-waving involved in interpreting how the non-determinism
interacts with that, how the axiomatic rules restrict it, and how threads
are interleaved in general). That does not mesh well with the standard text
book model used in Wasm, which is based on a more precise declarative
reduction semantics.
- The standard way of attaching events to a reduction relation is through
labeled transitions. The labels would be the event lists. In analogy with
the Ecma memory model you could then impose a similar axiom system to
restrict the allowed event sequences. However, this may or may not be the
best way to do it. Do you have any reference where such a combination has
already been used successfully to specify a weak memory model? Or a viable
alternative?
- The "bulk" nature of the memory events currently defined in Ecma is at
odds with a precise semantics. A reduction relation has to allow all
possible interleavings, otherwise the axiomatic "filtering" you mention
isn't really possible. In particular, any read or write that is not atomic,
and thus e.g. allows tearing to be observed, would have to be split into
multiple steps. Consequently, I imagine the memory events would need to be
split up accordingly (on both sides!). (Which may, in fact, simplify the
definition of reads-from.)
- Wasm will eventually have other events than just access to linear memory.
For example, module instantiation or shared globals, which have to be
incorporated into the memory model. There may be things that won't arise on
the JS side, and vice versa. Generally, the set of events is highly
language-specific, and I suspect that any premature attempt to define one abstract
model to rule them all is gonna be cumbersome or incomplete or both, and
cause trouble down the road.
- Most importantly, even if we managed to resolve all that, I expect that
any attempt of factoring the definitions of agents, execution contexts,
memory events, etc will almost inevitably end up being specific to a JS
embedding. That would de facto tie Wasm to JavaScript, with no way to
specify other embeddings.
As mentioned before, both weak memory models and multi-language semantics
are already very hard problems in isolation. And @jfbastien, I am not aware
of any successful attempt to day that solved both together in a meaningful
manner (but I'm happy to be proven wrong!).
I am somewhat optimistic that most of this can be solved eventually, but
honestly, I think we are years away from a proper solution. Rushing
something incoherent won't help anybody either -- @syg, how would you
reason about the SAB interaction when its specification is not meaningful?
That has no more value than informal statements of intent, and creates more
problems than it solves, both technically and administratively.
|
PS: For the experts, the primary mismatch probably boils down to ES
defining (essentially) a big-step semantics, while Wasm is using a
small-step semantics. The problem with big-step is that it provides no
clean way to specify threading, which is why the entire concurrency story
in the ES spec really is a big hand-wave. For Wasm we wanted to avoid that,
which is why we switched to small-step. Unfortunately, it is quite
difficult to model an interaction between the two approaches, even for
trivial toy systems (an extension of the problem that big-step has with
threads already).
|
@rossberg-chromium The first two problems need to be solved in formalizing the JS.md specification, which exactly is supposed to hook up Wasm's formalization with JS for single-threaded code. There's some JS specific parts, but there are other parts that are just about, "actually do this thing in the Wasm spec", which is a bit awkward to word. The latter category isn't really specific to a connection to JS at all; if you were documenting how Wasm executes in any other context, I think at some point you'd have to say, "parse the binary format into an AST, then execute it until it's done executing. And if Wasm calls out into a native function, run it, and run any Wasm callbacks that the native function invokes". If any connection between small-step semantics and imperative descriptions of what actually might happen is hand-wavy by your standards, then we're just going to have to have a somewhat hand-wavy specification. The alternative would be a formalism floating in the air and no documented way to invoke it. (Is there any sort of bug thread or discussion notes from when the current formalism was chosen? I don't really understand how the Wasm spec is more precise than the JS one (ignoring the areas where the JS spec is deliberately imprecise).) |
Andreas, Luke, and I discussed near(er)-term plans for the wasm memory model, focusing on interop with the JS embedding. Please correct me if I incorrectly sum up anything here. The nearer term plan would be to try to factor out a common core event semantics and the weakest-possible-yet-still-meaningful set of constraints. Instead of this core event semantics be normatively referenced by both JS and wasm, the normative versions for both specs will remain inlined, so as to reduce friction for moving forward. There will then be a non-normative note (Andreas suggested the best place would be the JS-wasm bridge spec) with concise math that serves as the intended semantics that both JS and wasm are supposed to be observationally equivalent to. I'm not sure how this should be best spun -- the intent needs to communicate that if the event semantics diverge between JS and wasm, that is a spec bug in one, or both. This is not quite the teeth that "non-normative" has. What do folks think is the best approach here? Perhaps Dan can chime in. On the academic research front, Peter's work has been great and it'd be awesome if he looks into this. I think it is very important that he collaborates with the work that Clark and Cristian has already done. There's no current plans with Peter's group AFAIK, however. I'll be happy to float the idea of this style of factoring to TC39 once details are more concrete on the wasm side. |
@syg summed it up nicely. Here are some additional details of our discussion (@syg and @lukewagner correct me if I'm wrong). There are two properties of the Wasm core spec that we want to maintain:
The goal for the memory model hence is to
Details to be worked out. The basic idea is that the Wasm spec will contain a "minimum" memory model, but embedder specs (such as the JS API) can refine and extend it with additional host events and ordering constraints. For now, an axiomatic formulation seems like the easiest way to get there. I will look into ways of integrating that with the Wasm spec. Obviously, it is not clear what a sufficiently "generic" model really is or how different spec approaches to semantics could be reconciled for other languages. So despite the first goal above, the initial model will naturally be informed primarily by the needs of JS. But at least we want to avoid knowingly introducing dependencies. We expect that the model will evolve if other embedder specs pop up or research on memory models offers new insights. Moreover, we discussed how to implement sharing between the Wasm and the ES spec. It seems like the amount of definitions that would actually be common to both specs is not really large enough to motivate yet another spec (especially if kept non-JS-specific). It would create tedious indirection and extra churn, and may complicate adapting to new Wasm embedders. As Shu mentioned, we arrived at the idea of rather including the respective pieces in both specs, and then describe the unified semantics in the JS API spec that bridges the two. |
Being independent of various things sounds like a great goal for keeping these specs clear, meaningful and capable of evolving into the future. If there's sufficient non-normative text that can explain what's supposed to going on to normal implementers and users, this could be a working solution for everyone. However, there's another goal which is (unfortunately) in tension with this independence. That goal is having the combination of all of the specifications completely describe the system, to the point that could, in theory, read all the specs in the world and write a web browser that can run the whole web. In several other areas of the web platform, ideas about definitions being clean and independent of other things has gotten in the way of being complete enough (it would be too messy to specify that part!), or has made their evolution over time difficult (e.g., in adding features that integrate with other parts which breaks the abstraction boundaries). Sometimes, abstraction boundaries turn out to be in the wrong place when attempting to actually take advantage of the generic-ness. I hope, as the Wasm specification evolves here, we can keep in mind the balance between good abstractions and complete, web-ready descriptions. |
That's a good goal, but the reality of memory models is that no implementer will read the actual memory model for implementation guidance. They will read the non normative guidelines or will lean on experts. As far as the MM goes, then, I think the outlined approach doesn't run counter to this goal in practice.
…--
shu
On Sep 18, 2017, at 09:05, Daniel Ehrenberg ***@***.***> wrote:
Being independent of various things sounds like a great goal for keeping these specs clear, meaningful and capable of evolving into the future. If there's sufficient non-normative text that can explain what's supposed to going on to normal implementers and users, this could be a working solution for everyone.
However, there's another goal which is (unfortunately) in tension with this independence. That goal is having the combination of all of the specifications completely describe the system, to the point that could, in theory, read all the specs in the world and write a web browser that can run the whole web. In several other areas of the web platform, ideas about definitions being clean and independent of other things has gotten in the way of being complete enough (it would be too messy to specify that part!), or has made their evolution over time difficult (e.g., in adding features that integrate with other parts which breaks the abstraction boundaries). Sometimes, abstraction boundaries turn out to be in the wrong place when attempting to actually take advantage of the generic-ness.
I hope, as the Wasm specification evolves here, we can keep in mind the balance between good abstractions and complete, web-ready descriptions.
―
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Three options:
Any thoughts on these options?
(thread split off from #9)
The text was updated successfully, but these errors were encountered: