Text format revisions #884

tlively · 2018-10-02T18:18:11Z

At the Oct. 2 CG meeting, there was unanimous consent to consider making changes to the text format to improve the consistency of instruction names and remove special characters. Specifically, interest has been expressed in removing slashes from all type conversion (wrap, trunc, extend, convert, demote, promote, and reinterpret) instructions and unifying the formats of memory and global instructions.

To minimize ecosystem disruption, we will plan to make all changes we agree on at once and update as many tools as possible at roughly the same time. We would like to resolve this issue quickly to avoid holding up the standardization process.

What final changes do we want to make?

tlively · 2018-10-02T18:26:23Z

I propose replacing slashes with underscores. For example, i32.wrap/i64 would become i32.wrap_i64.

Related discussion for nontrapping float to int conversions here: WebAssembly/nontrapping-float-to-int-conversions#4.

rossberg · 2018-10-02T20:24:56Z

I'm fine with underscore for conversions. If we do that, I would propose switching around the _u/_s suffixes, so that they consistently go last in all mnemonics. That is,

i32.trunc_f32_u, i32.trunc_f32_s and so on

This makes it easier to consistently recognise e.g. all i32_xxx_u operating specifically on unsigned i32's.

The other changes I would propose, for consistency with memory and upcoming table instructions (see bulk operations proposal):

{get,set}_global -> global.{get,set}
{get,set,tee}_local -> local.{get,set,tee}

(For the record, one could also consider call -> func.call, but I won't be proposing that.)

binji · 2018-10-02T21:16:10Z

OK, so then the full list would be like this, right?

;; table declaration
anyfunc -> funcref

;; core instructions
get_local -> local.get
set_local -> local.set
tee_local -> local.tee
get_global -> global.get
set_global -> global.set
i32.wrap/i64 -> i32.wrap_i64
i32.trunc_s/f32 -> i32.trunc_f32_s
i32.trunc_u/f32 -> i32.trunc_f32_u
i32.trunc_s/f64 -> i32.trunc_f64_s
i32.trunc_u/f64 -> i32.trunc_f64_u
i64.extend_s/i32 -> i64.extend_i32_s
i64.extend_u/i32 -> i64.extend_i32_u
i64.trunc_s/f32 -> i64.trunc_f32_s
i64.trunc_u/f32 -> i64.trunc_f32_u
i64.trunc_s/f64 -> i64.trunc_f64_s
i64.trunc_u/f64 -> i64.trunc_f64_u
f32.convert_s/i32 -> f32.convert_i32_s
f32.convert_u/i32 -> f32.convert_i32_u
f32.convert_s/i64 -> f32.convert_i64_s
f32.convert_u/i64 -> f32.convert_i64_u
f32.demote/f64 -> f32.demote_f64
f64.convert_s/i32 -> f64.convert_i32_s
f64.convert_u/i32 -> f64.convert_i32_u
f64.convert_s/i64 -> f64.convert_i64_s
f64.convert_u/i64 -> f64.convert_i64_u
f64.promote/f32 -> f64.promote_f32
i32.reinterpret/f32 -> i32.reinterpret_f32
i64.reinterpret/f64 -> i64.reinterpret_f64
f32.reinterpret/i32 -> f32.reinterpret_i32
f64.reinterpret/i64 -> f64.reinterpret_i64

;; saturating-float-to-int instructions
i32.trunc_s:sat/f32 -> i32.trunc_sat_f32_s
i32.trunc_u:sat/f32 -> i32.trunc_sat_f32_u
i32.trunc_s:sat/f64 -> i32.trunc_sat_f64_s
i32.trunc_u:sat/f64 -> i32.trunc_sat_f64_u
i64.trunc_s:sat/f32 -> i64.trunc_sat_f32_s
i64.trunc_u:sat/f32 -> i64.trunc_sat_f32_u
i64.trunc_s:sat/f64 -> i64.trunc_sat_f64_s
i64.trunc_u:sat/f64 -> i64.trunc_sat_f64_u

;; simd instructions
f32x4.convert_s/i32x4 -> f32x4.convert_i32x4_s
f32x4.convert_u/i32x4 -> f32x4.convert_i32x4_u
f64x2.convert_s/i64x2 -> f64x2.convert_i64x2_s
f64x2.convert_u/i64x2 -> f64x2.convert_i64x2_u
i32x4.trunc_s/f32x4:sat -> i32x4.trunc_sat_f32x4_s
i32x4.trunc_u/f32x4:sat -> i32x4.trunc_sat_f32x4_u
i64x2.trunc_s/f64x2:sat -> i64x2.trunc_sat_f64x2_s
i64x2.trunc_u/f64x2:sat -> i64x2.trunc_sat_f64x2_u

;; atomic instructions
i32.atomic.rmw8_u.add -> i32.atomic.rmw8.add_u
i32.atomic.rmw16_u.add -> i32.atomic.rmw16.add_u
i64.atomic.rmw8_u.add -> i64.atomic.rmw8.add_u
i64.atomic.rmw16_u.add -> i64.atomic.rmw16.add_u
i64.atomic.rmw32_u.add -> i64.atomic.rmw32.add_u
i32.atomic.rmw8_u.sub -> i32.atomic.rmw8.sub_u
i32.atomic.rmw16_u.sub -> i32.atomic.rmw16.sub_u
i64.atomic.rmw8_u.sub -> i64.atomic.rmw8.sub_u
i64.atomic.rmw16_u.sub -> i64.atomic.rmw16.sub_u
i64.atomic.rmw32_u.sub -> i64.atomic.rmw32.sub_u
i32.atomic.rmw8_u.and -> i32.atomic.rmw8.and_u
i32.atomic.rmw16_u.and -> i32.atomic.rmw16.and_u
i64.atomic.rmw8_u.and -> i64.atomic.rmw8.and_u
i64.atomic.rmw16_u.and -> i64.atomic.rmw16.and_u
i64.atomic.rmw32_u.and -> i64.atomic.rmw32.and_u
i32.atomic.rmw8_u.or -> i32.atomic.rmw8.or_u
i32.atomic.rmw16_u.or -> i32.atomic.rmw16.or_u
i64.atomic.rmw8_u.or -> i64.atomic.rmw8.or_u
i64.atomic.rmw16_u.or -> i64.atomic.rmw16.or_u
i64.atomic.rmw32_u.or -> i64.atomic.rmw32.or_u
i32.atomic.rmw8_u.xor -> i32.atomic.rmw8.xor_u
i32.atomic.rmw16_u.xor -> i32.atomic.rmw16.xor_u
i64.atomic.rmw8_u.xor -> i64.atomic.rmw8.xor_u
i64.atomic.rmw16_u.xor -> i64.atomic.rmw16.xor_u
i64.atomic.rmw32_u.xor -> i64.atomic.rmw32.xor_u
i32.atomic.rmw8_u.xchg -> i32.atomic.rmw8.xchg_u
i32.atomic.rmw16_u.xchg -> i32.atomic.rmw16.xchg_u
i64.atomic.rmw8_u.xchg -> i64.atomic.rmw8.xchg_u
i64.atomic.rmw16_u.xchg -> i64.atomic.rmw16.xchg_u
i64.atomic.rmw32_u.xchg -> i64.atomic.rmw32.xchg_u
i32.atomic.rmw8_u.cmpxchg -> i32.atomic.rmw8.cmpxchg_u
i32.atomic.rmw16_u.cmpxchg -> i32.atomic.rmw16.cmpxchg_u
i64.atomic.rmw8_u.cmpxchg -> i64.atomic.rmw8.cmpxchg_u
i64.atomic.rmw16_u.cmpxchg -> i64.atomic.rmw16.cmpxchg_u
i64.atomic.rmw32_u.cmpxchg -> i64.atomic.rmw32.cmpxchg_u

Note: Edited to reflect @rossberg's comment below.
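
For tool authors applying the non-conversion part of this list mechanically, a minimal sketch in Python might look like the following. The helper name `rename_keywords` and the `RENAMES` table are hypothetical, made up for illustration, and the sketch assumes a plain token-level substitution is acceptable: it does not skip comments or string literals, so it is not a substitute for a real parser.

```python
import re

# Non-conversion renames taken from the list above (old -> new).
RENAMES = {
    "get_local": "local.get",
    "set_local": "local.set",
    "tee_local": "local.tee",
    "get_global": "global.get",
    "set_global": "global.set",
    "anyfunc": "funcref",
}

# Match an old name only when it stands alone as a token: not preceded by
# "$" (an identifier such as $get_local), a word character, or a dot, and
# not followed by a word character or a dot.
_PATTERN = re.compile(
    r"(?<![\w.$])(" + "|".join(map(re.escape, RENAMES)) + r")(?![\w.])"
)


def rename_keywords(wat_source: str) -> str:
    """Rewrite old keyword spellings to the new ones (hypothetical helper)."""
    return _PATTERN.sub(lambda m: RENAMES[m.group(1)], wat_source)
```

For instance, `rename_keywords("(set_global $g (get_local 0))")` yields `(global.set $g (local.get 0))`, while an identifier such as `$get_local` is left alone.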

rossberg · 2018-10-02T23:56:11Z

It would be more consistent to have the _u/_s go after the _sat, for the same reasons mentioned above.

In fact, I suggest establishing the following consistent naming scheme for numeric instruction mnemonics:

t.xxx for all numeric instructions agnostic to signedness
t.xxx_u/t.xxx_s for all numeric instructions sensitive to signedness

where xxx describes the specific operation (e.g., add). (That already holds for everything but conversions.)

Conversions are then instantiations of this scheme with xxx = yyy_t, yielding:

t.yyy_t for conversions agnostic to signedness
t.yyy_t_u/t.yyy_t_s for conversions sensitive to sign

where yyy describes the kind of conversion (e.g., trunc or trunc_sat).
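
The scheme above can be sketched as a single rewrite rule from the old slash-based conversion mnemonics to the t.yyy_t / t.yyy_t_s form. This is only an illustration in Python: the function name `rename_conversion` and the regular expression are assumptions, not part of any tool, and the pattern checks only the shape of the mnemonic, not whether the instruction exists.

```python
import re

# Shape of an old-style conversion mnemonic, e.g. "i32.trunc_s:sat/f32".
_OLD_CONVERSION = re.compile(
    r"(?P<dst>[a-z0-9]+)\."      # type before the dot, e.g. i32 or i32x4
    r"(?P<op>[a-z]+)"            # conversion kind, e.g. trunc or wrap
    r"(?:_(?P<sign>[su]))?"      # optional signedness marker
    r"(?P<sat_pre>:sat)?"        # ":sat" before the slash (scalar spelling)
    r"/(?P<src>[a-z0-9]+)"       # source type after the slash
    r"(?P<sat_post>:sat)?"       # ":sat" after the source type (SIMD spelling)
)


def rename_conversion(old: str) -> str:
    """Map an old conversion mnemonic to the new scheme (hypothetical helper).

    Anything that does not look like an old-style conversion is returned
    unchanged.
    """
    m = _OLD_CONVERSION.fullmatch(old)
    if m is None:
        return old
    op = m["op"]
    if m["sat_pre"] or m["sat_post"]:
        op += "_sat"                      # yyy = trunc_sat
    new = f"{m['dst']}.{op}_{m['src']}"   # t.yyy_t
    if m["sign"]:
        new += f"_{m['sign']}"            # signedness always goes last
    return new
```

With this rule, `i32.trunc_u:sat/f64` becomes `i32.trunc_sat_f64_u` and `f64.promote/f32` becomes `f64.promote_f32`, matching the list earlier in the thread.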

binji · 2018-10-03T00:05:17Z

OK, I've updated the comment above to match this.

lars-t-hansen · 2018-10-03T06:23:35Z

Since we're just chatting anyway: It irks me that f64.lt (say) does not fit the general pattern; it should really be i32.lt_f64.

lars-t-hansen · 2018-10-03T06:28:00Z

Oh, and atomics:

i32.atomic.load8_u fits the proposed pattern.
i32.atomic.rmw8_u.add more or less does not.

rossberg · 2018-10-03T15:30:08Z

@lars-t-hansen, I suppose you cannot interpret the type before the dot strictly as the result type of an operator. It's the type that the operator "is defined at" in some looser sense.

lars-t-hansen · 2018-10-03T16:06:22Z

I'm not sure why I wouldn't want to interpret the type strictly as the result type of the operator, though, since that pattern is used very broadly, just not in some cases where it's suddenly about the arguments and the result type is implied (in all cases I can think of the implied result type is i32). Of course there's a succinctness to the instructions that fall under that exception that would be lost if we applied the general pattern, but I don't know why that is an argument either.

(I'm not just trolling, I actually think the current syntax for comparisons is confusing and counterintuitive. Even lt_f64 -- without any type-designating prefix -- would have been better than the current f64.lt.)

binji · 2018-10-03T19:18:19Z

I dunno, i32.lt_f64 looks super weird to me. If we think of the type before the . as a result type, many instructions don't make sense, such as i32.store or even memory.size. Personally I don't think it's that valuable to encode the types in the instruction name anyway, we just need a way to differentiate them.

tlively · 2018-10-04T02:16:32Z

I think of the type before the dot as signifying what type the generic instruction is specialized on. Since all lts return i32, the type before the dot tells me what type the operands are instead, since those still need to be disambiguated.

alexcrichton · 2018-10-04T07:12:43Z

FWIW, here is some possible breakage that I think is worth considering when doing this:

Libraries that want to be idiomatic with the current wasm spec will need breaking changes to update their type names to reflect these name changes. For example, a parser library in Rust, parity-wasm, would need a breaking change to rename Instruction::GetLocal to Instruction::LocalGet.
We've got tests in Rust that verify that the correct instruction is generated for a particular function (namely for intrinsics like SIMD and such). These instruction names come from disassembling the wasm binary itself with wasm2wat, and the instructions to assert for are embedded in the source code. If wasm2wat changes its output, these tests will start to break.
A final one, which we've talked about in the Rust project historically, is the "blog post" problem. There are blog posts, tutorials, reference materials, etc. on the internet today that reference wasm instructions by the old names. If the standard renames a number of instructions (especially common ones like get_local), then all this documentation becomes out of date and needs an update.

To be clear, I'm not opposed to changing the instructions to be more consistent, but I wanted to be sure to point out some perhaps more subtle places where "breakage" can arise in the non-traditional sense. They're all pretty minor and we're certainly more than willing to do the updates on the Rust side of things, but these are points that may be worth considering!

Also @binji your comment with an exhaustive list of renames is super helpful!
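
One way to soften the second point (tests that assert on disassembler output) during a transition is to accept either spelling. The sketch below is in Python rather than Rust and is purely illustrative: `OLD_NAME` and `disassembly_contains` are hypothetical names, the table covers only a few of the renames, and the disassembly text is assumed to have been produced elsewhere (the sketch does not invoke wasm2wat).

```python
# New mnemonic -> old mnemonic, for the renames a given test cares about.
# Only a small subset of the full list is shown here.
OLD_NAME = {
    "local.get": "get_local",
    "local.set": "set_local",
    "global.get": "get_global",
    "i32.wrap_i64": "i32.wrap/i64",
    "i32.trunc_sat_f32_s": "i32.trunc_s:sat/f32",
}


def disassembly_contains(disassembly: str, new_name: str) -> bool:
    """Return True if some token is new_name or its pre-rename spelling."""
    accepted = {new_name, OLD_NAME.get(new_name, new_name)}
    # Split on whitespace after turning parentheses into separators, so that
    # both folded "(local.get 0)" and flat "local.get 0" forms are handled.
    tokens = disassembly.replace("(", " ").replace(")", " ").split()
    return any(tok in accepted for tok in tokens)
```

A test written against the new name then passes whether the disassembler in use prints `get_local 0` or `local.get 0`.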

binji · 2018-10-04T18:32:34Z

Right, we've already experienced some of the same issues when changing grow_memory to memory.grow and current_memory -> memory.size, and these instructions are much more common.

@lars-t-hansen is right about atomics too, these are odd now:

i32.atomic.rmw.add 
i64.atomic.rmw.add 
i32.atomic.rmw8_u.add
i32.atomic.rmw16_u.add
i64.atomic.rmw8_u.add
i64.atomic.rmw16_u.add
i64.atomic.rmw32_u.add
...

rossberg · 2018-10-09T13:39:32Z

@binji, good point. I propose moving the _u/_s to the end for those as well (which arguably fits existing instructions).
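
Moving the suffix to the end for the narrow read-modify-write atomics can be sketched as follows. The helper name `rename_atomic_rmw` is hypothetical, and the pattern only checks the shape of the mnemonic; it does not validate that a given type and width combination exists.

```python
import re

# Old shape: <type>.atomic.rmw<width>_u.<op>, e.g. "i32.atomic.rmw8_u.add".
_OLD_RMW = re.compile(r"(i32|i64)\.atomic\.rmw(8|16|32)_u\.([a-z]+)")


def rename_atomic_rmw(old: str) -> str:
    """Move the _u suffix to the end of a narrow rmw mnemonic (sketch)."""
    m = _OLD_RMW.fullmatch(old)
    if m is None:
        # Full-width rmw ops, loads, and stores are left unchanged.
        return old
    ty, width, op = m.groups()
    return f"{ty}.atomic.rmw{width}.{op}_u"
```

For example, `i64.atomic.rmw32_u.cmpxchg` becomes `i64.atomic.rmw32.cmpxchg_u`, while `i32.atomic.rmw.add` and `i32.atomic.load8_u` are returned as-is.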

binji · 2018-10-09T18:22:11Z

OK, updated my comment above to include these as well.

rossberg · 2018-10-15T13:48:43Z

Oh, that reminds me of one more thing that I'd like to suggest. In the light of the reference type and GC proposals it would be more consistent if we renamed anyfunc to funcref, so that it lines up with the likes of anyref, eqref, optref, and possibly other subtypes of ref.

binji · 2018-10-16T18:27:22Z

OK, added that to the list.

I think as a good first step, we should file bugs in the various trackers where the name change is relevant and link to this issue.

aardappel · 2018-10-18T20:57:12Z

Apologies for the bike-shedding, but I don't see a whole lot of consistency in the use of . and _.

So far the most frequent use of . has been to separate the type operated upon (input types, not return type) before the . from the rest of the instruction. Which to me means operations like get_local which are not type specific shouldn't have a . in them. And I'm not sure why there are multiple . in atomic ops but not in others.

I'd suggest that either . has a very intuitive meaning like above, or maybe we should do away with . as well, and make _ the only separator.

In fact, while I am ranting, why does i32.add have a type and get_local does not? Both participate in completely static dataflow determined by the stack and locals, and both will have completely static types that are verified when bytecode is loaded. If get_local loads the wrong type, that is an error, but that is not obvious from the instruction. If i32.add receives a wrong operand that is also an error, but here it is more obvious. If instead the instruction was add it wouldn't really change anything, beyond knowing which of the 2 operands is wrong.

tlively · 2018-12-15T00:22:13Z

@binji it looks like we have generally settled on the list above. Is there any action we need to take to officially finalize it?

binji · 2018-12-15T00:31:30Z

I don't think so, we agreed to the change in the CG meeting, and the change to the spec has landed. I think at this point we should update the tools to use the new names.

Summary: An automated renaming of all the instructions listed at WebAssembly/spec#884 (comment) as well as some similarly-named identifiers.
Reviewers: aheejin, dschuff, aardappel
Subscribers: sbc100, jgravelle-google, eraman, sunfish, jfb, llvm-commits
Differential Revision: https://reviews.llvm.org/D56338
llvm-svn: 350609

These instructions were renamed in the October 2, WebAssembly CG meeting. The issue describing the change is here: WebAssembly/spec#884
Change-Id: Ia9e8733156b5ed5db7fc9ab1681c1a51b874dd71
Reviewed-on: https://chromium-review.googlesource.com/c/v8/v8/+/1620681
Reviewed-by: Clemens Hammacher
Commit-Queue: Ben Smith
Cr-Commit-Position: refs/heads/master@{#61711}

* Define non-trapping float-to-int conversions. This also introduces the concept of prefix bytes, and defines the "numeric" prefix byte, used for encoding the new conversion instructions.
* Add feature markers.
* Add the feature marker in more places.
* Rename "numeric" to "misc". See WebAssembly/nontrapping-float-to-int-conversions#5.
* Rename opcodes. See WebAssembly/spec#884 (comment).

tlively mentioned this issue Oct 4, 2018

Delete the old wasm memory intrinsics rust-lang/stdarch#563

Closed

binji mentioned this issue Oct 16, 2018

Rename instructions in text format WebAssembly/wabt#933

Closed

binji mentioned this issue Oct 25, 2018

Change saturating truncation operator names WebAssembly/simd#26

Closed

rossberg mentioned this issue Nov 29, 2018

[spec/interpreter/tests] Rename instructions #926

Merged

dtig mentioned this issue Dec 17, 2018

Consistency with non-trapping truncate proposal WebAssembly/simd#25

Closed

tlively added a commit to tlively/binaryen that referenced this issue Jan 7, 2019

Massive renaming

3c86aad

Automated renaming according to WebAssembly/spec#884 (comment).

tlively added a commit to tlively/binaryen that referenced this issue Jan 7, 2019

Massive renaming

7270886

Automated renaming according to WebAssembly/spec#884 (comment).

tlively mentioned this issue Jan 7, 2019

Massive renaming WebAssembly/binaryen#1855

Merged

tlively added a commit to WebAssembly/binaryen that referenced this issue Jan 7, 2019

Massive renaming (#1855)

7d94900

Automated renaming according to WebAssembly/spec#884 (comment).

aheejin added a commit to aheejin/threads that referenced this issue Feb 18, 2019

Atomic instruction renaming

1f047cc

This renames instructions according to WebAssembly/spec#884 (comment).

aheejin mentioned this issue Feb 18, 2019

Atomic instruction renaming WebAssembly/threads#132

Merged

binji pushed a commit to WebAssembly/threads that referenced this issue Feb 18, 2019

Atomic instruction renaming (#132)

142fd21

This renames instructions according to WebAssembly/spec#884 (comment).

Horcrux7 mentioned this issue Feb 19, 2019

Switch to the new text format i-net-software/JWebAssembly#3

Closed

neelance mentioned this issue Mar 30, 2019

Text format revisions WebAssembly/nontrapping-float-to-int-conversions#6

Closed

aheejin added a commit to aheejin/nontrapping-float-to-int-conversions that referenced this issue Mar 30, 2019

Instruction renaming

e7bc2fe

Renames instructions as discussed in WebAssembly/spec#884 (comment).

aheejin added a commit to aheejin/nontrapping-float-to-int-conversions that referenced this issue Mar 30, 2019

Instruction renaming

9723fda

Renames instructions as discussed in WebAssembly/spec#884 (comment). Closes WebAssembly#4 and fixes WebAssembly#6.

aheejin mentioned this issue Mar 30, 2019

Instruction renaming WebAssembly/nontrapping-float-to-int-conversions#7

Merged

sunfishcode pushed a commit to WebAssembly/nontrapping-float-to-int-conversions that referenced this issue Apr 2, 2019

Instruction renaming

ce495e2

Renames instructions as discussed in WebAssembly/spec#884 (comment). Closes #4 and fixes #6.

AlanFoster mentioned this issue Jun 10, 2019

Align with latest wasm naming convention AlanFoster/bug#4

Merged

rossberg mentioned this issue Jul 5, 2019

Confused about text format #1042

Closed

binji mentioned this issue Jul 22, 2019

On funcref replacing anyref in text format WebAssembly/binaryen#2248

Closed

sunfishcode pushed a commit to WebAssembly/nontrapping-float-to-int-conversions that referenced this issue Aug 27, 2019

Instruction renaming

b8abf12

Renames instructions as discussed in WebAssembly/spec#884 (comment). Closes #4 and fixes #6.

sunfishcode added a commit to WebAssembly/design that referenced this issue Aug 30, 2019

Rename opcodes.

88625a7

See WebAssembly/spec#884 (comment).

tlively added a commit to WebAssembly/simd that referenced this issue Sep 13, 2019

Update conversion op names

b10d3bf

This renaming was decided on in WebAssembly/spec#884.

tlively mentioned this issue Sep 13, 2019

Update conversion op names WebAssembly/simd#108

Merged

dtig pushed a commit to WebAssembly/simd that referenced this issue Sep 13, 2019

Update conversion op names (#108)

b36ca02

* Update conversion op names This renaming was decided on in WebAssembly/spec#884.

Honry pushed a commit to Honry/simd that referenced this issue Oct 19, 2019

[spec] Fix comment still referring to old syntax (#957)

1c16c7a

Addresses [this comment](WebAssembly/spec#884 (comment)).

Honry pushed a commit to Honry/simd that referenced this issue Oct 19, 2019

Update conversion op names (WebAssembly#108)

af4e3a1

* Update conversion op names This renaming was decided on in WebAssembly/spec#884.

tpmccallum mentioned this issue Dec 29, 2019

funcref vs anyfunc wasdk/WebAssemblyStudio#448

Open

This was referenced Mar 19, 2020

https://webassembly.github.io/threads is out of date WebAssembly/threads#152

Open

Adding tests for wat compatibility wasdk/wasmparser#33

Closed

This was referenced May 20, 2020

[wabt-compatibility] Atomic instruction, tail-call, threads flag always on wasdk/wasmparser#48

Merged

[wabt-compatiblity] Saturating float to int, reference types, simd wasdk/wasmparser#49

Merged

pjmlp mentioned this issue Jun 5, 2022

Examples using get_local should use local.get PacktPublishing/Practical-WebAssembly#6

Closed

womeier mentioned this issue Aug 7, 2023

update type keywords rhysd/vim-wasm#4

Merged

yurydelendik mentioned this issue Aug 14, 2023

Future of old syntax for local.get or global.set bytecodealliance/wasm-tools#1164

Closed

larry0x mentioned this issue Nov 3, 2023

Update WAT format in several test cases CosmWasm/cosmwasm#1935

Merged
