CL/aarch64: implement the wasm SIMD `i32x4.dot_i16x8_s` instruction #2327

julian-seward1 · 2020-10-27T14:05:59Z

This patch implements, for aarch64, the following wasm SIMD extension:

* `i32x4.dot_i16x8_s` instruction (WebAssembly/simd#127)

It also updates dependencies as follows, so that the new instruction can
be parsed, decoded, etc.:

* wat to 1.0.27
* wast to 26.0.1
* wasmparser to 0.65.0
* wasmprinter to 0.2.12

The changes are straightforward:

* new CLIF instruction `widening_pairwise_dot_product_s`
* translation from wasm into `widening_pairwise_dot_product_s`
* new AArch64 instructions `smull`, `smull2` (part of the `VecRRR` group)
* translation from `widening_pairwise_dot_product_s` to `smull ; smull2 ; addv`

There is no testcase in this commit, because the testcases live in a separate
repo. The implementation has been tested nevertheless.
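
For readers who want to check the lowering listed in the description, here is a minimal scalar sketch in Rust. It is not code from the patch: the function names `dot_i16x8_s`, `widening_mul`, `add_pairwise` and `dot_via_widening_mul` are made up for illustration, and arrays stand in for vector registers. The description names `addv` as the last step; the sketch assumes a pairwise add of adjacent lanes, since that is what the wasm semantics call for.

```rust
/// Reference semantics of wasm `i32x4.dot_i16x8_s`: sign-extend each
/// 16-bit lane to 32 bits, multiply lane-wise, then add adjacent products.
fn dot_i16x8_s(a: [i16; 8], b: [i16; 8]) -> [i32; 4] {
    let mut out = [0i32; 4];
    for i in 0..4 {
        let even = i32::from(a[2 * i]) * i32::from(b[2 * i]);
        let odd = i32::from(a[2 * i + 1]) * i32::from(b[2 * i + 1]);
        // The sum overflows only when both products are 2^30; it then wraps.
        out[i] = even.wrapping_add(odd);
    }
    out
}

/// Hypothetical model of a widening signed multiply over four lanes.
/// `base == 0` plays the role of `smull` (low halves),
/// `base == 4` plays the role of `smull2` (high halves).
fn widening_mul(a: [i16; 8], b: [i16; 8], base: usize) -> [i32; 4] {
    let mut out = [0i32; 4];
    for i in 0..4 {
        out[i] = i32::from(a[base + i]) * i32::from(b[base + i]);
    }
    out
}

/// Hypothetical model of a pairwise add (an assumption, see the lead-in):
/// adjacent lanes of `x` fill the low half of the result,
/// adjacent lanes of `y` fill the high half.
fn add_pairwise(x: [i32; 4], y: [i32; 4]) -> [i32; 4] {
    [
        x[0].wrapping_add(x[1]),
        x[2].wrapping_add(x[3]),
        y[0].wrapping_add(y[1]),
        y[2].wrapping_add(y[3]),
    ]
}

/// The three-step sequence: low-half multiply, high-half multiply,
/// then a pairwise add of the two intermediate vectors.
fn dot_via_widening_mul(a: [i16; 8], b: [i16; 8]) -> [i32; 4] {
    let lo = widening_mul(a, b, 0);
    let hi = widening_mul(a, b, 4);
    add_pairwise(lo, hi)
}
```

Under these assumptions, `dot_via_widening_mul` and `dot_i16x8_s` agree for every input, including the wrapping case where all lanes of both operands are `i16::MIN`.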

github-actions · 2020-10-27T14:55:39Z

Subscribe to Label Action

cc @fitzgen, @peterhuene

This issue or pull request has been labeled: "cranelift", "cranelift:area:aarch64", "cranelift:area:peepmatic", "cranelift:meta", "cranelift:wasm", "fuzzing", "lightbeam", "wasmtime:api"

Thus the following users have been cc'd because of the following labels:

fitzgen: cranelift:area:peepmatic, fuzzing
peterhuene: wasmtime:api

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.


akirilov-arm · 2020-10-27T17:42:31Z

cranelift/codegen/src/isa/aarch64/inst/mod.rs

@@ -290,6 +290,10 @@ pub enum VecALUOp {
    Umlal,
    /// Zip vectors (primary) [meaning, high halves]
    Zip1,
+    /// Signed multiply long (low halves)


Just a comment (no need to take action) - it's probably time to introduce an Inst variant for widening binary operations, so that the handling of the high and the low halves can be streamlined, for example, but we can do this separately. We previously took a shortcut here because Umlal was the only odd man out.

cc @jgouly

julian-seward1

Yeah, that sounds like a nice cleanup for a followup. I'm curious though... when you say "handling of the high and the low halves can be streamlined", what did you have in mind, roughly?

akirilov-arm

Have a look at `Inst::VecMiscNarrow`, which organizes things similarly, but for narrowing operations. TBH I don't have an exact code change in mind (as I said, neither Joey nor I have done anything yet because there was no real need), but having a separate boolean parameter to specify the half is probably going to be the way to do it.

I dislike the ISA's approach (via the vector shape), but it has constraints (encoding space) that are not applicable to the Inst enum. Anyway, the Inst variants don't necessarily map 1:1 to machine instructions, so we can use that to improve ergonomics (e.g. we support bitwise operations on arbitrary vector shapes, not just 8-bit elements).
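
To make the suggestion in this thread concrete, here is a rough sketch of what a variant with a separate boolean for the half might look like. It is purely illustrative: `WideningOp`, `VecWidening` and `high_half` are hypothetical names, registers are plain numbers, and nothing here is taken from Cranelift's actual `Inst` enum.

```rust
/// Hypothetical operation kinds; not Cranelift's actual definitions.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum WideningOp {
    Smull,
    Umlal,
}

/// Hypothetical instruction form for widening binary operations.
/// Register operands are plain numbers to keep the sketch small.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
struct VecWidening {
    op: WideningOp,
    rd: u8,
    rn: u8,
    rm: u8,
    /// false: use the low halves of the sources; true: use the high halves.
    high_half: bool,
}

impl VecWidening {
    /// The half is chosen by the flag, so one `op` value covers both
    /// the base mnemonic and its "2" (high-half) form.
    fn mnemonic(&self) -> String {
        let base = match self.op {
            WideningOp::Smull => "smull",
            WideningOp::Umlal => "umlal",
        };
        if self.high_half {
            format!("{}2", base)
        } else {
            base.to_string()
        }
    }
}
```

With this shape, the low-half and high-half forms share one code path and differ only in `high_half`, instead of needing separate `Smull` and `Smull2` entries in an op enum.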

cranelift/codegen/src/isa/aarch64/lower_inst.rs

julian-seward1 · 2020-10-27T17:53:10Z

cc @yurydelendik

yurydelendik

Looks good, with proper type check/assertion.

akirilov-arm

LGTM.


julian-seward1 requested a review from yurydelendik October 27, 2020 14:29

akirilov-arm reviewed Oct 27, 2020


yurydelendik approved these changes Nov 2, 2020


julian-seward1 force-pushed the arm64-simd-dotmul branch 2 times, most recently from f007e8b to 4883749 Compare November 3, 2020 11:37

akirilov-arm approved these changes Nov 3, 2020


julian-seward1 force-pushed the arm64-simd-dotmul branch from 4883749 to c5374ec Compare November 3, 2020 12:40

julian-seward1 merged commit 5a5fb11 into bytecodealliance:main Nov 3, 2020

yurydelendik mentioned this pull request Dec 23, 2020

[SIMD][x86_64] Add encoding for PMADDWD #2530

Merged
