Better codegen for array repeats in Rust -- fixes #137 #141

msprotz · 2025-02-05T00:29:16Z

No description provided.

msprotz · 2025-02-05T18:03:13Z

@franziskuskiefer I would like you to try this out on mlkem and mldsa before merging, so I'll wait until we get libcrux back to green before merging this, I'm mildly concerned with regressions

please also note that in the case of nested arrays, there will still be a long series of assignments -- if this is a problem I left a TODO in the code that indicates what to tweak to improve that situation, too

msprotz · 2025-02-05T18:05:04Z

Note that this patch still improves things: previously,

  17   │ fn init2() -> [[u8; 32]; 32] {
  18   │     [[0u8; 32]; 32]
  19   │ }

would generate 1024 assignments -- now it only generates 32, but with 32 distinct buffers on the stack...

msprotz · 2025-02-05T18:34:19Z

ok looks like I need to revamp the array repeat desugaring phase

msprotz · 2025-02-05T21:26:11Z

As far as I can tell, this generates a rather pleasant diff on branch protz_refresh of libcrux -- see my latest commit there to see the effect of this PR.

@franziskuskiefer if the changes are agreeable to you, please go ahead and merge... thanks

franziskuskiefer

Thanks for tackling this!

This improves a bunch of cases.

But there's at least one it makes worse.

let mut tmp_stack = [[0i32; 263], [0i32; 263], [0i32; 263], [0i32; 263]];

(Maybe this should be different in Rust already, but that's a different question.)
used to generate

int32_t tmp_stack[4U][263U] = {{0U}};

Now it generates

int32_t repeat_expression[263U] = {0U};
int32_t tmp_stack[4U][263U] = {
    {(int32_t)0, ...},
    {(int32_t)0, ...},
    {(int32_t)0, ...},
    repeat_expression};

Which is not only worse, but won't compile.

msprotz · 2025-02-06T16:37:02Z

Ah good catch. I'll add this as a test-case, I didn't have it locally. Thank you

msprotz · 2025-02-07T22:51:06Z

I ended up revamping the handling of nested arrays and simplified pretty awful code. (I actually vividly remember writing this code in a rush and thinking I wasn't sure I was doing the right thing...)

You'll need FStarLang/karamel#530

The consequence should now be uniformly better code. For your specific example I do not yet generate a perfect nested initializer (i.e. int32_t tmp_stack[4U][263U] = { 0 }) but the code is pretty good. Let me know if you'd like me to push a little more on code quality.

For instance, if you have arrays nested three levels deep (i.e. arrays of arrays of arrays) there is an opportunity to introduce a for-loop in the code I just touched, rather than statically generating a slew of assignments. With the big cleanup, this should be pretty easy to do, so if this is something you need, let me know. Cheers.

msprotz · 2025-02-07T23:30:42Z

I was able to confirm that on my branch protz_refresh of mlkem, I see no regression (i.e. the code is still as improved as before), and this passes the tests locally.

franziskuskiefer · 2025-02-10T11:20:50Z

Thanks!
I'm seeing another issue though. Looks like there's a counter that's not counting anymore.

Here's the diff.

-  memcpy(ret[1U], copy_of_out1, (size_t)200U * sizeof(uint8_t));
-  memcpy(ret[2U], copy_of_out2, (size_t)200U * sizeof(uint8_t));
-  memcpy(ret[3U], uu____3, (size_t)200U * sizeof(uint8_t));
+  memcpy(ret[0U], copy_of_out1, (size_t)200U * sizeof(uint8_t));
+  memcpy(ret[0U], copy_of_out2, (size_t)200U * sizeof(uint8_t));
+  memcpy(ret[0U], uu____3, (size_t)200U * sizeof(uint8_t));
 }

The Rust code for this is

let mut out0 = [0u8; 200];
let mut out1 = [0u8; 200];
let mut out2 = [0u8; 200];
let mut out3 = [0u8; 200];
// ...
[out0, out1, out2, out3]

franziskuskiefer

Thanks. Looks like this is good to go now.

Better codegen for array repeats in Rust -- fixes #137

fffd6c7

msprotz enabled auto-merge February 5, 2025 00:29

msprotz added 2 commits February 4, 2025 16:29

Merge branch 'main' into protz_array_repeat

90f6898

A comment and some tweak

58126ae

msprotz disabled auto-merge February 5, 2025 18:02

msprotz added 4 commits February 5, 2025 13:19

Second attempt, with a better strategy

1ec9cf6

Refresh expected test output

fdb06c3

Upgrade ocamlformat and reformat

c56c8e9

Refresh more diff

f210b81

protz pushed a commit to cryspen/libcrux that referenced this pull request Feb 5, 2025

Refresh C code with AeneasVerif/eurydice#141

9f27913

franziskuskiefer self-requested a review February 6, 2025 07:07

franziskuskiefer requested changes Feb 6, 2025

View reviewed changes

Revamp the handling of nested arrays

79472b6

msprotz added 3 commits February 7, 2025 15:18

Forgot to recurse in one spot

80e90e3

Ooops

21983e4

nix

b532806

Forgot to increment an index

ce76803

franziskuskiefer approved these changes Feb 10, 2025

View reviewed changes

msprotz merged commit 60f543d into main Feb 10, 2025
3 of 4 checks passed

msprotz deleted the protz_array_repeat branch February 10, 2025 18:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better codegen for array repeats in Rust -- fixes #137 #141

Better codegen for array repeats in Rust -- fixes #137 #141

msprotz commented Feb 5, 2025

msprotz commented Feb 5, 2025

msprotz commented Feb 5, 2025

msprotz commented Feb 5, 2025

msprotz commented Feb 5, 2025

franziskuskiefer left a comment

msprotz commented Feb 6, 2025

msprotz commented Feb 7, 2025

msprotz commented Feb 7, 2025 •

edited

Loading

franziskuskiefer commented Feb 10, 2025

franziskuskiefer left a comment

Better codegen for array repeats in Rust -- fixes #137 #141

Better codegen for array repeats in Rust -- fixes #137 #141

Conversation

msprotz commented Feb 5, 2025

msprotz commented Feb 5, 2025

msprotz commented Feb 5, 2025

msprotz commented Feb 5, 2025

msprotz commented Feb 5, 2025

franziskuskiefer left a comment

Choose a reason for hiding this comment

msprotz commented Feb 6, 2025

msprotz commented Feb 7, 2025

msprotz commented Feb 7, 2025 • edited Loading

franziskuskiefer commented Feb 10, 2025

franziskuskiefer left a comment

Choose a reason for hiding this comment

msprotz commented Feb 7, 2025 •

edited

Loading