Implement `@depositBits` and `@extractBits` #18680

ominitay · 2024-01-25T13:49:16Z

Implements @depositBits and @extractBits, corresponding to pdep and pext from BMI2. The builtins are supported in the LLVM backend for any integer width, are supported in the x86 backend for integer widths below and including 64 bits, and are evaluated at compile-time when possible (also at any integer width). The functionality remains unimplemented in all other backends, and is out of the scope of this PR.

Apologies for how long it has taken me to find the time to get this merge-ready, but I'm happy for it to be merged now, and any further enhancements to be split out into further issues.

Continuation of #15285
Closes #14995

ominitay · 2024-01-25T23:42:50Z

All checks passed, happy for this to be merged if approved :)

ominitay · 2024-02-14T12:09:39Z

Rebased onto master.

ominitay · 2024-03-22T03:17:07Z

Converted to draft as there are a couple of changes I'm part-way through making to this PR. Should be ready for review in a week or two.

ominitay · 2024-03-29T19:29:36Z

Emulation on unsupported targets and large int types has been moved to compiler-rt. Yet to implement calls to this from other backends, but x86 backend should now pass behaviour tests. Before ready for review, I'll look at implementing testing of the various implementations of this against each other to each other with random integers, as we have 4 of them: the two compiler-rt methods (generic and bigint), the comptime bigint, and the native instruction call, so any latent bugs I could have missed should be detected. This is kind of building off of @matu3ba's suggestions: #15285 (comment)

This change implements depositBits and extractBits (equivalents of PDEP and PEXT) for Zig's bit ints. This change lays the groundwork for implementation of `@depositBits` and `@extractBits`. Tests have been added to check the behaviour of these two functions. The functions currently don't handle negative values (though negative values may be converted to twos complement externally), and aren't optimal in either memory or performance.

Implements std.math.big.int.Mutable.convertFromTwosComplement, to match convertToTwosComplement.

Incomplete: currently only implemented for 64-bit-or-smaller integers for x86(-64) in the LLVM backend.

Removes the requirement to copy and modify `mask`, removing the need to clone `mask` into a `Mutable` bigint.

Implements compiler-rt functions to emulate the PEXT and PDEP instructions from BMI2. These also implement the same functionality for arbitrarily-big integers. The existing emulation of these instructions has been removed from the LLVM backend, and replaced with calls to these compiler-rt functions. Some rework has been done in the backend to reduce code duplication.

Adds calls into compiler-rt in the x86 backend for depositBits and extractBits. This brings the x86 backend on-par with the LLVM backend, now fully supporting these builtins for all targets and integer sizes. Some refactoring has been applied to reduce code duplication.

Adds a test for u256 to provide some coverage for codegen of __pdep_bigint and __pext_bigint. Also stops skipping tests on the x86 backend.

Disables the two behaviour tests which are caused to fail on the x86_64 backend by ziglang#19498. Fixing the underlying issue is not within the scope of this pull request.

ominitay · 2024-04-17T00:57:29Z

Disabled the failing behaviour tests caused by the bug in the x86 backend. My rationale here is that it's not within the scope of this PR to work around, as a user needing to use this functionality could simply just not use the x86 backend.

andrewrk · 2024-06-08T20:10:23Z

draft status for > 30 days

ominitay force-pushed the pdeppext branch from 3c38534 to c22de91 Compare February 14, 2024 12:09

ominitay force-pushed the pdeppext branch from c22de91 to e58496a Compare March 15, 2024 21:13

ominitay marked this pull request as draft March 22, 2024 03:17

ominitay force-pushed the pdeppext branch from e58496a to 5b8a0aa Compare March 29, 2024 19:05

ominitay mentioned this pull request Mar 31, 2024

x86 backend: 128-bit integers returned in xmm0 registers by windows 64 function cause compiler crash #19498

Open

ominitay added 21 commits April 17, 2024 00:20

std.math.big.int: Conversion from 2's complement

32ff101

Implements std.math.big.int.Mutable.convertFromTwosComplement, to match convertToTwosComplement.

Write docs for @depositBits and @extractBits

a33d4f6

Implement @depositBits and @extractBits

9bd3bf7

Incomplete: currently only implemented for 64-bit-or-smaller integers for x86(-64) in the LLVM backend.

LLVM: Implement emulation for @depositBits

2de5fcc

LLVM: Implement emulation for @extractBits

a2850aa

std.math.big.int: Fix index out-of-bounds

9760841

Add behaviour tests for @depositBits and @extractBits

566a888

zig fmt

db280ce

Replace u6 with Log2Limb

9020b2f

big.int.depositBits/extractBits: Remove limbs_buffer

eecdf99

Removes the requirement to copy and modify `mask`, removing the need to clone `mask` into a `Mutable` bigint.

Disallow signed integer types for deposit/extract

13d4205

Actually use deposit/extract behaviour test

313d258

Enable langref tests for deposit and extract

5a42ecb

Allow use of comptime_int with deposit/extract

fc8eadb

Improve compile errors for negative values

9c14b26

update comments

4eff831

Bring branch up-to-date

69e893d

x86: Implement @depositBits and @extractBits

71f8db4

update deposit/extract to master

5f66df1

zig fmt

e0b4630

ominitay added 7 commits April 17, 2024 00:22

Don't compile tests for deposit/extract when unsupported

4bcaab9

Bring branch up-to-date with llvm backend changes

432e1cb

Update behaviour tests for deposit/extractBits

e80a4b2

Adds a test for u256 to provide some coverage for codegen of __pdep_bigint and __pext_bigint. Also stops skipping tests on the x86 backend.

Bring fork up-to-date with master

b87e549

Skip failing behaviour tests

726b436

Disables the two behaviour tests which are caused to fail on the x86_64 backend by ziglang#19498. Fixing the underlying issue is not within the scope of this pull request.

ominitay force-pushed the pdeppext branch from 10b7a74 to 726b436 Compare April 17, 2024 00:54

andrewrk closed this Jun 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `@depositBits` and `@extractBits` #18680

Implement `@depositBits` and `@extractBits` #18680

ominitay commented Jan 25, 2024

ominitay commented Jan 25, 2024

ominitay commented Feb 14, 2024

ominitay commented Mar 22, 2024

ominitay commented Mar 29, 2024

ominitay commented Apr 17, 2024

andrewrk commented Jun 8, 2024

Implement @depositBits and @extractBits #18680

Implement @depositBits and @extractBits #18680

Conversation

ominitay commented Jan 25, 2024

ominitay commented Jan 25, 2024

ominitay commented Feb 14, 2024

ominitay commented Mar 22, 2024

ominitay commented Mar 29, 2024

ominitay commented Apr 17, 2024

andrewrk commented Jun 8, 2024

Implement `@depositBits` and `@extractBits` #18680

Implement `@depositBits` and `@extractBits` #18680