Tracking issue for `cfg_target_feature` #29717

aturon · 2015-11-09T16:54:48Z

Added as part of the SIMD work, supports feature detection based on e.g. target processor. This issue tracks stabilization.

Relevant issues and bugs

repr(simd) is unsound #44367 -- repr(simd) is unsound

alexcrichton · 2015-11-09T18:15:44Z

The only known issue with this I'm aware of is that it doesn't doesn't use the same mechanism in LLVM to query whether a feature is activated or not. We already maintain our own list of what features are allowed here (e.g. we're not just exposing what LLVM accepts) and then we currently map that down to an LLVM feature name. When testing whether the LLVM feature is activated, however, we should robustly query LLVM via its own internal mechanisms rather than explicitly checking ourselves.

gnzlbg · 2015-12-18T16:26:20Z

I opened a separate issue #30462 for the missing cfg_target_feature for bit manipulation instruction sets.

alexcrichton · 2016-02-24T16:59:37Z

cfg target_feature -- for github issue searchability as well

also a new forum link -- https://internals.rust-lang.org/t/comprehensive-list-of-desired-cfg-target-feature-s/3201

gnzlbg · 2016-06-21T09:59:40Z

So this is a list of the current target features supported by llc -mattr=help:

16bit-mode             - 16-bit mode (i8086).
  32bit-mode             - 32-bit mode (80386).
  3dnow                  - Enable 3DNow! instructions.
  3dnowa                 - Enable 3DNow! Athlon instructions.
  64bit                  - Support 64-bit instructions.
  64bit-mode             - 64-bit mode (x86_64).
  adx                    - Support ADX instructions.
  aes                    - Enable AES instructions.
  atom                   - Intel Atom processors.
  avx                    - Enable AVX instructions.
  avx2                   - Enable AVX2 instructions.
  avx512bw               - Enable AVX-512 Byte and Word Instructions.
  avx512cd               - Enable AVX-512 Conflict Detection Instructions.
  avx512dq               - Enable AVX-512 Doubleword and Quadword Instructions.
  avx512er               - Enable AVX-512 Exponential and Reciprocal Instructions.
  avx512f                - Enable AVX-512 instructions.
  avx512ifma             - Enable AVX-512 Integer Fused Multiple-Add.
  avx512pf               - Enable AVX-512 PreFetch Instructions.
  avx512vbmi             - Enable AVX-512 Vector Bit Manipulation Instructions.
  avx512vl               - Enable AVX-512 Vector Length eXtensions.
  bmi                    - Support BMI instructions.
  bmi2                   - Support BMI2 instructions.
  call-reg-indirect      - Call register indirect.
  clflushopt             - Flush A Cache Line Optimized.
  clwb                   - Cache Line Write Back.
  cmov                   - Enable conditional move instructions.
  cx16                   - 64-bit with cmpxchg16b.
  f16c                   - Support 16-bit floating point conversion instructions.
  fast-partial-ymm-write - Partial writes to YMM registers are fast.
  fma                    - Enable three-operand fused multiple-add.
  fma4                   - Enable four-operand fused multiple-add.
  fsgsbase               - Support FS/GS Base instructions.
  fxsr                   - Support fxsave/fxrestore instructions.
  hle                    - Support HLE.
  idivl-to-divb          - Use 8-bit divide for positive values less than 256.
  idivq-to-divw          - Use 16-bit divide for positive values less than 65536.
  invpcid                - Invalidate Process-Context Identifier.
  lea-sp                 - Use LEA for adjusting the stack pointer.
  lea-uses-ag            - LEA instruction needs inputs at AG stage.
  lzcnt                  - Support LZCNT instruction.
 mmx                    - Enable MMX instructions.
  movbe                  - Support MOVBE instruction.
  mpx                    - Support MPX instructions.
  mwaitx                 - Enable MONITORX/MWAITX timer functionality.
  pad-short-functions    - Pad short functions.
  pclmul                 - Enable packed carry-less multiplication instructions.
  pcommit                - Enable Persistent Commit.
  pku                    - Enable protection keys.
  popcnt                 - Support POPCNT instruction.
  prefetchwt1            - Prefetch with Intent to Write and T1 Hint.
  prfchw                 - Support PRFCHW instructions.
  rdrnd                  - Support RDRAND instruction.
  rdseed                 - Support RDSEED instruction.
  rtm                    - Support RTM instructions.
  sahf                   - Support LAHF and SAHF instructions.
  sgx                    - Enable Software Guard Extensions.
  sha                    - Enable SHA instructions.
  slm                    - Intel Silvermont processors.
  slow-bt-mem            - Bit testing of memory is slow.
  slow-incdec            - INC and DEC instructions are slower than ADD and SUB.
  slow-lea               - LEA instruction with certain arguments is slow.
  slow-shld              - SHLD instruction is slow.
  slow-unaligned-mem-16  - Slow unaligned 16-byte memory access.
  slow-unaligned-mem-32  - Slow unaligned 32-byte memory access.
  smap                   - Supervisor Mode Access Protection.
  soft-float             - Use software floating point features..
  sse                    - Enable SSE instructions.
  sse-unaligned-mem      - Allow unaligned memory operands with SSE instructions.
  sse2                   - Enable SSE2 instructions.
  sse3                   - Enable SSE3 instructions.
  sse4.1                 - Enable SSE 4.1 instructions.
  sse4.2                 - Enable SSE 4.2 instructions.
  sse4a                  - Support SSE 4a instructions.
  ssse3                  - Enable SSSE3 instructions.
  tbm                    - Enable TBM instructions.
  vmfunc                 - VM Functions.
  x87                    - Enable X87 float instructions.
  xop                    - Enable XOP instructions.
  xsave                  - Support xsave instructions.
  xsavec                 - Support xsavec instructions.
  xsaveopt               - Support xsaveopt instructions.
  xsaves                 - Support xsaves instructions.

Would a patch that implements macros for all of these need to go through the RFC process?

This commit adds support for detecting all of llc's target features for the x86/x86_64 architectures. Closes rust-lang#30462. Helps with rust-lang#29717 and rust-lang#34382.

aturon · 2016-06-21T17:39:04Z

cc @BurntSushi @eddyb @nikomatsakis -- I believe all of you were recently involved in discussion around SIMD stabilization.

bluss · 2016-10-18T13:02:49Z

This would be useful to use in matrixmultiply, ndarray even before simd stabilization. I get better results if I can pick the particular stable rust-implemented loop that generates best code given the particular available vector types / available target features.

briansmith · 2016-12-15T21:46:51Z

For people interested in helping with this:

There is a allowlist of features that are exposed by this mechanism, which is very limited. See https://github.com/rust-lang/rust/blob/master/src/librustc_driver/target_features.rs#L23-L28, in particular the variables ARM_WHITELIST and X86_WHITELIST.

I found #34412 which seems to be a good example of a PR that expands the set of features that can be used with #[cfg(target_feature=)]. It looks straightforward to create more PRs that expose more features, and I encourage people to do so.

briansmith · 2016-12-15T21:49:11Z

When testing whether the LLVM feature is activated, however, we should robustly query LLVM via its own internal mechanisms rather than explicitly checking ourselves.

Is this blocking stabilization of this feature? And, if so, is this the only thing blocking stabilization, or are there other things?

briansmith · 2016-12-15T21:59:58Z

When testing whether the LLVM feature is activated, however, we should robustly query LLVM via its own internal mechanisms rather than explicitly checking ourselves.

Is this blocking stabilization of this feature? And, if so, is this the only thing blocking stabilization, or are there other things?

To answer my own question: IMO, the above concern doesn't need to block this feature from being stabilized. In my case, I don't care whether LLVM has the CPU features enabled in its code generation because I'm not using LLVM-based code gen for my code that uses SIMD and stuff.

The above concern does need to block the stabilization of intrinsics and other things exposed from libcore/libstd that depend on #[cfg(target-feature)], but that's separate from exposing #[cfg(target-feature)] itself, and that should happen in the tracking issues for those features, not this one.

So, IMO, this seems like it this is good to go.

briansmith · 2016-12-15T22:22:52Z

One clarification might be in order: When code uses this feature, can it assume that every feature name will be globally unique, or should it assume that feature names might be reused across different architectures. For example, would a feature "+crypto" imply AAarch32 or AAarch64, or is it possible that, say, a MIPS target could have a "+crypto" feature? In some instances this would affect whether one writes this:

#[cfg(target_feature="crypto")]

or this:

#[cfg(all(target_feature="crypto", any(target="aarch64", target="aarch32")))]

alexcrichton · 2016-12-16T00:22:10Z

@briansmith AFAIK this issue is not block on anything. The concern you linked to about querying LLVM directly was actually resolved back in April!

Also AFAIK we haven't discussed issues of namespaces or implication of features across architectures.

briansmith · 2016-12-16T06:29:21Z

@briansmith AFAIK this issue is not block on anything. The concern you linked to about querying LLVM directly was actually resolved back in April!

Awesome!

Also AFAIK we haven't discussed issues of namespaces or implication of features across architectures.

The RFC says "This RFC proposes a target_feature cfg, that would be set to the features of the architecture that are known to be supported by the exact target e.g." To me, that implies that some feature names may overlap between architectures. This means that one probably should use target_arch or similar in addition to target_feature, although the example in the RFC doesn't do that.

I also noticed that the documentation for each target feature is missing. In particular, there doesn't seem to be any documentation describing what each value of target_feature means, nor is there documentation indicating the implications. For example, the RFC says that +sse42 implies the enabling of earlier versions of SSE, and these implied enablings should be documented.

For example, in the case of Aarch32/Aarch64 crypto features, ARM has defined a "crypto" feature that enables AES, SHA-1, and SHA-2 instructions. However, the CPU actually reports whether each feature is enabled/disabled separately, so in theory there could be individual sub-features of "crypto" in the future. Also, in clang "+crypto" enables floating point and NEON, according to its documentation.

If it is expected that the Rust target features will stay in sync with LLVM's, then we could just have documentation for target_feature that points to the external LLVM documentation. That's what I'd recommend for now.

briansmith · 2016-12-16T06:41:54Z

Note in particular the GCC documentation says "feature crypto implies simd, which implies fp. Conversely, nofp implies nosimd, which implies nocrypto." (emphasis mine). Does this mean that in Rust -fp should imply -simd and -simd should imply -crypto? What about things like `+crypto -simd"?

alexcrichton · 2016-12-20T01:24:14Z

@briansmith yeah makes sense to me that we'd want to make sure we document all supported target features in one way or another, including hierarchies and implications amongst them.

I do believe we support -C target-feature=-foo, although I don't know if it also has those sorts of implication changes (we just rely on LLVM for now). Would be good to find out!

bluss · 2016-12-20T10:10:56Z

What's the relation to cargo's CARGO_CFG_TARGET_FEATURE that it gives to build scripts? This env var is possible to use from Rust 1.14 (from the cargo version shipped with it) and the project can set its own cfg vars depending on which features are enabled.

alexcrichton · 2016-12-20T17:30:05Z

Cargo gained a feature recently to ferry along everything from rustc --print cfg over to build scripts, which is where CARGO_CFG_TARGET_FEATURE comes from. The --print cfg option is stable as well and is basically orthogonal to this feature itself.

The #![feature(cfg_target_feature)] in this issue is specifically related to #[cfg(target_feature = "...")] annotations. Note, though, that many of those target features (e.g. sse, etc), are only set on nightly. This means that --print cfg on stable doesn't print those features, but it does on nightly. So this issue is also pseudo sort-of for all those names as well, and allowing them to be stable.

In other words, @briansmith's points about documentation, relationships, names, what they actually mean, etc, I believe are also all relevant for this stabilization issue.

bluss · 2016-12-20T17:36:57Z

Ok that's important information. I don't like that stable prints something different, but it's important to know.

Very late clarify (March 2017): Normally, on nightly we opt in to nightly features (such as printing more cfg's), so it's surprising when it is a silently enabled stable/nightly difference.

alexcrichton · 2017-03-20T20:29:48Z

I'd like to hopefully make progress on this issue as the real issue I'd like to make progress on, #37406, I have now realized is blocked on this report.

As of today we have solved my original concern by querying LLVM directly about this information instead of attempting to infer it ourselves. However @briansmith's concern remains about namespacing across targets and such.

I would also like to show that practically today all our features match LLVM exactly (we don't change the names). This is intentional for these features but we are not tying ourselves to LLVM's naming, we reserve the right to change the name and add features (such as crt-static). I will also point out, though, that adding more features is very easy to do and is highly likely to skip stabilization processes and such.

As a result I would like to propose the following:

Stabilize -C target-feature
Stabilize #[cfg(target_feature)]
Stabilize --print cfg printing the target features
Do not stabilize any target feature names

That is, I'd like to stabilize this mechanism for controlling features to targets, and then leave stabilization of the names here to the actual relevant subsystems. For example I would expect SIMD stabilization to be coupled with the SIMD attributes, and crt-static should have its own bikeshed for the name.

This does force our hand I think on the "global namespace" argument where it would be mega confusing for x86_64 Linux to have a foo feature that's different from the arm Windows foo feature. This may mean that some features need to be renamed appropriately to prefix the relevant information. Other features, though, like crt-static apply to all targets equally.

Along those lines, I will...

@rfcbot fcp merge

rfcbot · 2017-03-20T20:29:49Z

alexcrichton · 2018-01-17T18:05:40Z

Yeah as pointed out by @gnzlbg the original motivation for stabilizing this more quickly evaporated which caused this to sit for awhile. Work has started on the implementation and the next outstanding piece I believe is gating each feature name behind a separate feature gate.

This is now pretty tightly coupled with simd stabilization (without many other motivational factors) so I wouldn't expect this to stabilize ahead or long after SIMD (but rather at the same time). SIMD's still farther out, though.

raphaelcohn · 2018-02-08T15:20:39Z

@gnzlbg

Counter-intuitively, specifying -C target-feature=rdrnd on the rustc command line and using #[cfg(target_feature = "rdrnd")] does not work - the #[cfg()] behaves as if -C target-feature=rdrnd wasn't set. However, setting a target feature within a target-specification.json file does work.

Also, compiling against a native CPU with this feature available also doesn't cause #[cfg(target_feature = "rndrd")] to be true.

Is this intentional?

BurntSushi · 2018-02-08T15:24:11Z

@raphaelcohn I believe you need -C target-feature=+rdrnd.

gnzlbg · 2018-02-08T16:00:30Z

@raphaelcohn I believe you need -C target-feature=+rdrnd.

If -C target-feature=rdrnd doesn't produce an error, it should. @raphaelcohn mind filling a rust-lang/rust issue ?

raphaelcohn · 2018-02-08T16:30:58Z

I'll double-check I transcribed that correctly. However, if it is supposed to +rdrnd, then it'd be nice if the same syntax was also used in #[target_feature(enable)], too (it seems it was before enable came along).

raphaelcohn · 2018-02-08T16:34:40Z

Right, target-feature=rdrnd doesn't error, target-feature=+rdrnd doesn't error, but does match for #[cfg(target_feature = "rdrnd")]. So a bug, then?

gnzlbg · 2018-02-08T16:40:24Z

@raphaelcohn the RFC says

There are currently two ways of passing target feature information to rustc's code generation backend on stable Rust.

-C --target-feature=+/-backend_target_feature_name: where +/- add/remove features from the default feature set of the platform for the whole crate.

-C --target-cpu=backend_cpu_name, which changes the default feature set of the crate to be that of all features enabled for backend_cpu_name.

These two options are available on stable Rust and have been defacto stabilized. Their semantics are LLVM specific and depend on what LLVM actually does with the features.

This RFC proposes to keep these options "as is", and add one new compiler option, --enable-features="feature0,feature1,...", (the analogous --disable-features is discussed in the "Future Extensions" section) that supports only stabilized target features.

That is,

then it'd be nice if the same syntax was also used in #[target_feature(enable)], too (it seems it was before enable came along).

per the rfc, the analogous syntax is --enable-features="...", and might not be implemented yet.

I think that target-feature=rdrnd should error.

alexcrichton · 2018-04-04T19:46:32Z

#48556 has entered FCP which I believe will implicitly stabilize this attribute

This commit stabilizes the SIMD in Rust for the x86/x86_64 platforms. Notably this commit is stabilizing: * The `std::arch::{x86, x86_64}` modules and the intrinsics contained inside. * The `is_x86_feature_detected!` macro in the standard library * The `#[target_feature(enable = "...")]` attribute * The `#[cfg(target_feature = "...")]` matcher Stabilization of the module and intrinsics were primarily done in rust-lang/stdarch#414 and the two attribute stabilizations are done in this commit. The standard library is also tweaked a bit with the new way that stdsimd is integrated. Note that other architectures like `std::arch::arm` are not stabilized as part of this commit, they will likely stabilize in the future after they've been implemented and fleshed out. Similarly the `std::simd` module is also not being stabilized in this commit, only `std::arch`. Finally, nothing related to `__m64` is stabilized in this commit either (MMX), only SSE and up types and intrinsics are stabilized. Closes rust-lang#29717 Closes rust-lang#44839 Closes rust-lang#48556

Stabilize x86/x86_64 SIMD This commit stabilizes the SIMD in Rust for the x86/x86_64 platforms. Notably this commit is stabilizing: * The `std::arch::{x86, x86_64}` modules and the intrinsics contained inside. * The `is_x86_feature_detected!` macro in the standard library * The `#[target_feature(enable = "...")]` attribute * The `#[cfg(target_feature = "...")]` matcher Stabilization of the module and intrinsics were primarily done in rust-lang/stdarch#414 and the two attribute stabilizations are done in this commit. The standard library is also tweaked a bit with the new way that stdsimd is integrated. Note that other architectures like `std::arch::arm` are not stabilized as part of this commit, they will likely stabilize in the future after they've been implemented and fleshed out. Similarly the `std::simd` module is also not being stabilized in this commit, only `std::arch`. Finally, nothing related to `__m64` is stabilized in this commit either (MMX), only SSE and up types and intrinsics are stabilized. Closes #29717 Closes #44839 Closes #48556

aturon added T-lang Relevant to the language team, which will review and decide on the PR/issue. B-unstable Blocker: Implemented in the nightly compiler and unstable. labels Nov 9, 2015

gnzlbg mentioned this issue Dec 18, 2015

target_features for bit manipulation instruction sets #30462

Closed

bluss mentioned this issue Jan 9, 2016

Use std::arch for SIMD and target_feature rust-ndarray/ndarray#46

Open

pmarcelll mentioned this issue Jun 14, 2016

make check is failing with "Illegal instruction" servo/euclid#48

Closed

This was referenced Jun 21, 2016

Support detecting all llc's x86 target features gnzlbg/rust#1

Closed

Support detecting all llc's x86 target features #34397

Closed

nrc added the T-tools label Aug 17, 2016

briansmith mentioned this issue Dec 15, 2016

Better use rust toolchain features for feature detection cfg(target_feature=) briansmith/ring#381

Open

alexcrichton mentioned this issue Mar 20, 2017

Tracking issue for customizing CRT linkage #37406

Closed

3 tasks

cramertj removed the final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. label Feb 26, 2018

alexcrichton mentioned this issue Feb 26, 2018

Tracking issue for stable SIMD in Rust #48556

Closed

3 tasks

alexcrichton mentioned this issue Apr 4, 2018

Stabilize x86/x86_64 SIMD #49664

Merged

bors closed this as completed in #49664 Apr 17, 2018

gnzlbg mentioned this issue Dec 10, 2018

Split up aarch64's "crypto" feature? rust-lang/stdarch#614

Closed

ehuss mentioned this issue Mar 18, 2019

Document target_feature and cfg_target_feature. rust-lang/reference#545

Merged

hudson-ayers mentioned this issue Apr 14, 2021

Tracking issue for RFC 2045: improving #[target_feature] #44839

Open

22 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking issue for `cfg_target_feature` #29717

Tracking issue for `cfg_target_feature` #29717

aturon commented Nov 9, 2015 •

edited by nikomatsakis

Loading

alexcrichton commented Nov 9, 2015

gnzlbg commented Dec 18, 2015

alexcrichton commented Feb 24, 2016

gnzlbg commented Jun 21, 2016

aturon commented Jun 21, 2016

bluss commented Oct 18, 2016 •

edited

Loading

briansmith commented Dec 15, 2016 •

edited

Loading

briansmith commented Dec 15, 2016

briansmith commented Dec 15, 2016 •

edited

Loading

briansmith commented Dec 15, 2016

alexcrichton commented Dec 16, 2016

briansmith commented Dec 16, 2016

briansmith commented Dec 16, 2016

alexcrichton commented Dec 20, 2016

bluss commented Dec 20, 2016

alexcrichton commented Dec 20, 2016

bluss commented Dec 20, 2016 •

edited

Loading

alexcrichton commented Mar 20, 2017

rfcbot commented Mar 20, 2017 •

edited by aturon

Loading

alexcrichton commented Jan 17, 2018

raphaelcohn commented Feb 8, 2018

BurntSushi commented Feb 8, 2018

gnzlbg commented Feb 8, 2018

raphaelcohn commented Feb 8, 2018

raphaelcohn commented Feb 8, 2018 •

edited

Loading

gnzlbg commented Feb 8, 2018 •

edited

Loading

alexcrichton commented Apr 4, 2018

Tracking issue for cfg_target_feature #29717

Tracking issue for cfg_target_feature #29717

Comments

aturon commented Nov 9, 2015 • edited by nikomatsakis Loading

Relevant issues and bugs

alexcrichton commented Nov 9, 2015

gnzlbg commented Dec 18, 2015

alexcrichton commented Feb 24, 2016

gnzlbg commented Jun 21, 2016

aturon commented Jun 21, 2016

bluss commented Oct 18, 2016 • edited Loading

briansmith commented Dec 15, 2016 • edited Loading

briansmith commented Dec 15, 2016

briansmith commented Dec 15, 2016 • edited Loading

briansmith commented Dec 15, 2016

alexcrichton commented Dec 16, 2016

briansmith commented Dec 16, 2016

briansmith commented Dec 16, 2016

alexcrichton commented Dec 20, 2016

bluss commented Dec 20, 2016

alexcrichton commented Dec 20, 2016

bluss commented Dec 20, 2016 • edited Loading

alexcrichton commented Mar 20, 2017

rfcbot commented Mar 20, 2017 • edited by aturon Loading

alexcrichton commented Jan 17, 2018

raphaelcohn commented Feb 8, 2018

BurntSushi commented Feb 8, 2018

gnzlbg commented Feb 8, 2018

raphaelcohn commented Feb 8, 2018

raphaelcohn commented Feb 8, 2018 • edited Loading

gnzlbg commented Feb 8, 2018 • edited Loading

alexcrichton commented Apr 4, 2018

Tracking issue for `cfg_target_feature` #29717

Tracking issue for `cfg_target_feature` #29717

aturon commented Nov 9, 2015 •

edited by nikomatsakis

Loading

bluss commented Oct 18, 2016 •

edited

Loading

briansmith commented Dec 15, 2016 •

edited

Loading

briansmith commented Dec 15, 2016 •

edited

Loading

bluss commented Dec 20, 2016 •

edited

Loading

rfcbot commented Mar 20, 2017 •

edited by aturon

Loading

raphaelcohn commented Feb 8, 2018 •

edited

Loading

gnzlbg commented Feb 8, 2018 •

edited

Loading