Add BFloat16 #98643

huoyaoyuan · 2024-02-19T05:08:56Z

Delegating all functional members to float, since the upcast is trivial. The only logic is rounding from double.
Tests are borrowed from Half.

dotnet-issue-labeler · 2024-02-19T05:09:03Z

Note regarding the new-api-needs-documentation label:

This serves as a reminder for when your PR is modifying a ref *.cs file and adding/modifying public APIs, please make sure the API implementation in the src *.cs file is documented with triple slash comments, so the PR reviewers can sign off that change.

ghost · 2024-02-19T05:09:08Z

Tagging subscribers to this area: @dotnet/area-system-numerics
See info in area-owners.md if you want to be subscribed.

Issue Details

Closes #96295.

Delegating all functional members to float, since the upcast is trivial. The only logic is rounding from double.
Tests are borrowed from Half.

Author:	huoyaoyuan
Assignees:	-
Labels:	`area-System.Numerics`, `new-api-needs-documentation`
Milestone:	-

src/libraries/System.Private.CoreLib/src/System/Numerics/BFloat16.cs

tannergooding · 2024-06-03T19:56:10Z

@huoyaoyuan, I believe this should be unblocked now

…LP rounding.

huoyaoyuan · 2024-06-10T12:57:04Z

src/libraries/System.Private.CoreLib/src/System/Numerics/BFloat16.cs

+            // Exponent displacement #1
+            const ulong Exponent942 = 0x3ae0_0000_0000_0000u;


/cc @MineCake147E I figured out the value of this magic number for double->BFloat16 by debugging, but can't give an expression to calculate it. Do you have any information around this?

Exponent45 reflects the difference of the number of fraction bits. bfloat16 has 7 fraction bits, while double has 52. So, 52 - 7 = 45.
Exponent942 is a little bit complicated.
We want 1.0 to be converted to 1.0(bf16).

The line:

runtime/src/libraries/System.Private.CoreLib/src/System/Numerics/BFloat16.cs

Line 490 in 14b0d85

value += BitConverter.UInt64BitsToDouble(exponentOffset0);

transforms
0b0_011_1111_1111_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000 into 0b0_100_0010_1100_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000_1000_0000.
If we skip subtracting Exponent942, the ulong newExponent = bitValue >> 45; will be 0b0010_0001_0110_0000_0000
By adding the value before shifting, we get 0b010000101100000000000000000000000000000000000010_0_001_0110_1_000_0000
The internal representation of 1.0(bf16) is 0b0_011_1111_1_000_0000.
0b010000101100000000000000000000000000000000000010_0_001_0110_1_000_0000 - 0b0_011_1111_1_000_0000 = 0b010000101100000000000000000000000000000000000_0011_1010_1110_0000_000
And we shift this value 45 bits left, we get 0b0011_1010_1110_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000_0000, which is 0x3AE0_0000_0000_0000.
In other words,
${ExponentDisplacement_1} = E_{source}(1.0) - E_{result}(1.0) + (FractionBits_{source} - FractionBits_{result}) + 1$
For single -> half, $127 - 15 + (23 - 10) + 1 = 113 + 13 = 126$.
For double -> bfloat16, $1023 - 127 + (52 - 7) + 1 = 897 + 45 = 942$.
We can test this formula by applying this to double -> single, $1023 - 127 + (52 - 23) + 1 = 897 + 29 = 926$.

const ulong SingleBiasedExponentMask = double.BiasedExponentMask; const ulong Exponent926 = 0x39e0_0000_0000_0000u; const ulong Exponent29 = 0x01D0_0000_0000_0000u; var q = BitConverter.DoubleToUInt64Bits(Math.PI + BitConverter.UInt64BitsToDouble(Exponent29 + (BitConverter.DoubleToUInt64Bits(Math.PI) & SingleBiasedExponentMask))) - Exponent926; BitConverter.UInt32BitsToSingle((uint)(q + (q >> 29)))

And we get 3.1415927 which matches (float)Math.PI.

I hope it helps.

huoyaoyuan · 2024-06-10T16:55:48Z

Now every test is passing locally. Marking this as ready for review.

huoyaoyuan added 10 commits February 18, 2024 22:56

Add api for BFloat16

589afe0

Creating

312d051

Equals and GetHashCode

5e1c981

Comparison

1fb4765

Constants and comment

fc05d3b

Xml doc

152fe99

Using rounding for cast

25a16e7

Ref source

50d90aa

Simple tests

559f2e0

Conversion tests

b24839c

dotnet-issue-labeler bot added area-System.Numerics new-api-needs-documentation labels Feb 19, 2024

huoyaoyuan added the community-contribution Indicates that the PR has been added by a community member label Feb 19, 2024

hez2010 reviewed Feb 19, 2024

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/Numerics/BFloat16.cs Show resolved Hide resolved

Stripping sign is redundant

8284526

This was referenced Feb 19, 2024

Failed USB connection via port 54050, error 61, in tvOS arm64 Release AllSubsets_Mono #82637

Open

[browser][MT] Assert failed: Cannot find Promise for JSHandle -2 #98406

Closed

huoyaoyuan added 2 commits February 19, 2024 18:25

Fix test copied from Half

8e32e71

Fix conversion test cases

4bd266e

huoyaoyuan force-pushed the BFloat16 branch from 03cca33 to 4bd266e Compare February 19, 2024 10:34

build-analysis bot mentioned this pull request Feb 19, 2024

[mt][browser] HttpClient_CancelInDifferentThread failing with operation cancelled #98101

Open

huoyaoyuan added 7 commits February 21, 2024 21:42

Constants and well-known values

6df00e6

Categorizing methods

ff295fd

Reorder conversion members

09af2b2

Operators batch 1

1a8f0ad

Operators batch 2

c9fc867

TryConvert

e9fc0f8

Operators batch 3

c967aa5

dotnet-policy-service bot closed this Jun 3, 2024

stephentoub reopened this Jun 3, 2024

Merge branch 'main'

25b7684

This was referenced Jun 4, 2024

NativeAOT legs timing out in CI #102239

Closed

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

huoyaoyuan added 6 commits June 9, 2024 21:44

Merge branch 'main' into BFloat16

832651e

Fix test case

a07fe96

Add double conversion test

f9c35d3

Parse tests

4059b66

Formatting tests

4b4d1a5

RoundTripping tests

6ed52f5

Port float->Half conversion algorithm to double->BFloat16 to handle U…

14b0d85

…LP rounding.

huoyaoyuan commented Jun 10, 2024

View reviewed changes

Port function tests from Half

ea1dd5f

Convert the precesion of test cases.

dfd49c8

huoyaoyuan marked this pull request as ready for review June 10, 2024 16:55

hez2010 mentioned this pull request Jun 25, 2024

Support for AI floats BF16, FP8E4M3, FP8E5M2 m4rs-mt/ILGPU#1221

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add BFloat16 #98643

Add BFloat16 #98643

huoyaoyuan commented Feb 19, 2024

dotnet-issue-labeler bot commented Feb 19, 2024

ghost commented Feb 19, 2024

tannergooding commented Jun 3, 2024

huoyaoyuan Jun 10, 2024

MineCake147E Jun 10, 2024 •

edited

Loading

huoyaoyuan commented Jun 10, 2024

		// Exponent displacement #1
		const ulong Exponent942 = 0x3ae0_0000_0000_0000u;

Add BFloat16 #98643

Are you sure you want to change the base?

Add BFloat16 #98643

Conversation

huoyaoyuan commented Feb 19, 2024

dotnet-issue-labeler bot commented Feb 19, 2024

ghost commented Feb 19, 2024

tannergooding commented Jun 3, 2024

huoyaoyuan Jun 10, 2024

Choose a reason for hiding this comment

MineCake147E Jun 10, 2024 • edited Loading

Choose a reason for hiding this comment

huoyaoyuan commented Jun 10, 2024

MineCake147E Jun 10, 2024 •

edited

Loading