llvm: Add option to select 32bit floating point representation #2400

jvesely · 2022-04-29T04:30:41Z

Add --fp-precision pytest cmdline option, accepted values are 'fp64' and 'fp32', 'fp64' is the default.
Consolidate tests of compiler builtin helpers.
Refactor implementation of hyperbolic functions (tanh, coth, csch) to handle corner cases that result in the infinite value of exponential.
Adjust tests and add fp32 results for cases where the lower precision and rounding give different results.

Fixes parallel execution when using fp32 type. Signed-off-by: Jan Vesely <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

Predator-prey runs both fp32 and fp64 variant irrespective of the --fp-precision setting. Signed-off-by: Jan Vesely <[email protected]>

Add tests. Conversion from double -> half is not accurate, because of an issue with a call to runtime library [0]. [0] numba/llvmlite#834 Signed-off-by: Jan Vesely <[email protected]>

llvmlite does not handle special FP string values for other than fp64 precision [0] [0] numba/llvmlite#833 Signed-off-by: Jan Vesely <[email protected]>

Make sure we always have fp64 variant available. Add type specific tests. Signed-off-by: Jan Vesely <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

…ding fp64 Signed-off-by: Jan Vesely <[email protected]>

Bump vector length to 1500 to span more than one 4kB page. Signed-off-by: Jan Vesely <[email protected]>

Add const dimension variant to every tested op. Convert arguments to the right fp type. Signed-off-by: Jan Vesely <[email protected]>

This is tested in test_builtins_matrix.py const dimensions tests. Signed-off-by: Jan Vesely <[email protected]>

… again Signed-off-by: Jan Vesely <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

This should switch to using pytest.mark if it gets more widespread Signed-off-by: Jan Vesely <[email protected]>

The original formula would return NaN if the input was large enough so that exp(2x) == Inf and would need an extra condition check. The new formula handles large inputs without extra checks. Add tests with extreme value to builtins tests. Signed-off-by: Jan Vesely <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

…le in fp32 There are no special input values in the tests might as well make them representable in 32b floating point. Fixes fp32 test failures in SUB operation tests. Signed-off-by: Jan Vesely <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

The original formula would return NaN if the input was large enough so that exp(2x) == Inf and would need an extra condition check. The new formula handles large inputs without extra checks. Add tests with extreme values to builtins tests. Signed-off-by: Jan Vesely <[email protected]>

The original formula would return NaN if the input was large enough so that exp(2x) == Inf and would need an extra condition check. The new formula handles large inputs without extra checks. Add tests with extreme value to builtins tests. Use stricter tolerance for fp64 tests. Signed-off-by: Jan Vesely <[email protected]>

Move expected values out of param list. Integrate test names into param list. Re-enable small drift rate test, override expected results on windows and mac. Enable small drift rate test in compiled mode, override expected results. Add more small drift rate tests. Signed-off-by: Jan Vesely <[email protected]>

The results differ from fp64 because of rounding and use of Philox PRNG. Signed-off-by: Jan Vesely <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

…ision Signed-off-by: Jan Vesely <[email protected]>

These are now high/low enough to work with fp32 Philox random sequence. Make sure initializer values are different from the test values. Signed-off-by: Jan Vesely <[email protected]>

Drop the hack using inf input values to select PRNG. Signed-off-by: Jan Vesely <[email protected]>

github-actions · 2022-04-29T04:43:11Z

This PR causes the following changes to the html docs (ubuntu-latest-3.7-x64):

No differences!

...

See CI logs for the full diff.

github-actions · 2022-04-29T04:54:11Z

This PR causes the following changes to the html docs (ubuntu-latest-3.7-x64):

No differences!

...

See CI logs for the full diff.

PTX results for fp32 DDA function with neg input differ due to operation accuracy/rounding. Signed-off-by: Jan Vesely <[email protected]>

github-actions · 2022-04-29T05:41:23Z

This PR causes the following changes to the html docs (ubuntu-latest-3.7-x64):

No differences!

...

See CI logs for the full diff.

jvesely added 29 commits April 14, 2022 10:42

llvm/execution: Add support for structs to _element_dtype

9030213

Fixes parallel execution when using fp32 type. Signed-off-by: Jan Vesely <[email protected]>

tests: Add cmdline option to select compiler fp precision

ab152aa

Signed-off-by: Jan Vesely <[email protected]>

tests/models/predator-prey: Add fp32 variant

89dcaab

Predator-prey runs both fp32 and fp64 variant irrespective of the --fp-precision setting. Signed-off-by: Jan Vesely <[email protected]>

llvm: Add support for different floating point precision conversions

6e7e498

Add tests. Conversion from double -> half is not accurate, because of an issue with a call to runtime library [0]. [0] numba/llvmlite#834 Signed-off-by: Jan Vesely <[email protected]>

llvm, UDF: Create Python float instance instead of string "Inf"

646d790

llvmlite does not handle special FP string values for other than fp64 precision [0] [0] numba/llvmlite#833 Signed-off-by: Jan Vesely <[email protected]>

llvm/builtins: Split 'is_close' builtin implementation by type

57f9984

Make sure we always have fp64 variant available. Add type specific tests. Signed-off-by: Jan Vesely <[email protected]>

tests, llvm/helpers: Convert numpy arrays to expected fp format

bf7383e

Signed-off-by: Jan Vesely <[email protected]>

tests, llvm/mt_random: Use function parameter types instead of hardco…

029bf71

…ding fp64 Signed-off-by: Jan Vesely <[email protected]>

tests, llvm/builtins_vector: Cast operands to the correct type

3258bcc

Bump vector length to 1500 to span more than one 4kB page. Signed-off-by: Jan Vesely <[email protected]>

tests/llvm/builtins: Consolidate matrix ops tests

3f9ef5c

Add const dimension variant to every tested op. Convert arguments to the right fp type. Signed-off-by: Jan Vesely <[email protected]>

tests/llvm/custom_func: Drop fixed dimension vector matrix multiply

44f4123

This is tested in test_builtins_matrix.py const dimensions tests. Signed-off-by: Jan Vesely <[email protected]>

tests/llvm/custom_func: Use pnlvm.ir instead of importing llvmlite ir…

9b9c725

… again Signed-off-by: Jan Vesely <[email protected]>

tests/llvm/compile: Convert arguments to the correct fp precision

8174a4e

Signed-off-by: Jan Vesely <[email protected]>

tests: Consolidate spelling of 'Philox' to easily identify philox tests

a65c3f4

This should switch to using pytest.mark if it gets more widespread Signed-off-by: Jan Vesely <[email protected]>

tests/llvm/builtins: Convert parameters to correct type in PTX tests

0344cc8

Signed-off-by: Jan Vesely <[email protected]>

Merge remote-tracking branch 'origin/devel' into devel-llvm

e1a9eb0

tests/models/necker_cube: Use higher tolerance when running in fp32 mode

547ceb0

Signed-off-by: Jan Vesely <[email protected]>

tests/functions/Distribution: Add fp32 expected values

04b660d

The results differ from fp64 because of rounding and use of Philox PRNG. Signed-off-by: Jan Vesely <[email protected]>

tests/functions/Selection: Add Philox fp32 results

d8b8f86

Signed-off-by: Jan Vesely <[email protected]>

test/functions/Memory: Remove dead code

1928717

Signed-off-by: Jan Vesely <[email protected]>

tests/TestMiscTrainingFunctionality: Add fp32 results

077b890

Signed-off-by: Jan Vesely <[email protected]>

tests/composition/control: Add testing for fp32 results and fp32 prec…

c209061

…ision Signed-off-by: Jan Vesely <[email protected]>

tests/function/Memory: Adjust probabilities for storage/retrieve

d88fe5e

These are now high/low enough to work with fp32 Philox random sequence. Make sure initializer values are different from the test values. Signed-off-by: Jan Vesely <[email protected]>

tests/functions/Distribution: Pass PRNG type as explicit test parameter

5e8fc13

Drop the hack using inf input values to select PRNG. Signed-off-by: Jan Vesely <[email protected]>

jvesely added the compiler Runtime Compiler label Apr 29, 2022

jvesely added the CUDA CUDA target for the runtime compiler label Apr 29, 2022

jvesely force-pushed the devel-llvm branch from 47be5f5 to af00fa8 Compare April 29, 2022 04:41

jvesely added 2 commits April 29, 2022 01:30

tests/functions/Distribution: Add special case result for PTX fp32 test

cc3e743

PTX results for fp32 DDA function with neg input differ due to operation accuracy/rounding. Signed-off-by: Jan Vesely <[email protected]>

Merge remote-tracking branch 'origin/devel' into devel-llvm

96bb7f7

jvesely force-pushed the devel-llvm branch from af00fa8 to 96bb7f7 Compare April 29, 2022 05:31

jvesely merged commit f939a97 into PrincetonUniversity:devel Apr 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llvm: Add option to select 32bit floating point representation #2400

llvm: Add option to select 32bit floating point representation #2400

jvesely commented Apr 29, 2022 •

edited

Loading

github-actions bot commented Apr 29, 2022

github-actions bot commented Apr 29, 2022

github-actions bot commented Apr 29, 2022

llvm: Add option to select 32bit floating point representation #2400

llvm: Add option to select 32bit floating point representation #2400

Conversation

jvesely commented Apr 29, 2022 • edited Loading

github-actions bot commented Apr 29, 2022

github-actions bot commented Apr 29, 2022

github-actions bot commented Apr 29, 2022

jvesely commented Apr 29, 2022 •

edited

Loading