[SYCL] Add faster reduction implementations using atomic or/and intel… #1615

v-klochkov · 2020-04-30T05:15:25Z

…::reduce()

Signed-off-by: Vyacheslav N Klochkov [email protected]

…::reduce() Signed-off-by: Vyacheslav N Klochkov <[email protected]>

romanovvlad · 2020-04-30T08:40:22Z

@Pennycook could you review the patch?

romanovvlad · 2020-04-30T08:55:44Z

sycl/include/CL/sycl/handler.hpp

@@ -761,6 +785,48 @@ class __SYCL_EXPORT handler {
 #endif
  }

+  /// Implements parallel_for() accepting nd_range and 1 reduction variable
+  /// having 'read_write' access mode.


[Minor] Suggest mentioning that reduction should support "fast atomics".

Vlad, doesn't this comment already tell that (see the line 3 and 4 of this comment section)?

/// Implements parallel_for() accepting nd_range and 1 reduction variable
/// having 'read_write' access mode.
/// This version uses fast sycl::atomic operations to update user's reduction
/// variable at the end of each work-group work.

Pennycook

I'm not sure about the names fast_atomic and fast_reduce going forward -- what we're really testing here is whether SYCL provides native atomics or reductions for those types (since a device is not required to guarantee that their implementation is fast).

I don't feel strongly enough about this to block merging this PR, but we might want to revisit this naming convention when tuning for additional platforms.

v-klochkov · 2020-04-30T18:05:52Z

I'm not sure about the names fast_atomic and fast_reduce going forward -- what we're really testing here is whether SYCL provides native atomics or reductions for those types (since a device is not required to guarantee that their implementation is fast).

I don't feel strongly enough about this to block merging this PR, but we might want to revisit this naming convention when tuning for additional platforms.

If for some device those 'native' atomics happen to work slowly, then the good move is to exclude such 'native' atomic from 'fast' atomics list and use different algorithm not using them. Such exclusion would require some additional changes and maybe dynamic/runtime checks on HOST (I did not think much about it yet).
Taking this idea into account using the word 'fast' seems reasonable, right?

Pennycook · 2020-04-30T18:10:00Z

If for some device those 'native' atomics happen to work slowly, then the good move is to exclude such 'native' atomic from 'fast' atomics list and use different algorithm not using them. Such exclusion would require some additional changes and maybe dynamic/runtime checks on HOST (I did not think much about it yet).
Taking this idea into account using the word 'fast' seems reasonable, right?

That's a good point. I agree -- as long as we continue to update the logic and use these specializations only if the features are expected to improve performance, "fast" is a good name.

alexbatashev

LGTM

…_docs * origin/sycl: (6482 commits) [SYCL][NFC] Clean formatting in Markdown documents (intel#1635) [SYCL][Doc] Remove obsolete parens from README (intel#1637) [SYCL] Fix failing ABI tests when LLVM_LIBDIR_SUFFIX is set (intel#1605) [SYCL] Fix warnings in libdevice (intel#1630) [SYCL][CUDA] Triage and clean LIT (intel#1620) [SYCL][NFC] Fix GCC 8 compilation warnings (intel#1631) [SYCL] Minor fixes in LowerWGScope [SYCL] PI: correct default interoperability plugin selection [SYCL] Add faster reduction implementations using atomic or/and intel::reduce() (intel#1615) [SYCL] Add sycl-ls utility for listing devices discovered/selected by SYCL RT (intel#1575) [SYCL] Fix getDeviceFromHandler declarations (intel#1626) [SPIR-V] Correct/improve declaration of SPIR-V builtins (intel#1519) [SYCL][USM] Improve USM allocator test and fix improper behavior. (intel#1538) [SYCL] Fix failing ABI LITs (intel#1622) [SYCL] Add support for MSVC internal math functions in device library (intel#1441) [SYCL] Add runtime library versioning (intel#1604) [SYCL] Check weak symbols in ABI dumps (intel#1609) [NFC][SYCL] Improve kernel metadata test (intel#1610) Revert "[SYCL] XFAIL LIT test due to duplicate diagnostic" (intel#1460) [SYCL] Move the reduction command group funcs out of handler.hpp (intel#1602) ...

[SYCL] Add faster reduction implementations using atomic or/and intel…

2c8542a

…::reduce() Signed-off-by: Vyacheslav N Klochkov <[email protected]>

v-klochkov requested a review from romanovvlad April 30, 2020 05:15

v-klochkov requested a review from a team as a code owner April 30, 2020 05:15

v-klochkov assigned Pennycook Apr 30, 2020

romanovvlad requested a review from Pennycook April 30, 2020 08:37

romanovvlad reviewed Apr 30, 2020

View reviewed changes

Pennycook approved these changes Apr 30, 2020

View reviewed changes

v-klochkov requested review from alexbatashev and romanovvlad May 1, 2020 16:30

alexbatashev approved these changes May 1, 2020

View reviewed changes

bader merged commit 05625f1 into intel:sycl May 1, 2020

v-klochkov mentioned this pull request May 5, 2020

[SYCL][CUDA] Reduction ext unsupported #1641

Merged

v-klochkov deleted the public_vklochkov_reduction_p3 branch May 8, 2020 06:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL] Add faster reduction implementations using atomic or/and intel… #1615

[SYCL] Add faster reduction implementations using atomic or/and intel… #1615

v-klochkov commented Apr 30, 2020

romanovvlad commented Apr 30, 2020

romanovvlad Apr 30, 2020

v-klochkov Apr 30, 2020

Pennycook left a comment

v-klochkov commented Apr 30, 2020

Pennycook commented Apr 30, 2020

alexbatashev left a comment

[SYCL] Add faster reduction implementations using atomic or/and intel… #1615

[SYCL] Add faster reduction implementations using atomic or/and intel… #1615

Conversation

v-klochkov commented Apr 30, 2020

romanovvlad commented Apr 30, 2020

romanovvlad Apr 30, 2020

Choose a reason for hiding this comment

v-klochkov Apr 30, 2020

Choose a reason for hiding this comment

Pennycook left a comment

Choose a reason for hiding this comment

v-klochkov commented Apr 30, 2020

Pennycook commented Apr 30, 2020

alexbatashev left a comment

Choose a reason for hiding this comment