Use KernelAbstractions.jl for gather/scatter kernels #487
Conversation
It may be that LLVM on Julia 1.6 is too old and we can't support all of the ops that we currently support for CUDA. As an alternative, we can retain (for now) NNlibCUDA scatter.
I've retained scatter kernels for CUDA in NNlibCUDA so that this PR is not blocked.
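For context, the appeal of the KernelAbstractions approach is that a single kernel definition covers every backend. A minimal sketch of that pattern, not NNlib's actual implementation (`gather_kernel!` and `gather_demo!` are illustrative names):

```julia
using KernelAbstractions

# One kernel source runs on CPU(), CUDABackend(), ROCBackend(), etc.
@kernel function gather_kernel!(dst, src, idx)
    i = @index(Global, Linear)      # one work-item per output element
    @inbounds dst[i] = src[idx[i]]  # fetch the indexed source element
end

function gather_demo!(dst, src, idx)
    backend = get_backend(dst)      # infer the backend from the output array
    gather_kernel!(backend)(dst, src, idx; ndrange = length(dst))
    KernelAbstractions.synchronize(backend)
    return dst
end
```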
FWIW the copy of NNlibCUDA in this repo doesn't do anything at present, so we need to make sure the overloads in the standalone NNlibCUDA repo still work.
IIUC, if NNlibCUDA in this repo is not registered, then tests at …
Not sure what's with CUDA on 1.8... |
Probably it was some kind of hiccup; the CI passes now.
Anything left before merging?
I don't think so; this should be good to go.
Depends on #486.
Successfully tested on:
- CPU
- CUDABackend
- ROCBackend

Since LLVM does not support certain atomic operations (fmin, fmax, fmul, fdiv), tests for them are disabled when using such backends, which is currently only ROCBackend:
- https://llvm.org/docs/LangRef.html#atomicrmw-instruction
- fmin/fmax requires LLVM 15+: https://reviews.llvm.org/D127041
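As a hedged illustration of why those ops matter: in a scatter reduction, several source elements can target the same destination slot concurrently, so each update must be an atomic read-modify-write, which lowers to LLVM's `atomicrmw` instruction. A sketch using Atomix.jl for the atomic update (`scatter_add_kernel!` and `scatter_add_demo!` are illustrative names, not NNlib's actual API):

```julia
using KernelAbstractions, Atomix

# Several source elements may map to the same destination index, so the
# accumulation must be atomic rather than a plain read-add-write.
@kernel function scatter_add_kernel!(dst, src, idx)
    i = @index(Global, Linear)
    Atomix.@atomic dst[idx[i]] += src[i]   # atomic accumulate into dst
end

# Launch on whatever backend the destination array lives on.
function scatter_add_demo!(dst, src, idx)
    backend = get_backend(dst)
    scatter_add_kernel!(backend)(dst, src, idx; ndrange = length(src))
    KernelAbstractions.synchronize(backend)
    return dst
end
```

Atomic `+` is widely supported, but the floating-point min/max/mul/div variants are the ones LLVM lacks on older versions, which is why those reduction tests are skipped on the affected backends.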
TODO:
- `@inbounds`

PR Checklist