add snapshot tests #31

DhruvDh · 2024-03-07T18:09:05Z

This draft PR adds snapshot tests. I am trying to incrementally rewrite operations in KernelAbstractions.jl, and am using these tests to ensure consistent outputs at temp 0.

Currently, it doesn't run on GPUs, for that I think a method that moves weights to a chosen backend would be needed.
Right now, I just want to write correct kernels that run on the CPU and produce outputs consistent to the original version.

Here is a kernel for rmsnorm - DhruvDh@1d49d42. It makes the whole thing some 2-5% slower but produces correct outputs. This way, we can try incrementally writing kernels for operations and later look at the results and decide if GPU acceleration is needed, how to maintain readability, etc.

I can also keep a draft PR open where we do this incremental addition of kernels

I am also having trouble with formatting, if a .JuliaFormatter.toml can be added to the project, that would help a lot.

cafaxo · 2024-03-22T08:23:49Z

GPU support would be awesome. I am not sure if testing for exact reproduction is that useful since language models are very sensitive to rounding/quantization errors.
Even llama.cpp produces different outputs depending on the backend (see ggerganov/llama.cpp#4755).

add snapshot tests

aee78b8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add snapshot tests #31

add snapshot tests #31

DhruvDh commented Mar 7, 2024

cafaxo commented Mar 22, 2024 •

edited

Loading

add snapshot tests #31

Are you sure you want to change the base?

add snapshot tests #31

Conversation

DhruvDh commented Mar 7, 2024

cafaxo commented Mar 22, 2024 • edited Loading

cafaxo commented Mar 22, 2024 •

edited

Loading