How to get the compiled function which can be called later? #230
-
Once I do the tuning, is there a way to get the best compiled kernel back as a function that I can call later? Essentially, I'm looking for an autotuner that returns the best compiled version. Thanks!
Replies: 3 comments
-
Hi @vesuppi! Good question, as this isn't documented all that well. For Python applications, we have `PythonKernel` from `kernel_tuner.kernelbuilder`. This example shows the simplest way to use it:

https://github.com/KernelTuner/kernel_tuner/blob/master/examples/cuda/python_kernel.py

The idea is that you can either directly specify which configuration you want with the `params=` option of `PythonKernel`. For example, you could use `get_best_config` from `kernel_tuner.util` and pass the result as `params`.

A probably better way is to let Kernel Tuner figure out which configuration to use, based on tuning results of `tune_kernel` that have been stored to a file. In that case, you use the `results_file=` option of `PythonKernel` and point it to a results file written using the `store_results` function in `kernel_tuner.integration`. This may seem like an additional step, but it enables you to tune once, store the results, and then run the application many times reusing the same tuning results. The kernel configuration to compile is selected based on the GPU available at run time and the specified problem size.
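To make the first approach concrete, here is a minimal sketch of what selecting the best configuration amounts to. The function name `get_best_config_sketch` and the exact shape of `results` are assumptions for illustration; the real `kernel_tuner.util.get_best_config` may differ in details, but conceptually it picks the configuration with the lowest measured objective:

```python
# Hypothetical sketch: picking the best configuration from tuning results.
# Assumed shape: each entry in `results` is a dict mapping tunable parameter
# names to values, plus the measured objective (here "time").
def get_best_config_sketch(results, objective="time"):
    """Return the configuration with the lowest measured objective."""
    best = min(results, key=lambda cfg: cfg[objective])
    # Strip the measurement so only tunable parameters remain, ready to be
    # passed as the configuration for compiling the kernel.
    return {k: v for k, v in best.items() if k != objective}

results = [
    {"block_size_x": 128, "time": 0.42},
    {"block_size_x": 256, "time": 0.31},
    {"block_size_x": 512, "time": 0.35},
]
print(get_best_config_sketch(results))  # {'block_size_x': 256}
```

The resulting dict of tunable parameters is what you would hand to `PythonKernel` via its `params=` option.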
-
I see, thank you very much for the detailed explanation! I was able to get the best config using:

results, env = tune_kernel("vector_add", kernel_string, N, (c, a, b, torch.tensor(N)), tune_params)
best_config = util.get_best_config(results, 'time')

I haven't tried `PythonKernel` yet, but I will! Another side question: if we want to tune the size of the thread block, do the parameters have to be named "block_size_x" and "block_size_y"? Thanks!
-
You can indeed use other names for the thread block dimensions. You can specify these names using the `block_size_names=` option of `tune_kernel`. There is a test in the repository that illustrates how to use this option.
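To illustrate what renaming the block-size parameters means, here is a small sketch (the helper `block_dims` is hypothetical, not part of Kernel Tuner): the tuner must know which parameter names in a configuration denote the thread block dimensions, and by default it looks for "block_size_x", "block_size_y", and "block_size_z":

```python
# Hypothetical sketch: extracting thread block dimensions from a tuning
# configuration. Kernel Tuner's default names are block_size_x/y/z; with
# custom names you tell the tuner which parameters to read instead.
def block_dims(config, block_size_names=("block_size_x", "block_size_y", "block_size_z")):
    """Return (x, y, z) thread block dimensions; missing dimensions default to 1."""
    return tuple(config.get(name, 1) for name in block_size_names)

# A configuration using custom names for the block dimensions:
config = {"tile_x": 32, "tile_y": 8}
print(block_dims(config, block_size_names=("tile_x", "tile_y", "tile_z")))  # (32, 8, 1)
```

With the default names, a configuration like `{"block_size_x": 64}` would resolve to a (64, 1, 1) thread block in the same way.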