Vectorize InstanceNormalization and BatchNormalization #469

robertknight · 2024-12-19T08:14:04Z

InstanceNormalization has two steps:

Compute channel mean and variance
Shift and scale result to normalize the mean and variance, and then apply a per-channel scale and bias

BatchNormalization is similar but uses precomputed values for step 1. Previously part of step 1 was vectorized, but not the variance calculation. Add a vectorized kernel for step 2 and use it for BatchNormalization and InstanceNormalization.

Tested using the wav2vec example on x64, this made InstanceNormalization ~3x faster (~26ms -> ~8ms per run).

This enables printing a debug representation of a vector implementing `Simd` via `println!("{:?}", simd_vec.to_array())`.

Add two vectorized functions that will be useful as part of the InstanceNormalization operation. - `vec_sum_square_sub` is like `vec_sum_square` but subtracts a constant from each element before squaring. - `vec_shift_scale_bias` subtracts a constant from each element and then shifts and scales the result.

…tion InstanceNormalization has three steps: 1. Compute channel mean 2. Compute channel variance 3. Shift and scale result to normalize the mean and variance, and then apply a per-channel scale and bias Previously only step 1 was vectorized. This vectorizes steps 2 and 3 as well. Tested using the wav2vec example on x64, this made InstanceNormalization ~3x faster (~26ms -> ~8ms per run).

BatchNormalization and InstanceNormalization are very similar, except BatchNormalization uses pre-computed mean and variance statistics while InstanceNormalization computes them dynamically. Extract the vectorized normalization step from `instance_normalization` and re-use it for `batch_normalization`.

robertknight added 4 commits December 19, 2024 07:43

Add additional bounds to Simd::Array associated type

9cfa719

This enables printing a debug representation of a vector implementing `Simd` via `println!("{:?}", simd_vec.to_array())`.

robertknight changed the title ~~Vectorize variance calculation and input scaling in InstanceNormalization~~ Vectorize InstanceNormalization and BatchNormalization Dec 19, 2024

robertknight merged commit 1fd414f into main Dec 19, 2024
2 checks passed

robertknight deleted the vec-instance-norm branch December 19, 2024 09:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vectorize InstanceNormalization and BatchNormalization #469

Vectorize InstanceNormalization and BatchNormalization #469

robertknight commented Dec 19, 2024 •

edited

Loading

Vectorize InstanceNormalization and BatchNormalization #469

Vectorize InstanceNormalization and BatchNormalization #469

Conversation

robertknight commented Dec 19, 2024 • edited Loading

robertknight commented Dec 19, 2024 •

edited

Loading