Currently, all of the metrics more or less follow this scheme:
```python
x: array
y: array
a: array

for x_instance, y_instance, a_instance in zip(x, y, a):
    for perturbation_step in range(perturbation_steps):
        x_perturbed = perturb_instance(x_instance, a_instance, perturbation_step)
        y_perturbed = model(x_perturbed)
        score = calculate_score_for_instance(y_instance, y_perturbed)
```
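To make the scheme concrete, here is a minimal runnable sketch of the per-instance loop. The `model`, `perturb_instance` (masking the top-attributed features), and `calculate_score_for_instance` stand-ins are hypothetical toys, not the actual library code:

```python
import numpy as np

rng = np.random.default_rng(0)

def model(x):
    # Toy model: sum of features (works on a single instance or a batch).
    return np.atleast_2d(x).sum(axis=-1)

def perturb_instance(x_instance, a_instance, perturbation_step):
    # Hypothetical perturbation: zero out the (step + 1) most-attributed features.
    ranking = np.argsort(-a_instance)
    x_perturbed = x_instance.copy()
    x_perturbed[ranking[: perturbation_step + 1]] = 0.0
    return x_perturbed

def calculate_score_for_instance(y_instance, y_perturbed):
    # Toy score: absolute change in the model output.
    return float(np.abs(y_instance - y_perturbed))

x = rng.normal(size=(4, 8))   # 4 instances, 8 features
a = rng.random(size=(4, 8))   # attributions
y = model(x)
perturbation_steps = 3

scores = []
for x_instance, y_instance, a_instance in zip(x, y, a):
    for perturbation_step in range(perturbation_steps):
        x_perturbed = perturb_instance(x_instance, a_instance, perturbation_step)
        y_perturbed = model(x_perturbed)
        scores.append(calculate_score_for_instance(y_instance, y_perturbed))

print(len(scores))  # one score per instance per step → 12
```

Note that every iteration of the inner loop triggers a separate `model` call on a single instance, which is exactly the overhead the batched approach below avoids.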
The choice of `perturb_instance` arguments is just for simplicity; the actual code is of course more complex than presented.
But this kind of implementation doesn't exploit the performance benefits of batched model prediction and vectorized NumPy functions. Instead, we could speed up computation by an order of magnitude with the following approach:
```python
x: array
y: array
a: array
batch_size: int

generator = BatchGenerator(x, y, a, batch_size)
for x_batch, y_batch, a_batch in generator:
    for perturbation_step in range(perturbation_steps):
        x_batch_perturbed = perturb_batch(x_batch, a_batch, perturbation_step)
        y_batch_perturbed = model(x_batch_perturbed)
        score = calculate_score_for_batch(y_batch, y_batch_perturbed)
```
Some `perturb_batch` functions may still need an inner for-loop, but others could certainly be computed on the whole batch at once.
Depending on the dataset size and model complexity, this should lead to significant improvements in performance.
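A runnable sketch of the batched variant might look like the following. `batch_generator` is a minimal stand-in for the suggested `BatchGenerator`, and `perturb_batch` is a hypothetical vectorized perturbation that masks the top-attributed features of every instance in one NumPy operation:

```python
import numpy as np

rng = np.random.default_rng(0)

def model(x_batch):
    # Toy batched model: one forward pass for the whole batch.
    return x_batch.sum(axis=-1)

def batch_generator(x, y, a, batch_size):
    # Minimal stand-in for the proposed BatchGenerator.
    for start in range(0, len(x), batch_size):
        stop = start + batch_size
        yield x[start:stop], y[start:stop], a[start:stop]

def perturb_batch(x_batch, a_batch, perturbation_step):
    # Vectorized over the batch axis: zero out the (step + 1)
    # most-attributed features of every instance, no Python loop.
    ranking = np.argsort(-a_batch, axis=-1)
    x_perturbed = x_batch.copy()
    rows = np.arange(len(x_batch))[:, None]
    x_perturbed[rows, ranking[:, : perturbation_step + 1]] = 0.0
    return x_perturbed

def calculate_score_for_batch(y_batch, y_batch_perturbed):
    # Toy score: absolute output change, one value per instance.
    return np.abs(y_batch - y_batch_perturbed)

x = rng.normal(size=(6, 8))   # 6 instances, 8 features
a = rng.random(size=(6, 8))   # attributions
y = model(x)
perturbation_steps = 3

all_scores = []
for x_batch, y_batch, a_batch in batch_generator(x, y, a, batch_size=4):
    for perturbation_step in range(perturbation_steps):
        x_batch_perturbed = perturb_batch(x_batch, a_batch, perturbation_step)
        y_batch_perturbed = model(x_batch_perturbed)
        all_scores.append(calculate_score_for_batch(y_batch, y_batch_perturbed))

scores = np.concatenate(all_scores)
print(scores.shape)  # (18,) → 6 instances × 3 perturbation steps
```

The model is now called once per batch and perturbation step rather than once per instance, so the number of forward passes drops by a factor of `batch_size`.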