
Add generic KV caching support, use it with Whisper #307

Merged 1 commit into huggingface:main from whisper_kv_cache on Apr 17, 2023

Conversation

@katalinic-gc (Collaborator) commented Apr 6, 2023

What does this PR do?

Adds generic key-value (KV) caching support for generation, and uses it with Whisper.

Intended usage for Whisper:

pipelined_model = pipelined_model.parallelize(
    for_generation=True, use_cache=use_cache, batch_size=batch_size, max_length=448, num_beams=num_beams
)
pipelined_model.generate(input_features, use_cache=use_cache, **kwargs)
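For context, here is a fuller end-to-end sketch, not taken from the PR itself. It assumes optimum-graphcore's to_pipelined helper and IPUConfig, the openai/whisper-tiny checkpoint, and illustrative values for ipus_per_replica, batch_size, and num_beams; the audio input is a dummy placeholder.

# Hedged sketch of intended usage on IPU; names and values below are
# assumptions, not part of this PR: to_pipelined, IPUConfig(ipus_per_replica=2),
# the openai/whisper-tiny checkpoint, and the dummy audio input.
import numpy as np
from transformers import WhisperProcessor, WhisperForConditionalGeneration
from optimum.graphcore import IPUConfig
from optimum.graphcore.modeling_utils import to_pipelined

processor = WhisperProcessor.from_pretrained("openai/whisper-tiny")
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny")

# Convert the Hugging Face model to its pipelined IPU counterpart.
ipu_config = IPUConfig(ipus_per_replica=2)  # illustrative value
pipelined_model = to_pipelined(model, ipu_config)

# Compile for generation with KV caching enabled, mirroring the PR snippet.
pipelined_model = pipelined_model.parallelize(
    for_generation=True, use_cache=True, batch_size=1, max_length=448, num_beams=1
)

# 30 s of silence (16 kHz mono) as a stand-in for real audio.
audio = np.zeros(30 * 16000, dtype=np.float32)
input_features = processor(audio, sampling_rate=16000, return_tensors="pt").input_features

generated_ids = pipelined_model.generate(input_features, use_cache=True, max_length=448)
print(processor.batch_decode(generated_ids, skip_special_tokens=True))

With use_cache=True, each decoding step reuses the keys and values computed for earlier positions instead of recomputing attention over the whole prefix, which is what makes incremental generation cheap.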

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@HuggingFaceDocBuilderDev commented Apr 6, 2023

The documentation is not available anymore as the PR was closed or merged.

@katalinic-gc marked this pull request as ready for review on April 6, 2023 12:14
@katalinic-gc force-pushed the whisper_kv_cache branch 5 times, most recently from e5ec8b8 to b349221, on April 12, 2023 11:14
@jimypbr (Contributor) left a comment

Great work!

@jimypbr merged commit 3f92baa into huggingface:main on Apr 17, 2023
@jimypbr deleted the whisper_kv_cache branch on April 17, 2023 12:04