adding whisper large peft+int8 training example #95
Conversation
This looks amazing! 🔥 Looks excellent to me!
sayakpaul commented on 2023-02-16T10:25:22Z (via ReviewNB): Add a small introduction section and club the code cells before
sayakpaul commented on 2023-02-16T10:25:23Z (via ReviewNB): Better to define
Also, for models that include multiple modalities like this one, we usually maintain a standalone
sayakpaul commented on 2023-02-16T10:25:25Z (via ReviewNB): Line #7. class DataCollatorSpeechSeq2SeqWithPadding:
Maybe this could later go into
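(For context, the data collator referenced in this comment follows the padding pattern used in common Whisper fine-tuning recipes. The sketch below is illustrative and may differ in detail from the code in the notebook under review.)

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Union

import torch


@dataclass
class DataCollatorSpeechSeq2SeqWithPadding:
    """Pads log-mel input features and tokenized labels into a uniform batch."""

    processor: Any  # a WhisperProcessor bundling feature extractor + tokenizer

    def __call__(
        self, features: List[Dict[str, Union[List[int], torch.Tensor]]]
    ) -> Dict[str, torch.Tensor]:
        # Pad the audio inputs with the feature extractor.
        input_features = [{"input_features": f["input_features"]} for f in features]
        batch = self.processor.feature_extractor.pad(input_features, return_tensors="pt")

        # Pad the labels with the tokenizer and replace padding with -100
        # so that it is ignored by the loss.
        label_features = [{"input_ids": f["labels"]} for f in features]
        labels_batch = self.processor.tokenizer.pad(label_features, return_tensors="pt")
        labels = labels_batch["input_ids"].masked_fill(
            labels_batch.attention_mask.ne(1), -100
        )

        # If the tokenizer already prepended the BOS token, drop it here;
        # it gets added back during training.
        if (labels[:, 0] == self.processor.tokenizer.bos_token_id).all().cpu().item():
            labels = labels[:, 1:]

        batch["labels"] = labels
        return batch
```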
sayakpaul commented on 2023-02-16T10:25:27Z (via ReviewNB): 🔥
sayakpaul commented on 2023-02-16T10:25:28Z (via ReviewNB): Maybe add a sentence drawing the reader's attention to the fact that we're ONLY training 1% of the total model params.
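(As a rough illustration of where that 1% figure comes from: the base model is loaded in 8-bit and frozen, and LoRA adapters are attached only to the attention projections, so only a small fraction of weights remains trainable. The hyperparameters below are assumptions for the sketch and may not match the notebook exactly.)

```python
from transformers import WhisperForConditionalGeneration
from peft import LoraConfig, get_peft_model, prepare_model_for_int8_training

# Load Whisper-large-v2 in 8-bit via bitsandbytes; the base weights stay frozen.
model = WhisperForConditionalGeneration.from_pretrained(
    "openai/whisper-large-v2", load_in_8bit=True, device_map="auto"
)
model = prepare_model_for_int8_training(model)

# Attach LoRA adapters only to the attention query/value projections.
config = LoraConfig(
    r=32,
    lora_alpha=64,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    bias="none",
)
model = get_peft_model(model, config)

# Prints trainable vs. total parameter counts; for a setup like this
# the trainable share comes out to roughly 1% of the full model.
model.print_trainable_parameters()
```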
sayakpaul commented on 2023-02-16T10:25:29Z (via ReviewNB): Put Seq2SeqTrainingArguments in
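(For reference, a sketch of how Seq2SeqTrainingArguments and Seq2SeqTrainer are typically wired up in an int8 + PEFT run. The values are illustrative, and names such as common_voice, data_collator, and processor are placeholders rather than the notebook's exact code.)

```python
from transformers import Seq2SeqTrainingArguments, Seq2SeqTrainer

training_args = Seq2SeqTrainingArguments(
    output_dir="whisper-large-v2-lora-int8",  # hypothetical output path
    per_device_train_batch_size=8,
    gradient_accumulation_steps=1,
    learning_rate=1e-3,
    warmup_steps=50,
    num_train_epochs=3,
    fp16=True,
    per_device_eval_batch_size=8,
    generation_max_length=128,
    logging_steps=25,
    # Both of these are needed because the PeftModel forward signature
    # differs from the wrapped model's forward.
    remove_unused_columns=False,
    label_names=["labels"],
)

trainer = Seq2SeqTrainer(
    args=training_args,
    model=model,                          # the PEFT-wrapped model from the previous sketch
    train_dataset=common_voice["train"],  # hypothetical preprocessed dataset
    eval_dataset=common_voice["test"],
    data_collator=data_collator,          # the padding collator sketched earlier
    tokenizer=processor.feature_extractor,
)
```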
You did very good work! 🔥
Maybe just format the code with something like jupyter-black so that the code reads more beautifully?
What does this PR do?