
Thank you very much for the improvements to Whisper. Could you clarify whether there is a "hallucination" problem, i.e. an issue with duplicate output in other languages? #61

Closed
isbn9877007 opened this issue Nov 20, 2023 · 2 comments

Comments

@isbn9877007
isbn9877007 commented Nov 20, 2023

Also, can you give the amount of video memory required to run the model? Does Windows require additional configuration?

@Vaibhavs10
Owner

Vaibhavs10 commented Nov 20, 2023

It still suffers from hallucination a bit. However, we have a fix for that coming up shortly.
See here: huggingface/transformers#27492

The max GPU VRAM should be around 12GB.

It should work out of the box on Windows (as long as you have a GPU) - do check out the FAQs for more info: https://github.com/Vaibhavs10/insanely-fast-whisper#frequently-asked-questions

(I'm closing this issue for now, feel free to re-open)

@152334H

152334H commented Dec 6, 2023

The code (on Whisper v2/v3, with chunk_length_s=30) still seems to produce repeated text on long transcriptions, even on the latest transformers commit (I'll try to add examples later). This behaviour doesn't occur with faster-whisper.

Adding a repetition penalty partly mitigates this, but it also degrades the general quality of the transcript. Beam search doesn't fix it.
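For reference, a minimal sketch of how the settings discussed above (chunk_length_s, a repetition penalty, and beam search) would be passed to the transformers automatic-speech-recognition pipeline. The helper name `build_asr_kwargs` and the specific penalty value are my own illustration, not part of this repo; `repetition_penalty` and `num_beams` are standard `generate()` parameters forwarded via `generate_kwargs`.

```python
def build_asr_kwargs(chunk_length_s=30, repetition_penalty=1.2, num_beams=1):
    """Assemble call kwargs for transformers' ASR pipeline.

    chunk_length_s controls the chunked long-form decoding window;
    everything in generate_kwargs is forwarded to model.generate(),
    which is where repetition_penalty and num_beams take effect.
    (Values here are illustrative, not recommendations.)
    """
    return {
        "chunk_length_s": chunk_length_s,
        "generate_kwargs": {
            "repetition_penalty": repetition_penalty,
            "num_beams": num_beams,
        },
    }


# Usage (assumes a pipeline built elsewhere, e.g.
#   pipe = pipeline("automatic-speech-recognition", model="openai/whisper-large-v3")):
# out = pipe("audio.mp3", **build_asr_kwargs(repetition_penalty=1.2))
```

As noted above, raising the repetition penalty trades repeated text for lower overall transcript quality, so it is a workaround rather than a fix.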
