Add Wav2Vec2BertProcessorWithLM #30671

FredHaa · 2024-05-06T10:48:11Z

Feature request

Wav2Vec2-Bert was open sourced and integrated with Transformers in the end of last year. However, it is missing an easy integration with pyctcdecode similar to Wav2Vec2ProcessorWithLM. This should be quite trivial to implement, since Wav2Vec2Processor is very similar to Wav2Vec2BertProcessor, the only difference being that they use different feature extractors.

Motivation

Having a Wav2Vec2BertProcessorWithLM class would make it possible to use Wav2Vec2-Bert with a kenlm model in a Transformers ASR pipeline.

Your contribution

I can submit a PR.

LysandreJik · 2024-05-06T12:00:36Z

cc @sanchit-gandhi @ylacombe

ylacombe · 2024-05-20T09:56:48Z

Hey @FredHaa, #28706 should fix this, I'm reopening it! Note that you would have to use Wav2Vec2ProcessorWithLM and not Wav2Vec2BertProcessorWithLM!

ylacombe · 2024-05-20T11:41:45Z

#28706 has been merged, I'm closing the issue for now, feel free to ask questions

FredHaa changed the title ~~Wav2Vec2BertProcessorWithLM~~ Add Wav2Vec2BertProcessorWithLM May 6, 2024

amyeroberts added Feature request Request for a new feature Audio labels May 7, 2024

ylacombe closed this as completed May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Wav2Vec2BertProcessorWithLM #30671

Add Wav2Vec2BertProcessorWithLM #30671

FredHaa commented May 6, 2024

LysandreJik commented May 6, 2024

ylacombe commented May 20, 2024

ylacombe commented May 20, 2024

Add Wav2Vec2BertProcessorWithLM #30671

Add Wav2Vec2BertProcessorWithLM #30671

Comments

FredHaa commented May 6, 2024

Feature request

Motivation

Your contribution

LysandreJik commented May 6, 2024

ylacombe commented May 20, 2024

ylacombe commented May 20, 2024