Hello,

I am looking for a way to do chunk-based inference instead of streaming inference when working with audio files.

The issue is that each audio file currently gets its own inference run, and therefore its own fresh state (new speaker embeddings), which is unwanted behaviour for my program.

How can I achieve the desired behaviour of running inference on larger chunks of audio (e.g. 20 seconds) while keeping the pipeline state across chunks and files?
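For concreteness, here is a minimal sketch of the pattern I have in mind. `StatefulDiarization` and its `process_chunk` method are hypothetical placeholders, not a real API from this library; the point is only that a single pipeline instance is reused across all chunks and all files, so its state is never reset:

```python
import torchaudio

SAMPLE_RATE = 16_000
CHUNK_SECONDS = 20


def iter_chunks(path, chunk_seconds=CHUNK_SECONDS, sample_rate=SAMPLE_RATE):
    """Yield consecutive fixed-length waveform chunks from one audio file."""
    waveform, sr = torchaudio.load(path)
    if sr != sample_rate:
        waveform = torchaudio.functional.resample(waveform, sr, sample_rate)
    chunk_len = chunk_seconds * sample_rate
    for start in range(0, waveform.size(1), chunk_len):
        yield waveform[:, start:start + chunk_len]


class StatefulDiarization:
    """Hypothetical stand-in for the real pipeline; keeps state across calls."""

    def process_chunk(self, chunk):
        raise NotImplementedError("replace with the actual pipeline call")


# Build the pipeline ONCE so its internal state (e.g. speaker
# embeddings / cluster centroids) persists across calls.
pipeline = StatefulDiarization()

for path in ["call_part1.wav", "call_part2.wav"]:  # example file names
    for chunk in iter_chunks(path):
        # Every chunk goes through the SAME pipeline instance, so speaker
        # identities stay consistent across chunks and across files.
        annotation = pipeline.process_chunk(chunk)
```

The key design point is simply that the pipeline object is constructed once and never recreated between files, so whatever internal state it accumulates carries over from one chunk to the next.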