Live Stream audio #670

MartinEmilEshack · 2021-05-18T01:57:27Z

My input audio will be recorded and might reach an hour or two of length. I'm also planning to host the model on a remote host. Uploading and processing an audio file that big will take so much time.

Is it possible to give the model a live stream audio? or maybe keep the same person-id for different small files?

If not I have a question
What is the largest length of audio file with maximum number of persons that the model can process without loosing accuracy?

hbredin · 2021-05-18T06:42:59Z

Is it possible to give the model a live stream audio? or maybe keep the same person-id for different small files?

Online speaker diarization is something that is not currently supported in pyannote.audio.
Regarding speaker tracking, it is feasible with pyannote.audio but not available out of the box.
See related issues #391 and #651.

hbredin · 2021-08-25T20:05:34Z

FYI, @juanmc2005 just released some code to perform live speaker diarization.

hbredin closed this as completed May 18, 2021

MartinEmilEshack mentioned this issue Jul 2, 2021

Comparing Speaker Embeddings #697

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Live Stream audio #670

Live Stream audio #670

MartinEmilEshack commented May 18, 2021

hbredin commented May 18, 2021

hbredin commented Aug 25, 2021

Live Stream audio #670

Live Stream audio #670

Comments

MartinEmilEshack commented May 18, 2021

hbredin commented May 18, 2021

hbredin commented Aug 25, 2021