Running without "wespeaker-voxceleb-resnet34-LM" #1476

doublex · 2023-09-27T07:22:02Z

Is it possible to run version 3 without wespeaker-voxceleb-resnet34-LM?

The text was updated successfully, but these errors were encountered:

github-actions · 2023-09-27T07:22:28Z

Thank you for your issue.You might want to check the FAQ if you haven't done so already.

Feel free to close this issue if you found an answer in the FAQ.

If your issue is a feature request, please read this first and update your request accordingly, if needed.

If your issue is a bug report, please provide a minimum reproducible example as a link to a self-contained Google Colab notebook containing everthing needed to reproduce the bug:

installation
data preparation
model download
etc.

Providing an MRE will increase your chance of getting an answer from the community (either maintainers or other power users).

Companies relying on pyannote.audio in production may contact me via email regarding:

paid scientific consulting around speaker diarization and speech processing in general;
custom models and tailored features (via the local tech transfer office).

This is an automated reply, generated by FAQtory

hbredin · 2023-09-27T07:34:40Z

You can use any supported speaker embedding in place of hbredin/wespeaker-voxceleb-resnet34-LM.

See pyannote.audio.pipelines.speaker_verification.PretrainedSpeakerEmbedding for a list of supported models.

It should be as simple as downloading this configuration file, updating this line and that line, and then load it with

from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("config.yml")

Note however that the reason why I chose this speaker embedding for version 3.0.0 is that it is the one that gave me the best result. I'd love to hear the reason why you would like to change it? Research? Any other reasons?

doublex · 2023-09-27T07:41:43Z

@hbredin
Thanks for your answer.
onnxruntime is slow (runs on the CPU). There is a dropin-replacement onnxruntime-gpu which uses CUDA - but using onnxruntime is cumbersome....

hbredin · 2023-09-27T08:04:48Z

Would switching to onnxruntime-gpu solve the issue?
Does it also support CPU?

doublex · 2023-09-27T08:08:42Z

Yes - but it does not support running on the CPU - 😠😠😠😠
My knowledge is limited.

hbredin · 2023-09-27T10:51:08Z

Could you try with this and let me know if that allows to run on GPU on your side?

pip install https://github.com/pyannote/pyannote-audio/archive/refs/heads/fix/onnxruntime-gpu.zip

All it does is switch from onnxruntime to onnxruntime-gpu which does seem to also support CPU

guilhermehge · 2023-09-27T12:34:17Z

@hbredin how would I know which threshold to use for each model?

hbredin · 2023-09-27T13:57:48Z

You would not know. You have to tune this threshold.
You could also use pyannote/speaker-diarization-2.1 configuration file as a reasonable starting point.

hbredin · 2023-11-16T19:49:03Z

Closing as latest version no longer relies on ONNX hbredin/wespeaker-voxceleb-resnet34-LM and this solves the original issue.

Please update to pyannote.audio 3.1 and pyannote/speaker-diarization-3.1 (and open new issues if needed).

realfolkcode mentioned this issue Sep 27, 2023

pipeline.to(torch.device("cuda")) not working on T4 Tesla GPU (pyannote==3.0.0) #1475

Closed

hbredin mentioned this issue Sep 27, 2023

fix: fix WeSpeakerPretrainedSpeakerEmbedding GPU support #1478

Merged

hbredin closed this as completed Nov 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Running without "wespeaker-voxceleb-resnet34-LM" #1476

Running without "wespeaker-voxceleb-resnet34-LM" #1476

doublex commented Sep 27, 2023

github-actions bot commented Sep 27, 2023

hbredin commented Sep 27, 2023 •

edited

Loading

doublex commented Sep 27, 2023

hbredin commented Sep 27, 2023

doublex commented Sep 27, 2023

hbredin commented Sep 27, 2023 •

edited

Loading

guilhermehge commented Sep 27, 2023

hbredin commented Sep 27, 2023

hbredin commented Nov 16, 2023

Running without "wespeaker-voxceleb-resnet34-LM" #1476

Running without "wespeaker-voxceleb-resnet34-LM" #1476

Comments

doublex commented Sep 27, 2023

github-actions bot commented Sep 27, 2023

hbredin commented Sep 27, 2023 • edited Loading

doublex commented Sep 27, 2023

hbredin commented Sep 27, 2023

doublex commented Sep 27, 2023

hbredin commented Sep 27, 2023 • edited Loading

guilhermehge commented Sep 27, 2023

hbredin commented Sep 27, 2023

hbredin commented Nov 16, 2023

hbredin commented Sep 27, 2023 •

edited

Loading

hbredin commented Sep 27, 2023 •

edited

Loading