Howto / Allow Inject initial embeddings inside pipeline cluster #1764
heralight
started this conversation in
Development
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Help Needed: Reinjection of Previous Embeddings in
pyannote/speaker-diarization-3.1
PipelineIssue Type:
🆘 Support Request
Description
Yet another question on how identify speaker on multiple source...
Hello PyAnnote Team,
I am using the
pyannote/speaker-diarization-3.1
pipeline for speaker diarization in my project. I aim to improve speaker consistency across multiple audio chunks by reinjecting embeddings from a previous chunk into the pipeline when processing the next chunk.Attempts Made:
Modifying Clustering:
SpeakerDiarization
pipeline to store and utilize previous embeddings and cluster centroids.compute_embeddings
andcluster
methods to concatenate previous embeddings and initialize KMeans with prior centroids.Using Hooks:
initial_embeddings
during the second call to maintain speaker identity.Example Use Case:
Specifically, I ideally want to perform the following:
Guidance on Reinjection Mechanism:
How can I effectively pass initial_embeddings from one pipeline call to the next to maintain speaker consistency?
Pipeline Customization:
Example Implementation:
Environment
pyannote.audio version: 3.1
Python version: 3.11
Torch version: 2.4.1+cu121
GPU: NVIDIA CUDA-enabled device
Thank you for your assistance!
Best regards,
Alexandre
Beta Was this translation helpful? Give feedback.
All reactions