Error when using music_speech..._89.98.pt: pytorch-lightning_version #16
Comments
I asked the same here; it seems to be a model focused on music separation, but I wasn't able to load it.
Oh cool, thanks. I did look through the issue titles but must have missed
this one. Thanks for pointing it out.
Although it seems liuxubo717 closed without solving/addressing it...
Best wishes and thanks for the work liuxubo717,
Tom
On Tue, 24 Oct 2023 at 02:32, Fabio Grasso wrote:
> I asked the same here (#6); it seems to be a model focused on music separation, but I wasn't able to load it.
I was able to fix this error by copying the missing keys from the first checkpoint to the second.
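For anyone wanting to try the same workaround, here is a minimal sketch of the idea rather than the exact steps used above. It assumes both checkpoints load as plain Python dictionaries and that only top-level keys (such as `pytorch-lightning_version`, the key named in the issue title) are missing. The helper name `copy_missing_keys` and the toy dictionary contents are hypothetical.

```python
from typing import Any, Dict, List, Tuple


def copy_missing_keys(
    source: Dict[str, Any], target: Dict[str, Any]
) -> Tuple[Dict[str, Any], List[str]]:
    """Return a copy of `target` with absent top-level keys filled in from `source`.

    Keys already present in `target` are left untouched, so its weights
    (e.g. a "state_dict" entry) are never overwritten.
    """
    patched = dict(target)  # shallow copy; the original dict is not modified
    copied: List[str] = []
    for key, value in source.items():
        if key not in patched:
            patched[key] = value
            copied.append(key)
    return patched, copied


# Toy stand-ins for the two loaded checkpoints (hypothetical contents).
reference = {"pytorch-lightning_version": "x.y.z", "state_dict": {"a": 1}, "epoch": 3}
broken = {"state_dict": {"b": 2}}

patched, copied = copy_missing_keys(reference, broken)
print("Copied keys:", copied)
```

With real files, each dictionary would presumably come from `torch.load(path, map_location="cpu")` and the patched result would be written back with `torch.save(patched, new_path)`. Which file counts as the "first" checkpoint and which keys are actually missing are assumptions here, so inspect the printed key list before relying on the result.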
music_speech_audioset_epoch_15_esc_89.98.pt is not used for music source separation. It is used to initialise the text encoder (https://github.com/Audio-AGI/AudioSep/blob/main/models/clap_encoder.py#L13) of the AudioSep model.
The error comes from newer versions of transformers. Run the script from LAION-AI/CLAP#127 (comment) on the music_speech_audioset_epoch_15_esc_89.98.pt checkpoint.
From your paper, I wasn't sure of the role/purpose of music_speech_audioset_epoch_15_esc_89.98.pt
Are these the saved model weights one should use if one wants to focus on separation of musical instruments from one another, say? Or is audiosep_base_4M_steps.ckpt still applicable in such use cases?
When I edited your example inference code from the readme to use music_speech_audioset_epoch_15_esc_89.98.pt on a Linux machine running Ubuntu, I got the following error.
Please clarify the purpose/use of this checkpoint, and if it is meant to be used, whether I need to modify the example inference code further.
Thanks!