Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

not working on colab #28

Open
faheem-khaskheli opened this issue Oct 6, 2020 · 21 comments
Open

not working on colab #28

faheem-khaskheli opened this issue Oct 6, 2020 · 21 comments

Comments

@faheem-khaskheli
Copy link

cannot download youtube videos or audio

@mwaseemrandhawa
Copy link

Same issue. are you resolved it?

@faheem-khaskheli
Copy link
Author

yes, i solved the issues, have to install some modules and change path of files when using these in single jupyter notebook.

@mwaseemrandhawa
Copy link

can you please share a code with me?

@faheem-khaskheli
Copy link
Author

i will upload it on github today.

@mwaseemrandhawa
Copy link

mwaseemrandhawa commented Oct 7, 2020 via email

@faheem-khaskheli
Copy link
Author

yes, but downloading dataset on colab is taking hours

@mwaseemrandhawa
Copy link

mwaseemrandhawa commented Oct 7, 2020 via email

@faheem-khaskheli
Copy link
Author

i download 100 audio and videos to test the model

@mwaseemrandhawa
Copy link

mwaseemrandhawa commented Oct 7, 2020 via email

@faheem-khaskheli
Copy link
Author

this model is not pretrained, we have to train it using whole dataset.
do you have pretrained weights for this model?

@mwaseemrandhawa
Copy link

mwaseemrandhawa commented Oct 7, 2020 via email

@faheem-khaskheli
Copy link
Author

according to readme, model v1 is audio only.
model v2 is both video and audio.
these weights belong to v1 audio only model

@mwaseemrandhawa
Copy link

mwaseemrandhawa commented Oct 7, 2020 via email

@faheemkhaskheli9
Copy link

here, i uploaded file that i used on colab.
https://github.com/faheemkhaskheli9/speech_separation
if you have any issue then contact me here, i can fix it.

@mwaseemrandhawa
Copy link

mwaseemrandhawa commented Oct 7, 2020 via email

@koravadi
Copy link

Since we are using the bidirectional LSTM in the network, can this be used for the realtime application? or this works only for the recorded data?

@faheem-khaskheli
Copy link
Author

it can be used on real time data, you just have to split the input stream into chunks, like get 3 second data and feed it to model, then another 3 second data.
you have to wait for 3 second so that you can collect 3 second recording. that will be lag in system.

@koravadi
Copy link

@faheem-khaskheli thanks for clarification!

@koravadi
Copy link

@faheem-khaskheli I have one more question. Why it needs to be 3 sec why can't it be 1 sec? is there any parameter defines this 3 sec interval in the network?

@faheemkhaskheli9
Copy link

no reason, you can make it 1 second but you need some useful data to do task so long will be better then 1 sec or less then 1 sec

@SutirthaChakraborty
Copy link

Thanks @faheemkhaskheli9 I tried to replicate your colab code. The last part 'evaluate the model and generate the prediction' is not working. I am getting silent audio as output. Any help ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants