This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

Could you provide any instructions to preprocess the dataset? #19

Open
tangyuelm opened this issue May 6, 2021 · 0 comments

Comments

@tangyuelm

Hello,

I am new to the CPC method and would like to learn from your marvelous code. However, I am still confused about how to preprocess the dataset. I downloaded the LibriSpeech train-clean-100 subset from the website, but I do not know how to arrange the files as follows. It also seems that this dataset contains only training samples, without labels. I am also unsure how to use the training/validation sequence lists and the Train / Val splits. Are there any detailed instructions?
PATH_AUDIO_FILES
│
└───speaker1
│    └───...
│         │   seq_11.{$EXTENSION}
│         │   seq_12.{$EXTENSION}
│         │   ...
│
└───speaker2
    └───...
         │   seq_21.{$EXTENSION}
         │   seq_22.{$EXTENSION}
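
For illustration only, here is a minimal sketch of how a tree like the one above could be scanned and split. It is not the repository's own tooling. It assumes one top-level directory per speaker with audio files nested anywhere below it, which is how the unpacked LibriSpeech subsets are organized (`<speaker_id>/<chapter_id>/<file>.flac`). The helper names `list_sequences` and `split_train_val`, the `.flac` default, and the idea of identifying a sequence by its file name without the extension are all assumptions made for this example; the list format the training script actually expects should be checked against the repository.

```python
import random
from pathlib import Path


def list_sequences(root, extension=".flac"):
    """Map each top-level speaker directory to the sorted stems of its audio files.

    Hypothetical helper: assumes root/<speaker>/.../<name><extension>.
    """
    root = Path(root)
    by_speaker = {}
    for speaker_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        # rglob descends through any intermediate directories (e.g. chapters).
        stems = sorted(f.stem for f in speaker_dir.rglob(f"*{extension}"))
        if stems:
            by_speaker[speaker_dir.name] = stems
    return by_speaker


def split_train_val(by_speaker, val_fraction=0.1, seed=0):
    """Shuffle all sequence names and hold out a fraction for validation.

    Hypothetical helper: returns (train_names, val_names), both sorted.
    """
    names = sorted(n for stems in by_speaker.values() for n in stems)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(names)
    n_val = int(len(names) * val_fraction)
    return sorted(names[n_val:]), sorted(names[:n_val])
```

A possible use, again under the same assumptions, is to write each returned list to a text file with one sequence name per line:

```python
by_speaker = list_sequences("PATH_AUDIO_FILES", extension=".flac")
train_names, val_names = split_train_val(by_speaker, val_fraction=0.1)
Path("train_split.txt").write_text("\n".join(train_names) + "\n")
Path("val_split.txt").write_text("\n".join(val_names) + "\n")
```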
