The clean voices were mainly gathered from LibriSpeech: an ASR corpus based on public domain audiobooks. I used as well some data from SiSec. The environmental noises were gathered from ESC-50 dataset or https://www.ee.columbia.edu/~dpwe/sounds/.
The clean voices were mainly gathered from LibriSpeech: an ASR corpus based on public domain audiobooks. I used as well some data from SiSec. The environmental noises were gathered from ESC-50 dataset or https://www.ee.columbia.edu/~dpwe/sounds/.