The goal is to remove applause from concert audio recordings.
- MUSDB18-HQ, for high-quality music audio
- Free Sound, for random concert noises
- downsample all music audio from 44.1 kHz to 16 kHz
- convert stereo to mono by averaging the left and right channels
- split the audio into 5 s segments with 2.5 s overlap
- compute STFT features
- add random noise to the audio at different SNRs
- use a Bi-GRU model
- convert the STFT back to an audio signal
- reuse the phase of the noisy audio for the estimated audio
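
A few of the steps above can be sketched with NumPy alone. This is a minimal illustration, not the project's actual code: the function names, the constants, and the assumption that the model outputs a magnitude spectrogram are all hypothetical. Resampling and the STFT itself are left to a signal-processing library of choice.

```python
import numpy as np

SR = 16_000          # target sample rate from the notes
SEG_LEN = 5 * SR     # 5 s segments
HOP = SEG_LEN // 2   # 2.5 s overlap means a 2.5 s hop


def to_mono(stereo):
    """Average left and right channels; expects shape (n_samples, 2)."""
    return stereo.mean(axis=1)


def segment(signal, seg_len=SEG_LEN, hop=HOP):
    """Split a 1-D signal into overlapping segments, dropping a short tail."""
    starts = range(0, len(signal) - seg_len + 1, hop)
    return [signal[s:s + seg_len] for s in starts]


def mix_at_snr(clean, noise, snr_db):
    """Scale the noise so the clean-to-noise power ratio is snr_db, then add.

    Assumes both signals have the same length and the noise is not silent.
    """
    p_clean = np.mean(clean ** 2)
    p_noise = np.mean(noise ** 2)
    scale = np.sqrt(p_clean / (p_noise * 10 ** (snr_db / 10)))
    return clean + scale * noise


def apply_noisy_phase(est_mag, noisy_stft):
    """Pair an estimated magnitude with the phase of the noisy STFT.

    Assumption: the model predicts magnitudes only, so the phase has to be
    borrowed from the noisy input before inverting the STFT.
    """
    return est_mag * np.exp(1j * np.angle(noisy_stft))
```

For example, `mix_at_snr(music_segment, applause_segment, snr_db=5.0)` would produce one noisy training input, where both arguments are hypothetical 5 s arrays at 16 kHz. The complex array returned by `apply_noisy_phase` is what would be passed to an inverse STFT to get the cleaned waveform.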