Add wavernn example pipeline #749
Conversation
```python
bits = 16 if self.mode == 'MOL' else self.n_bits

x = (x + 1.) * (2 ** bits - 1) / 2
x = torch.clamp(x, min=0, max=2 ** bits - 1)

return mel.squeeze(0), x.int().squeeze(0)
```
This converts the representation of a waveform from [-1, 1] to a 16-bit integer representation. For instance, this is already done in load_wav. Since this is an important step and can be generalized, let's make it into a function within torchaudio. One point of discussion is whether we add it directly to WaveRNN.
This function has been added as `normalized_waveform_to_bits` in processing.py.
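As a rough illustration of what such a conversion does (a pure-Python sketch, not the actual `normalized_waveform_to_bits` implementation in processing.py, which operates on torch tensors):

```python
def normalized_waveform_to_bits_sketch(samples, bits=16):
    """Map samples in [-1.0, 1.0] to integers in [0, 2**bits - 1].

    Illustrative sketch only; names and details here are hypothetical.
    """
    max_val = 2 ** bits - 1
    out = []
    for x in samples:
        q = (x + 1.0) * max_val / 2.0         # rescale [-1, 1] -> [0, max_val]
        q = min(max(q, 0.0), float(max_val))  # clamp, mirroring torch.clamp
        out.append(int(q))                    # truncate to integer, like x.int()
    return out

print(normalized_waveform_to_bits_sketch([-1.0, 0.0, 1.0]))  # [0, 32767, 65535]
```

With `bits=16`, silence (0.0) lands on 32767 and the extremes map to 0 and 65535, matching the clamp-and-scale arithmetic in the snippet above.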
Codecov Report
```diff
@@            Coverage Diff             @@
##           master     #749      +/-   ##
==========================================
+ Coverage   89.87%   89.88%   +0.01%
==========================================
  Files          34       34
  Lines        2666     2660       -6
==========================================
- Hits         2396     2391       -5
+ Misses        270      269       -1
```
Continue to review full report at Codecov.
Force-pushed from 210945c to 9429ff0
By the way, can you add a README.md discussing the pipeline?
It'd be nice to get a baseline by comparing the error you get here to the output obtained by Griffin-Lim, say, and in other norms too, e.g. L^1 and L^2.
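For reference, L^1 and L^2 errors between a reconstruction and the original waveform can be computed as below (a minimal pure-Python sketch; in the actual pipeline one would compare torch tensors, e.g. WaveRNN output against a Griffin-Lim reconstruction):

```python
import math

def waveform_errors(reference, estimate):
    """Return (L1, L2) errors between two equal-length sample sequences."""
    assert len(reference) == len(estimate)
    diffs = [r - e for r, e in zip(reference, estimate)]
    l1 = sum(abs(d) for d in diffs)            # L^1 norm of the difference
    l2 = math.sqrt(sum(d * d for d in diffs))  # L^2 norm of the difference
    return l1, l2

print(waveform_errors([1.0, 2.0], [0.0, 0.0]))  # (3.0, 2.236...)
```

Reporting both norms gives a sense of whether errors are spread out (L^1 close to L^2 per sample) or concentrated in a few large deviations (L^2 dominated by outliers).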
Since this is not the original WaveRNN model, I'd recommend renaming it to "FatchordWaveRNN" or something similar.
Force-pushed from 9429ff0 to bfbf39f
Force-pushed from dc0fd1b to 04bfe24
Force-pushed from 79b653b to 6f8660a
LGTM. Minor things to address:
- fix JIT in WaveRNN in a separate pull request (Fix output type of upsampling #801)
- change two command-line parameters
- fix the default-value format in the docstring of the WaveRNN model (Update form of default value in docstring #802)
…ials Fix formatting and clean up tutorial on quantized transfer learning
Co-authored-by: Shen Li <[email protected]>
This is a reference example using the WaveRNN model to train on LJSpeech. The structure is inspired by #632 and WaveRNN.
There are at least a few more things to do:
Add torchaudio transforms on mel-spectrogram. Related to #446
Stack:
- Add MelResNet Block #705, #751
- Add Upsampling Block #724
- Add WaveRNN Model #735
- Add example pipeline with WaveRNN #749
Remove underscore of wavernn model #810
cc @cpuhrsch @zhangguanheng66