config parameter #29

vinson2233 · 2021-10-27T04:16:35Z

Hi, I wanted to train the MelNet with my own dataset.
There are some audio setting that I still not understand since I'm very new to this signal processing/speech field. Can someone elaborate me or give me reference for me to understand what are the meaning of these setting :

audio:
  sr: 16000
  duration: 6.0
  n_mels: 180
  hop_length: 180
  win_length: 1080
  n_fft: 1080
  num_freq: 541
  ref_level_db: 20.0
  min_level_db: -80.0

Thanks in advance

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config parameter #29

config parameter #29

vinson2233 commented Oct 27, 2021

config parameter #29

config parameter #29

Comments

vinson2233 commented Oct 27, 2021