config parameter #29

Open
vinson2233 opened this issue Oct 27, 2021 · 0 comments
Hi, I want to train MelNet on my own dataset.
There are some audio settings that I still don't understand, since I'm very new to signal processing and speech. Could someone explain what the following settings mean, or point me to a reference?

audio:
  sr: 16000
  duration: 6.0
  n_mels: 180
  hop_length: 180
  win_length: 1080
  n_fft: 1080
  num_freq: 541
  ref_level_db: 20.0
  min_level_db: -80.0
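
From what I could gather so far, these look like conventional STFT and mel-spectrogram settings. Below is a rough sketch of the arithmetic under that assumption. It is not taken from the MelNet code: the helper names `derived` and `normalize_db` are made up for illustration, the frame count assumes a centered (padded) STFT, and the dB normalization follows a common Tacotron-style convention that this repo may or may not use.

```python
import math

# Values copied from the config above.
audio = {
    "sr": 16000, "duration": 6.0, "n_mels": 180,
    "hop_length": 180, "win_length": 1080, "n_fft": 1080,
    "num_freq": 541, "ref_level_db": 20.0, "min_level_db": -80.0,
}

def derived(cfg):
    """Quantities implied by the config (hypothetical helper)."""
    sr = cfg["sr"]
    n_samples = int(sr * cfg["duration"])  # samples per training clip
    return {
        "n_samples": n_samples,
        # Time step between consecutive spectrogram frames, in ms.
        "hop_ms": 1000.0 * cfg["hop_length"] / sr,
        # Length of each analysis window, in ms.
        "win_ms": 1000.0 * cfg["win_length"] / sr,
        # A real-valued FFT of size n_fft yields n_fft // 2 + 1 bins.
        "num_freq": cfg["n_fft"] // 2 + 1,
        # Assumption: centered STFT, so frames = 1 + n_samples // hop.
        "n_frames": 1 + n_samples // cfg["hop_length"],
    }

def normalize_db(amplitude, cfg):
    """Assumed convention: shift by ref_level_db, then map
    [min_level_db, 0] dB onto [0, 1] with clipping."""
    db = 20.0 * math.log10(max(amplitude, 1e-5)) - cfg["ref_level_db"]
    scaled = (db - cfg["min_level_db"]) / -cfg["min_level_db"]
    return min(max(scaled, 0.0), 1.0)

if __name__ == "__main__":
    for name, value in derived(audio).items():
        print(name, value)
```

If this reading is right, `num_freq: 541` is simply `n_fft // 2 + 1`, the hop is 11.25 ms, the window is 67.5 ms, and `n_mels: 180` would be the number of mel bands those 541 linear bins get pooled into.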

Thanks in advance
