Skip to content

hyama5/vae_align

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

VAE-based Phoneme Alignment Using Gradient Annealing and SSL Acoustic Features

Accepted to INTERSPEECH 2024 arXiv preprint

Sample code will be available soon.

Alignment examples

  • annotated: Manually annotated phoneme boundaries in the corpus
  • Proposed: Predicted boundaries using proposed method
  • MFA: Predicted boundaries using Montreal Forced Aligner
  • CTC: Predicted boundaries using CTC forced alignment
  • OTA: Predicted boundaries using "One TTS alignment to rule them all"

CSJ dataset

example1

example2

example3

TIMIT dataset

timit_example

Buckeye dataset

buckeye_example

About

Alignment examples for Interspeech 2024

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published