Code accompanying the paper "Towards multi-task learning for speech and speaker recognition" (Interspeech 2023).
See paper_experiments.md for commands to reproduce results.
See here for some model checkpoints.
See here for VoxCeleb 1 and VoxCeleb2 ASR labels with Whisper.
Copy .env.example to .env and fill in the values accordingly.
See data_utility for instructions on preparing the datasets (sre2008, hub5_2000, voxceleb and librispeech).
Install dependencies with poetry update.
Run experiments with run_mtl_disjoint.py, run_mtl_joint.py, run_speaker.py and run_speech.py.
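The setup and run steps above can be sketched as a single shell session. This is a rough outline, not a verbatim recipe: it assumes poetry is installed and that you run it from the repository root, and the `setup` helper is a hypothetical wrapper around the README steps (the experiment scripts' flags are not shown; see paper_experiments.md for the exact commands).

```shell
# Hypothetical one-time setup, wrapping the steps from this README.
# Assumes: poetry installed, current directory is the repository root.
setup() {
  if [ -f .env.example ] && command -v poetry >/dev/null 2>&1; then
    cp -n .env.example .env   # copy the template; edit .env and fill it in
    poetry update             # resolve and install dependencies
  else
    echo "skipping: run from the repo root with poetry installed"
    return 1
  fi
}

setup || true
# Once setup succeeds, experiments are launched through poetry, e.g.:
#   poetry run python run_speaker.py
#   poetry run python run_mtl_joint.py
```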
You can cite this work as:
@INPROCEEDINGS{vaessen2023mtl,
  author={Vaessen, Nik and van Leeuwen, David A.},
  booktitle={Interspeech 2023},
  title={Towards multi-task learning for speech and speaker recognition},
  year={2023},
}