NeuroVoz: a Castillian Spanish corpus of Parkinsonian speech 2,977 audio files #26

Open
aguerrerolopez opened this issue Nov 7, 2024 · 0 comments


Hi!

You should add the NeuroVoz dataset:
https://zenodo.org/records/13647600

It is one of the most used datasets in Parkinsonian speech recognition. It contains 2,977 audio files from 54 individuals diagnosed with Parkinson's Disease and 58 healthy controls. The speech tasks range from sustained vowel phonations and diadochokinetic (DDK) tests to 16 structured listen-and-repeat utterances and spontaneous monologues. It also includes manual transcriptions of the listen-and-repeat tasks and Whisper-automated transcriptions of the monologues.

There is also a paper describing the dataset in detail, and a quick guide on how to use it (with a GitHub repo included).
Paper explaining the database:
https://arxiv.org/abs/2403.02371
GitHub repo of the database and how to use it:
https://github.com/BYO-UPM/Neurovoz_Dababase
