NeuroVoz: a Castillian Spanish corpus of Parkinsonian speech 2,977 audio files #26

aguerrerolopez · 2024-11-07T17:58:18Z

Hi!

You should add NeuroVoz dataset:
https://zenodo.org/records/13647600

is one of the most used datasets in Parkinsonian speech recognition. It contains 2977 audio files including 54 individuals diagnosed with Parkinson's Disease and 58 healthy controls, the NeuroVoz dataset offers a rich compilation of speech recordings. The dataset is meticulously curated to include a variety of speech tasks—ranging from sustained vowel phonations and diadochokinetic (DDK) tests to 16 structured listen-and-repeat utterances and spontaneous monologues. It also includes both manually transcribed listen-and-repeat tasks and Whisper-automated transcriptions for monologues.

Moreover, there is a paper explaining the details of the dataset and also a quick guide on how to use (with a github repo included).
Paper explanining the database:
https://arxiv.org/abs/2403.02371
Github repo of the database and how to use it:
https://github.com/BYO-UPM/Neurovoz_Dababase

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NeuroVoz: a Castillian Spanish corpus of Parkinsonian speech 2,977 audio files #26

NeuroVoz: a Castillian Spanish corpus of Parkinsonian speech 2,977 audio files #26

aguerrerolopez commented Nov 7, 2024

NeuroVoz: a Castillian Spanish corpus of Parkinsonian speech 2,977 audio files #26

NeuroVoz: a Castillian Spanish corpus of Parkinsonian speech 2,977 audio files #26

Comments

aguerrerolopez commented Nov 7, 2024