- Pattern Recognition and Human Languages Technology, Research Center
- Valencia, Spain
- https://www.prhlt.upv.es/david-gimeno/
- https://orcid.org/0000-0002-7375-9515
- in/david-gimeno-gómez-589a5526b
- https://scholar.google.com/citations?user=DVRSla8AAAAJ&hl=en
Pinned
- interpreting-ssl-parkinson-speech (Public): Official source code of the paper "Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson’s Diagnosis"
- cosmaadrian/multimodal-depression-from-video (Public): Official source code for the paper "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"
- joactr/AnnoTheia (Public): AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech. Its modules can be replaced to support different languages.
Python 26
- tailored-avsr (Public): Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
Python 10