Skip to content
Change the repository type filter

All

    Repositories list

    • Apache License 2.0
      03800Updated Jan 8, 2025Jan 8, 2025
    • UTMOSv2

      Public
      UTokyo-SaruLab MOS Prediction System
      Python
      MIT License
      1212900Updated Dec 9, 2024Dec 9, 2024
    • Python
      Other
      0100Updated Dec 2, 2024Dec 2, 2024
    • Modified transcriptions of YODAS dataset
      0400Updated Oct 26, 2024Oct 26, 2024
    • SaSLaW

      Public
      Dialogue Speech Corpus with Audio-visual Egocentric Information, "So, what are you Speaking, Listening, and Watching?"
      Python
      0700Updated Aug 13, 2024Aug 13, 2024
    • Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
      Python
      11400Updated Aug 8, 2024Aug 8, 2024
    • MIT License
      0700Updated Jun 17, 2024Jun 17, 2024
    • Coco-Nut

      Public
      Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
      02100Updated Jun 12, 2024Jun 12, 2024
    • UTMOS22

      Public
      UT-Sarulab MOS prediction system using SSL models
      Python
      MIT License
      1420210Updated Apr 11, 2024Apr 11, 2024
    • Python
      MIT License
      1400Updated Mar 28, 2024Mar 28, 2024
    • Visual onoma-to-wave official implementation
      Python
      MIT License
      0500Updated Mar 11, 2024Mar 11, 2024
    • Multi-lingual AudioCaps
      Apache License 2.0
      0800Updated Nov 20, 2023Nov 20, 2023
    • Python
      Apache License 2.0
      4621561Updated Nov 13, 2023Nov 13, 2023
    • xvector model on jtubespeech
      Python
      MIT License
      44300Updated Nov 5, 2023Nov 5, 2023
    • ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings (INTERSPEECH2023)
      HTML
      Apache License 2.0
      0000Updated May 24, 2023May 24, 2023
    • CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center (INTERSPEECH2023)
      HTML
      Apache License 2.0
      0000Updated May 24, 2023May 24, 2023
    • Python
      MIT License
      83250Updated Dec 4, 2022Dec 4, 2022
    • Python
      0100Updated Jun 16, 2022Jun 16, 2022
    • Lightweight speaker anonymization [IEEE SLT2021]
      Python
      MIT License
      112601Updated Jun 6, 2022Jun 6, 2022
    • fairseq

      Public
      Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
      Python
      MIT License
      6.4k000Updated Apr 4, 2022Apr 4, 2022
    • context labels and pronunciation data for JSUT corpus
      Other
      96800Updated Sep 2, 2021Sep 2, 2021
    • tdmelodic for open-jtalk
      12200Updated Aug 30, 2021Aug 30, 2021
    • BERT models for Japanese text.
      Python
      Apache License 2.0
      55000Updated May 1, 2021May 1, 2021
    • Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
      Python
      MIT License
      22400Updated Mar 23, 2021Mar 23, 2021