New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add PicoAudio Model #249

Open

zeyuxie29 wants to merge 1 commit into open-mmlab:main from zeyuxie29:PicoAudio

zeyuxie29 commented Jul 19, 2024 •

edited

Loading

✨ Description

The PR adds the PicoAudio into the Amphion toolkit.

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation

repo: https://github.com/zeyuxie29/PicoAudio
paper: https://arxiv.org/abs/2407.02869v2
demo: https://zeyuxie29.github.io/PicoAudio.github.io/
huggingface spcae: https://huggingface.co/spaces/amphion/PicoAudio

🚧 Related Issues

[List the issue numbers related to this PR]

👨‍💻 Changes Proposed

Added the dataloader and model implement of PicoAudio into models/temporally_controllable_tta
Added the training and inference scripts of PicoAudio into models/temporally_controllable_tta

🧑‍🤝‍🧑 Who Can Review?

@zhizhengwu @HeCheng0625

🛠 TODO

✅ Checklist

Code has been reviewed
Code complies with the project's code standards and best practices
Code has passed all tests
Code does not affect the normal use of existing features
Code has been commented properly
Documentation has been updated (if applicable)
Demo/checkpoint has been attached (if applicable)


          Add files via upload

97c0398

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet