Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add PicoAudio Model #249

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

Conversation

zeyuxie29
Copy link

@zeyuxie29 zeyuxie29 commented Jul 19, 2024

✨ Description

The PR adds the PicoAudio into the Amphion toolkit.

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation

🚧 Related Issues

[List the issue numbers related to this PR]

👨‍💻 Changes Proposed

  • Added the dataloader and model implement of PicoAudio into models/temporally_controllable_tta
  • Added the training and inference scripts of PicoAudio into models/temporally_controllable_tta

🧑‍🤝‍🧑 Who Can Review?

@zhizhengwu @HeCheng0625

🛠 TODO

✅ Checklist

  • Code has been reviewed
  • Code complies with the project's code standards and best practices
  • Code has passed all tests
  • Code does not affect the normal use of existing features
  • Code has been commented properly
  • Documentation has been updated (if applicable)
  • Demo/checkpoint has been attached (if applicable)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant