Last PR before merge to main #57

ROBERT-MCDOWELL · 2024-11-20T05:08:59Z

this last PR makes me think it should be ok for the merge of your main branch.
we reached 1k stars, so it's time to bring them to the new version...
it will also bring certainly more contributors..

…book into v2.0

DrewThomasson · 2024-11-20T05:28:42Z

Yes yes agreed :D

I'll run a test, in Vietnamese and in English to make sure that it can successfully use the fairseq and XTTS model

And then double check the readme

And then merge 👌

ROBERT-MCDOWELL · 2024-11-20T14:27:47Z

I will push a new PR today, there are still things to change. about fairseq model we must still integrate it.

DrewThomasson · 2024-11-20T15:26:55Z

Yes yes yes, I was also thinking about setting a preferred tts model order so each language defaults to the best sounding compatible model.

Right now order being:

Best:XTTS----> Next Best:fairseq

ROBERT-MCDOWELL · 2024-11-20T15:43:40Z

I have to recheck the language_mapping again, wrong ISO codes.

ROBERT-MCDOWELL · 2024-11-20T15:49:03Z

the facebook/mm-tts is about 1TB :-| . so not good to import it all locally ...

DrewThomasson · 2024-11-20T15:52:01Z

lol yeah well just have them download the fairseq models on the fly,

It's only around 135 mb for each anyway

DrewThomasson · 2024-11-20T15:56:24Z

I have to recheck the language_mapping again, wrong ISO codes.

Oh yeah,

I think the iso code values are fine in the language_mapping it's just only sending the xtts format language codes for every language, and needs to be sending the iso style codes for specifically the fairseq model inference

You probs already 10 steps ahead of me on this though XD

DrewThomasson · 2024-11-20T16:02:35Z

oh yeah and probably good to have the faiseq models generate:

With voice cloning inference if voice is given
Without voice cloning inference if no voice is given (For faster inference if its using VITS for the voice cloning)

DrewThomasson · 2024-11-20T16:03:27Z

I'll run some fairseq speed testing on my end with and without voice cloning to double check 👌

ROBERT-MCDOWELL · 2024-11-20T16:13:13Z

oh yeah and probably good to have the faiseq models generate:

With voice cloning inference if voice is given

Without voice cloning inference if no voice is given (For faster inference if its using VITS for the voice cloning)

we can do it with

tts --out_path output/path/speech.wav --model_name "//<model_name>" --source_wav <path/to/speaker/wav> --target_wav <path/to/reference/wav>

DrewThomasson · 2024-11-20T16:27:27Z

Ah so voice conversion 👌

looks like that should work cause I was having trouble on my end with trying to pass it though in one command with a fairseq model and voice cloning in one command

Good to know you found a way then 👍

DrewThomasson · 2024-11-20T16:31:55Z

I'll see about finding a more automated way of generating a bunch more of the test ebook files

DrewThomasson · 2024-11-20T17:18:07Z

#58

?🙏

DrewThomasson · 2024-11-21T14:37:12Z

Might have some free time to try a go at implementing the fairseq model for the finaly 2.0 update in a bit, Unless your already working on it lol.
I just don't wanna get in the way with conflicting pushes if your already working on it 😅

ROBERT-MCDOWELL · 2024-11-21T20:47:43Z

I'm already working on it since 2 days now. preparing the repo for the 1142 languages... should be ok for tonight.
btw I saw vits2 coming into coqui-tts on a PR there... will see later.

DrewThomasson · 2024-11-21T21:43:35Z

kk 👍

oooo vits2? 👀 👀 👀

oh also

In the meantime should I fix the other read me languages to get them aligned with the ebook2audiobook.sh stuff or are you already set with that?

ROBERT-MCDOWELL · 2024-11-21T22:21:24Z

yes, the readme should be rebased. like adding the new options (--ebook-dir for ex) and behavior (no need to add True to the option, like --headless etc..), the video screenshot too :)

DrewThomasson · 2024-11-21T22:22:48Z

👍👍 I'll get on that 🫡

ROBERT-MCDOWELL added 3 commits November 19, 2024 21:02

various code and settings optimization, gui improvements.

0137fdf

Merge branch 'v2.0' of https://github.com/ROBERT-MCDOWELL/ebook2audio…

5f6f509

…book into v2.0

Merge branch 'DrewThomasson:v2.0' into v2.0

bc2819d

DrewThomasson approved these changes Nov 20, 2024

View reviewed changes

DrewThomasson merged commit abb9810 into DrewThomasson:v2.0 Nov 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Last PR before merge to main #57

Last PR before merge to main #57

ROBERT-MCDOWELL commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024

ROBERT-MCDOWELL commented Nov 20, 2024 •

edited

Loading

DrewThomasson commented Nov 20, 2024

ROBERT-MCDOWELL commented Nov 20, 2024

ROBERT-MCDOWELL commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024 •

edited

Loading

DrewThomasson commented Nov 20, 2024 •

edited

Loading

ROBERT-MCDOWELL commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024 •

edited

Loading

DrewThomasson commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024

DrewThomasson commented Nov 21, 2024

ROBERT-MCDOWELL commented Nov 21, 2024 •

edited

Loading

DrewThomasson commented Nov 21, 2024 •

edited

Loading

ROBERT-MCDOWELL commented Nov 21, 2024

DrewThomasson commented Nov 21, 2024

Last PR before merge to main #57

Last PR before merge to main #57

Conversation

ROBERT-MCDOWELL commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024

ROBERT-MCDOWELL commented Nov 20, 2024 • edited Loading

DrewThomasson commented Nov 20, 2024

ROBERT-MCDOWELL commented Nov 20, 2024

ROBERT-MCDOWELL commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024 • edited Loading

DrewThomasson commented Nov 20, 2024 • edited Loading

ROBERT-MCDOWELL commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024 • edited Loading

DrewThomasson commented Nov 20, 2024

DrewThomasson commented Nov 20, 2024

DrewThomasson commented Nov 21, 2024

ROBERT-MCDOWELL commented Nov 21, 2024 • edited Loading

DrewThomasson commented Nov 21, 2024 • edited Loading

kk 👍

oh also

ROBERT-MCDOWELL commented Nov 21, 2024

DrewThomasson commented Nov 21, 2024

ROBERT-MCDOWELL commented Nov 20, 2024 •

edited

Loading

DrewThomasson commented Nov 20, 2024 •

edited

Loading

DrewThomasson commented Nov 20, 2024 •

edited

Loading

DrewThomasson commented Nov 20, 2024 •

edited

Loading

ROBERT-MCDOWELL commented Nov 21, 2024 •

edited

Loading

DrewThomasson commented Nov 21, 2024 •

edited

Loading