Memory leak #180

Open
filippocarone opened this issue Jan 2, 2025 · 65 comments
Labels
bug (Something isn't working) · docker (Related to docker)

Comments

@filippocarone

hawthorne_lettera_scarlatta.epub.gz

When processing this file using the docker container, memory utilization grows continuously. Command line:

/ebook2audiobook.sh --headless --ebook ../hawthorne_lettera_scarlatta.epub --language ita

@DrewThomasson
Owner

If you're willing, I'd also love to see whether this is an issue purely in the Docker image

Or also in the code in general

Try running

ebook2audiobook.sh

directly, and see if this issue also happens through that

@DrewThomasson
Owner

Please include any logs and other details as well.

No amount of detail is too much for us 👍

DrewThomasson added the bug (Something isn't working) and docker (Related to docker) labels on Jan 2, 2025
@DrewThomasson
Owner

Say....

Are you running into issues with the Docker image after running multiple books through it?

And is it just showing this increase in RAM usage after each book is processed in sequence, without wiping and restarting the Docker container from the base image? 🤔

@DrewThomasson
Owner

DrewThomasson commented Jan 2, 2025

Testing right now with this:

alice_in_wonderland.txt

In the GUI with standard settings.

Docker was launched with an upper (CPU) memory limit of 4 GB for my test, using this command:

docker run -it -p 7860:7860 --platform=linux/amd64 --memory=4g athomasson2/ebook2audiobook:latest python app.py

Edit: I am testing with this book because there are errors with your .epub file; it is seen as a directory for some reason.

@DrewThomasson
Owner

DrewThomasson commented Jan 2, 2025

seems to be going.... well XD

image

It's BARELY hanging on right now but still progressing.

log_so_far.txt

Update: still going strong

image

log_so_far2.txt

Update: still going strong

image

log_so_far3.txt

@DevonGrandahl

DevonGrandahl commented Jan 3, 2025

This has been happening with every novel-length book I try.

  • Docker on Windows 10
  • Mostly using a ~2k sentence book
  • Mostly using Bryan Cranston's voice
  • Mid/low-end mini PC w/ 8gb of available RAM.

It fails consistently after a few hours (getting about 4-6% of the way through) once it hits the memory threshold. Unrestricted or limited to 4 GB, it doesn't seem to matter. Logs don't show anything interesting; it just dies mid-sentence.

I've reverted to using the piper-TTS image for now, which is still awesome. I'll comment if I figure out anything useful.

@DrewThomasson
Owner

Interesting

Yeah that's weird then

@DrewThomasson
Owner

Even if you turn off sentence splitting?

@DrewThomasson
Owner

@DevonGrandahl

Any chance you could show us the book you're using?

@DevonGrandahl

Even if you turn off sentence splitting?

Trying this now; I previously misread it as turning sentence splitting on.

The book is Penpal by Dathan Auerbach. I own a copy. What's the best way for me to get it to you?

@DrewThomasson
Owner

DrewThomasson commented Jan 3, 2025

@DevonGrandahl

Discord

Don't want anyone thinking we're distributing books to the public illegally

@DevonGrandahl

No luck with text splitting off and a 4gb memory limit. It crashed this time at 3.8%.

@DrewThomasson
Owner

DrewThomasson commented Jan 3, 2025

Anyone who has issues with this, keep posting ✨🫶🏻

In the meantime, here is a helpful list of legacy versions that might work for you.

Other LEGACY versions of ebook2audiobook that might not have this issue (I'm not updating them, though, as they will eventually be integrated):

Legacy Ebook2Audiobook v1.0

Legacy Ebook2AudiobookpiperTTS

Legacy Ebook2AudiobookStyleTTS

Legacy Ebook2AudiobookEspeak

@ROBERT-MCDOWELL
Collaborator

@DevonGrandahl
Could you provide:

  • the next text after the last sentence converted (see the terminal when you run eb2ab)
  • the CPU and GPU of your PC

@DevonGrandahl

Sure! It doesn't fail at the same spot every time.

The last lines in the logs:

2025-01-02 21:20:49 94/2396 Sentence: Small towns lack many of the luxuries of larger towns or cities; what few stores there are close down early, 
2025-01-02 21:20:49 
2025-01-02 21:21:47 
Processing 3.88%: : 94/2396

The next line is:

traveling events don’t stop there because they probably missed your small dot on the map, and there aren’t many police or hospitals at your disposal.

The PC is a ProDesk mini PC I use as a server.
Processor: Intel(R) Core(TM) i7-6700T CPU @ 2.80GHz, 2808 Mhz, 4 Core(s), 8 Logical Processor(s)
RAM: 16gb (8gb available to Docker)
GPU: Intel HD Graphics 530 (Integrated)

@ROBERT-MCDOWELL
Collaborator

Ha, you said it doesn't fail at the same spot, right? Every time it's random?

@DevonGrandahl

DevonGrandahl commented Jan 3, 2025

Yep, seems to be random. It's never made it past ~8%.

Trying the legacy v1.0 image now.

Update: the v1.0 image just passed this sentence in maybe 1/3rd the time. Using Attenborough and no memory limit.

Update: Aaaaand it crashed. Went much faster, but still crashed when memory topped out.

@ROBERT-MCDOWELL
Collaborator

So I don't think it's related to ebook2audiobook, but more to how your OS is managing Docker.
It could also be a RAM failure...

@DevonGrandahl

RAM failure would be a strange thing to happen to multiple people at the same time trying to run the app. Plenty of apps run inside docker with no issue, so I'm also not sure about the Windows/Docker management issue. Could be, though!

I tried mounting Docker volumes for the tmp & audiobook directories with no luck, but I have a hunch I did that wrong. Will try again, since that could help with RAM usage.

@ROBERT-MCDOWELL
Collaborator

ROBERT-MCDOWELL commented Jan 3, 2025

Failure is maybe not the right word; it's more about how the OS is managing the Docker RAM...
Maybe there is also something else outside Docker causing trouble for your Docker setup.
Try rebooting with the minimum of services using RAM and virtual memory, then try again and see if it's better.

@filippocarone
Author

The memory leak also happens when running in native mode. Next I'll try with text splitting off.

@ROBERT-MCDOWELL
Collaborator

I don't think it's related to enable_text_splitting; it's more likely a coqui-tts issue... If you say it fails at around 8%, then there is something somewhere where memory is not being freed...

@filippocarone
Author

Attaching logs and a couple of screenshots of memory utilization (see the clock at the top left of the screen).

log.txt
Screenshot_20250104_080709_Termius
Screenshot_20250104_080001_Termius

@ROBERT-MCDOWELL
Collaborator

OK, there is already something wrong with the processes running. In any case there should only be 3 or 4 processes, unless several users are on it, which would explain your memory "leak": not a leak, just a need for more RAM because of more users...

@filippocarone
Author

Those different PIDs are actually threads of the same process. Only one instance is running, by just one user - this is a desktop computer. What I wanted to show with the 2 screenshots is that in just 7 minutes RAM usage has increased by ca. 2.6 GB (from 5.3 to 7.9).
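
To put numbers on that growth per sentence rather than watching the clock, here is a minimal logging sketch that could be dropped into the conversion loop (assumes psutil is installed; log_rss and sentence_number are hypothetical names, not part of ebook2audiobook):

import os

import psutil

_proc = psutil.Process(os.getpid())

def log_rss(tag=''):
    # Print the resident set size (RSS) of the current process in MB.
    rss_mb = _proc.memory_info().rss / (1024 * 1024)
    print(f'[mem] {tag}: RSS = {rss_mb:.1f} MB')

# Hypothetical usage inside the sentence loop:
# log_rss(f'after sentence {sentence_number}')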

@ROBERT-MCDOWELL
Collaborator

ROBERT-MCDOWELL commented Jan 4, 2025

They shouldn't be threads, since ebook2audiobook runs with multiprocessing and is NOT supposed to allow threading. So a race occurs, which could explain the increase in your RAM. Now we need to understand why threads are running...
Are you on Windows 11?

@filippocarone
Author

I'm running on Linux, Ubuntu 24.10.

@ROBERT-MCDOWELL
Collaborator

OK, try to run a conversion, then check the PID of each thread and try to kill all but one with kill -SIGTERM, then provide the log.
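
If it helps, a minimal sketch (assuming psutil is available; 12345 is a placeholder for the main ebook2audiobook PID) to see whether those extra entries are child processes or threads before killing anything:

import psutil

# Placeholder: the PID of the main ebook2audiobook process from ps/top.
main = psutil.Process(12345)

# Child processes (what multiprocessing would normally create):
for child in main.children(recursive=True):
    print('child process', child.pid, child.name())

# Threads inside the main process (htop on Linux can show these as
# separate entries, which look like extra PIDs):
print('thread count:', main.num_threads())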

@ROBERT-MCDOWELL
Collaborator

OK, so the issue is more complex... Any chance you have another machine to run the same test on?

@DevonGrandahl

DevonGrandahl commented Jan 5, 2025

Just replicated the issue on the Google Colab. Log attached.

  • ~2k sentence ebook (Penpal, but I'll switch to a Project Gutenberg book for testing going forward)
  • Bryan Cranston voice
  • Using T4 runtime
  • Unaltered (did not remove tqdm or refreshes)
  • Took maybe 30 minutes to crash out
    image
    e2aLog_1_5_25.txt

@ROBERT-MCDOWELL
Collaborator

ROBERT-MCDOWELL commented Jan 5, 2025

Try replacing this function with this one:

def convert_sentence_to_audio(params, session):
    try:
        if session['cancellation_requested']:
            #stop_and_detach_tts(params['tts'])
            print('Cancel requested')
            return False
        generation_params = {
            "temperature": session['temperature'],
            "length_penalty": session["length_penalty"],
            "repetition_penalty": session['repetition_penalty'],
            "num_beams": int(session['length_penalty']) + 1 if session["length_penalty"] > 1 else 1,
            "top_k": session['top_k'],
            "top_p": session['top_p'],
            "speed": session['speed'],
            "enable_text_splitting": session['enable_text_splitting']
        }
        if params['tts_model'] == 'xtts':
            if session['custom_model'] is not None or session['fine_tuned'] != 'std':
                with torch.no_grad():
                    output = params['tts'].inference(
                        text=params['sentence'],
                        language=session['metadata']['language_iso1'],
                        gpt_cond_latent=params['gpt_cond_latent'],
                        speaker_embedding=params['speaker_embedding'],
                        **generation_params
                    )
                    torchaudio.save(
                        params['sentence_audio_file'],
                        torch.tensor(output[audioproc_format]).unsqueeze(0),
                        sample_rate=24000
                    )
            else:
                with torch.no_grad():
                    params['tts'].tts_to_file(
                        text=params['sentence'],
                        language=session['metadata']['language_iso1'],
                        file_path=params['sentence_audio_file'],
                        speaker_wav=params['voice_file'],
                        **generation_params
                    )
        elif params['tts_model'] == 'fairseq':
            with torch.no_grad():
                params['tts'].tts_with_vc_to_file(
                    text=params['sentence'],
                    file_path=params['sentence_audio_file'],
                    speaker_wav=params['voice_file'].replace('_24khz','_16khz'),
                    split_sentences=session['enable_text_splitting']
                )
        if session['device'] == 'cuda':
            torch.cuda.empty_cache()
        if os.path.exists(params['sentence_audio_file']):
            return True
        print(f"Cannot create {params['sentence_audio_file']}")
        return False
    except Exception as e:
        raise DependencyError(e)

@filippocarone
Author

Memory still increases with torch.no_grad(), same as before.

@ROBERT-MCDOWELL
Collaborator

add
import gc

and at the end of the function add
collected = gc.collect()
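
In context, a sketch of where those two additions could sit in the function above; placing the call just before the final file check is one reading of "at the end of the function" (torch, os and DependencyError come from the existing module, as in the snippet above):

import gc  # added near the module's other imports

def convert_sentence_to_audio(params, session):
    try:
        # ... inference code exactly as in the snippet above ...
        if session['device'] == 'cuda':
            torch.cuda.empty_cache()
        # Added: force a garbage-collection pass after each sentence.
        collected = gc.collect()
        if os.path.exists(params['sentence_audio_file']):
            return True
        print(f"Cannot create {params['sentence_audio_file']}")
        return False
    except Exception as e:
        raise DependencyError(e)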

@filippocarone
Author

No changes with gc.collect(), memory still increases.

@ROBERT-MCDOWELL
Collaborator

If gc.collect() has no effect, then I'm afraid it's out of ebook2audiobook's scope and the bug is coming from a library.

@DevonGrandahl

FWIW, I am seeing this same behavior in the StyleTTS version of the project.

@ROBERT-MCDOWELL
Collaborator

OK, now let's target the origin more precisely.
Switch to CPU mode and tell me if it's better. If not, use another ebook sample in the same language, with around the same number of sentences. If it's still not better, use an English ebook sample with the same characteristics.
If it's still not OK, we will start commenting out parts of the code to localize the one causing the issue.

@filippocarone
Author

I'm running in cpu mode, I've used 3 different ebooks, 1 in English and 2 in Italian. Same behavior across all inputs.

@ROBERT-MCDOWELL
Collaborator

did you try on another computer?

@filippocarone
Author

I've created a Debian VM (Debian 12.8) in VirtualBox and will run it there. I'll keep you posted.

@ROBERT-MCDOWELL
Collaborator

Well, if you created the VM on the same computer, it will be the same...

@filippocarone
Author

filippocarone commented Jan 6, 2025

I don't have another computer, and the issue was replicated on Google Colab by @DevonGrandahl.

Memory is being allocated in TTS/tts/models/xtts.py, in the inference function, line 568:

            wavs.append(self.hifigan_decoder(gpt_latents, g=speaker_embedding).cpu().squeeze())

After this call, the allocated memory is never collected/freed, even when executing gc.collect().
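
One way to check whether tensors from that call are actually being retained between sentences is to count the tensors the garbage collector can still see; a minimal diagnostic sketch (count_live_tensors is a hypothetical helper, not project or coqui-tts code):

import gc

import torch

def count_live_tensors():
    # Count tensors still reachable via the garbage collector and the
    # total number of elements they hold; a number that keeps climbing
    # between sentences means references are being retained somewhere.
    count, numel = 0, 0
    for obj in gc.get_objects():
        try:
            if torch.is_tensor(obj):
                count += 1
                numel += obj.numel()
        except Exception:
            continue
    return count, numel

# Hypothetical usage: call it before and after one sentence of
# inference and compare the two results.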

@ROBERT-MCDOWELL
Collaborator

ROBERT-MCDOWELL commented Jan 6, 2025

So it's what I've said from the start: it's a coqui-tts issue, not ours... and as for fixing it, good luck with the fork we are working with... at our level we cannot change anything. The issue you are encountering apparently concerns only a few computers.
I suggest you open an issue there.
By the way, what torch version is installed?

@filippocarone
Author

These are the versions installed in the python_env which was created automatically by the application:

torch==2.5.1
torchaudio==2.5.1
coqui-tts==0.25.1
coqui-tts-trainer==0.2.0

@ROBERT-MCDOWELL
Collaborator

Those are all fine, so it really is a coqui-tts / torch memory-management issue... Weird that gc.collect() does not do anything, though.

@DevonGrandahl

Do we know that users are getting successful runs of full-length novels? Saying this is an issue with coqui-tts seems equivalent to saying this project is DOA, no? It's failing in the Colab, so it's not exactly isolated to a couple of machines.

Also, I want to reiterate that this same thing is happening with the old StyleTTS library, which feels like a big coincidence if it's outside of e2a's domain.

I'll report back if I get time to dig through the Python. Crossing my fingers this is fixable!

@ROBERT-MCDOWELL
Collaborator

@DevonGrandahl In native mode, yes, of course! For the Docker side, @DrewThomasson will tell you more.

@DevonGrandahl

I'm messing around with the native-mode code, and adding gc.collect() and a torch cache dump might be slowing memory growth, but I can confirm it's not fixed. I wonder about deleting the TTS object entirely after every chapter (or every x sentences) and rebuilding it. It's a dumb idea (it'll definitely be slower), but combined with garbage collection it might stop the unlimited memory growth?
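
For reference, a rough sketch of that idea, assuming a load_tts() factory function that rebuilds the model the same way the project originally built params['tts'] (load_tts and recycle_tts are hypothetical names, not project code):

import gc

import torch

def recycle_tts(params, session, load_tts):
    # Drop the only reference to the current TTS object, reclaim what
    # the garbage collector (and the CUDA cache, if used) will give
    # back, then rebuild the model from scratch. Slower, but it may
    # cap the unbounded memory growth.
    del params['tts']
    gc.collect()
    if session['device'] == 'cuda':
        torch.cuda.empty_cache()
    params['tts'] = load_tts(session)

# Hypothetical usage: call recycle_tts(...) after every chapter, or
# every N sentences.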

@ROBERT-MCDOWELL
Collaborator

ROBERT-MCDOWELL commented Jan 6, 2025

@DevonGrandahl Keep in mind that very few users have this problem; if it were a major issue, the entire project wouldn't be usable. The issue is elsewhere for sure. Are you saying it's the same in native mode? By the way, I just saw you are on Windows 10, not 11, right?

@DevonGrandahl

Yep, I'm seeing the same issue in native mode. The Google Colab is also in native mode, I think.

Correct, Windows 10.

@ROBERT-MCDOWELL
Collaborator

ROBERT-MCDOWELL commented Jan 6, 2025

Windows 10 could be the issue... I also saw users on the coqui-tts forums having the same problem on Windows 10, even with 16 GB of RAM,
but the original coqui-tts repo closed it as "won't fix". However, some found a way to stabilize memory by cutting sentences to a maximum of 100 characters and creating a new TTS instance for each one (not sure about that last part). Maybe with gc.collect() we can avoid creating a new TTS instance; I can also code a new condition to split sentences further in CPU mode. The thing I still don't get is that my test laptop is a 17-year-old Core 2 Duo with 4 GB of RAM, and some of my tests ran around 30,000 sentences on CPU; after 3 days the RAM usage was unchanged, maxed out plus virtual memory, but no crash, in native mode, on Windows 11.
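
For anyone who wants to experiment with that workaround, a minimal sketch of splitting a sentence into chunks of at most 100 characters on word boundaries (chunk_sentence is a hypothetical helper, not project code):

def chunk_sentence(sentence, max_chars=100):
    # Pack whole words into chunks no longer than max_chars, so each
    # TTS call receives a short piece of text. A single word longer
    # than max_chars still becomes its own (oversized) chunk.
    chunks, current = [], ''
    for word in sentence.split():
        candidate = f'{current} {word}'.strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks

# Example: chunk_sentence('Small towns lack many of the luxuries of larger towns or cities; ...')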

@filippocarone
Author

I tried creating a new TTS object for every sentence and then executing gc.collect(), but memory grew even faster. I also tried setting the params and session objects to None to see if something was holding references to other objects, but that did not reduce memory utilization either.

@ROBERT-MCDOWELL
Collaborator

Please read my comment above carefully.

@filippocarone
Author

filippocarone commented Jan 6, 2025

On the Debian 12.8 VM in VirtualBox, memory grows far more slowly (approximately 1 MB per sentence instead of tens or hundreds of MB per sentence as on Ubuntu 24.10). There could be something related to the kernel/OS that influences how memory is managed, resulting in a leak. This is weird.
Anyway, I'll try a full run and see how it behaves to the end.

@ROBERT-MCDOWELL
Collaborator

ROBERT-MCDOWELL commented Jan 6, 2025

Any VM is still dependent on the host OS, so Windows 10. Then Docker does the mapping, and the mapping behavior surely differs from one VM to another.

@DevonGrandahl

Just tried this on a Windows 11 gaming machine and the memory climb seems much more reasonable. Can't leave this running to see if it ever crashes, but it seems like it would probably work.

@filippocarone
Author

A VirtualBox VM is different from a Docker container in many respects, including memory management. My host is not Windows 10 but Ubuntu 24.10. Anyway, it reached 8 GB of RAM after 417 sentences, and the process was killed by the OS, as the VM has 9 GB of RAM allocated. So it still grew, just more slowly than directly on Ubuntu 24.10.

@ROBERT-MCDOWELL
Collaborator

Running a TTS AI in a VM is not reasonable, by the way... It's OK for testing, but not for production.
