Replies: 3 comments
-
Prefer float16 to reduce VRAM usage.
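For example, with openai-whisper this is just a matter of the fp16 option on transcribe (a minimal sketch; the model name and audio path are placeholders):

```python
import whisper

# Model name and audio path are placeholders; substitute your own.
model = whisper.load_model("medium", device="cuda")

# fp16=True (already the default) keeps inference in half precision,
# which needs noticeably less VRAM than running in float32.
result = model.transcribe("audio.mp3", fp16=True)
print(result["text"])
```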
-
Having a similar problem when running (loading) more than one Whisper model (e.g. for switching between different inference models). It would be helpful if the VRAM / GPU memory could be freed between two model loads! A possible solution like the sketch below does not have a big effect on the VRAM. Does anyone know how to free the VRAM occupied by a Whisper model that is no longer needed?
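(This is roughly the cleanup pattern I mean; the model names and audio path are just placeholders:)

```python
import gc
import torch
import whisper

model = whisper.load_model("medium", device="cuda")
result = model.transcribe("audio.mp3")

# Drop every Python reference to the model, collect the garbage,
# then ask PyTorch to release its cached CUDA blocks.
del model
gc.collect()
torch.cuda.empty_cache()

# Load the next model into the (hopefully) freed memory.
model = whisper.load_model("large-v2", device="cuda")
```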
-
Any ideas here? Does no one else have the need to do garbage collection or something similar?
-
Hi! I'm trying to transcribe audio on my 3060, with 12GB VRAM, running on pop OS.
I tried using this fine-tuned model for transcribing Cantonese, but I kept running out of memory. I watched the memory stats go from 500MB to 11GB before the crash, so I just assumed I didn't have enough memory for it.
But then I tried the original medium, large, and large-v2 models (which this model is based on), and they all worked perfectly fine without a problem. I've also tried the fine-tuned model on CPU and it runs without a problem (though slower, of course).
As far as I know, this model should be the same size as large-v2, so I have no idea what went wrong there.
Any suggestions would be very welcome. Thanks!
The error in question, though I'm sure you've seen it before:
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacty of 11.74 GiB of which 40.25 MiB is free. Including non-PyTorch memory, this process has 11.06 GiB memory in use. Of the allocated memory 10.68 GiB is allocated by PyTorch, and 289.59 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
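For completeness, the error's max_split_size_mb hint and the float16 advice from the first reply would look roughly like this (a sketch assuming the fine-tuned checkpoint is loaded through Hugging Face transformers; the model id and audio path are placeholders):

```python
import os

# Follow the hint in the error message: cap the allocator's split size
# to reduce fragmentation. Must be set before CUDA is initialised.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"

import torch
from transformers import pipeline

# Placeholder model id -- substitute the actual fine-tuned Cantonese checkpoint.
asr = pipeline(
    "automatic-speech-recognition",
    model="someuser/whisper-large-v2-cantonese",
    torch_dtype=torch.float16,  # load weights in half precision to save VRAM
    device="cuda:0",
)

print(asr("audio.mp3")["text"])
```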