-
Notifications
You must be signed in to change notification settings - Fork 7.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
发票数据集训练报错:Out of memory error on GPU 0. Cannot allocate 14.406982MB memory on GPU 0, 10.746094GB memory has been allocated and available memory is only 15.562500MB. #10247
Comments
|
@livingbody 需要进一步实验看是否是就bs=1依旧超过11g显存 |
是的 |
能否在aistudio上分享项目给我,我查看下具体情况。 |
联系我微信:livingbody |
@livingbody @dizhenx 请问问题解决了吗? |
可以提供訓練的yml檔,才能瞭解你的設定,det model 中Eval batch_size_per_card只能設為1。 |
请问最后解决了吗 |
该issue长时间未更新,暂将此issue关闭,如有需要可重新开启。 |
下载的官方的发票数据集做训练。运行python tools/train.py -c ./fapiao/train_data/ser_vi_layoutxlm.yml -o Global.save_model_dir=./output/kie/ 报以下错误。batchsize和num_works都调为1了还是报错。单卡显存有11G,无其他程序占用。
Error Message Summary:
ResourceExhaustedError:
Out of memory error on GPU 0. Cannot allocate 14.406982MB memory on GPU 0, 10.746094GB memory has been allocated and available memory is only 15.562500MB.
Please check whether there is any other process using GPU 0.
If the above ways do not solve the out of memory problem, you can try to use CUDA managed memory. The command is
export FLAGS_use_cuda_managed_memory=false
.(at /paddle/paddle/fluid/memory/allocation/cuda_allocator.cc:95)
为啥只能使用1个gpu,不能设置多卡训练吗?
The text was updated successfully, but these errors were encountered: