Slow training process #11

XuyangBai · 2019-12-03T08:33:54Z

Hi @chrischoy, thanks for sharing the code. I trained on the 3DMatch dataset with the default configuration and found training to be very slow: one epoch takes about an hour and a half. Since the paper says FCGF was trained for 100 epochs, that would take roughly a week in my setup. GPU memory usage is under 5000 MB and GPU utilization is below 10%, while CPU utilization is high. Is this normal, and which part is the most time-consuming? I am training on an RTX 2080 Ti.

Thanks a lot.
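For anyone trying to narrow this down: low GPU utilization together with high CPU utilization often points to the input pipeline rather than the network, though that is only a guess here. A minimal sketch for checking it, using only the standard library, is to time how long the loop waits for each batch versus how long the training step takes. The names `profile_loop`, `slow_batches`, and the `sync` hook are hypothetical and not part of the FCGF code.

```python
import time


def profile_loop(batches, step, max_iters=50, sync=None):
    """Return (seconds spent waiting for batches, seconds spent in step)."""
    data_time = 0.0
    step_time = 0.0
    it = iter(batches)
    for _ in range(max_iters):
        t0 = time.perf_counter()
        try:
            batch = next(it)  # time spent here is data loading / collation
        except StopIteration:
            break
        t1 = time.perf_counter()
        step(batch)  # forward / backward / optimizer step
        if sync is not None:
            # GPU kernels run asynchronously; pass e.g. torch.cuda.synchronize
            # so the step timing covers the actual GPU work.
            sync()
        t2 = time.perf_counter()
        data_time += t1 - t0
        step_time += t2 - t1
    return data_time, step_time


def slow_batches(n, delay=0.01):
    """Toy stand-in for a CPU-bound data loader (assumption for the demo)."""
    for i in range(n):
        time.sleep(delay)
        yield i


if __name__ == "__main__":
    data_t, step_t = profile_loop(slow_batches(5), lambda batch: None)
    print(f"data: {data_t:.3f}s  step: {step_t:.3f}s")
```

If the data time dominates, the bottleneck is on the CPU side (loading, augmentation, or batch collation), and a faster GPU will not help.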

chrischoy · 2019-12-24T21:59:32Z

MinkowskiEngine was significantly updated recently, which speeds up inference quite a bit. The update caches some data on the GPU, which also improves GPU utilization.

Could you try the latest version of the MinkowskiEngine?
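To confirm which version is actually installed before and after upgrading, something like the following should work. It is a sketch that assumes the package was installed under the distribution name `MinkowskiEngine` and that Python 3.8+ is in use; `installed_version` is a hypothetical helper.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string, or None if it is not installed."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Assumption: the distribution is registered as "MinkowskiEngine".
print(installed_version("MinkowskiEngine"))
```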

tangbohu · 2020-04-30T07:31:45Z

Hi @XuyangBai, have you solved the efficiency issue? I am using MinkowskiEngine v0.4.2, but training is still quite slow.

XuyangBai · 2020-05-01T03:03:56Z

@tangbohu Sorry, I haven't solved it. Training is still slow.

gitouni · 2022-09-06T06:25:16Z

I updated MinkowskiEngine to v0.5 but ran into the merge_sort CUDA error from issue #67.

XuyangBai closed this as completed Feb 7, 2020
