Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Slow training process #11

Closed
XuyangBai opened this issue Dec 3, 2019 · 4 comments
Closed

Slow training process #11

XuyangBai opened this issue Dec 3, 2019 · 4 comments

Comments

@XuyangBai
Copy link

Hi @chrischoy Thanks for your sharing. I tried your code on 3DMatch dataset using the default configuration and found the training process is very slow. Specifically it took about one and a half hour for one epoch. (as you mentioned in the paper, you trained FCGF for 100 epochs, which means more than one week in my configuration). The GPU memory it took is only less than 5000 MB and GPU utility is less than 10% but CPU utility is high. I wonder is it normal situation and what's the most time-consuming part ? I use RTX 2080Ti to train the model.

Thanks a lot.

@chrischoy
Copy link
Owner

The MinkowskiEngine was significantly updated recently which speeds up the inference quite a bit. The update caches some data on the GPU which also improves the GPU utilization.

Could you try the latest version of the MinkowskiEngine?

@tangbohu
Copy link

tangbohu commented Apr 30, 2020

Hi @chrischoy Thanks for your sharing. I tried your code on 3DMatch dataset using the default configuration and found the training process is very slow. Specifically it took about one and a half hour for one epoch. (as you mentioned in the paper, you trained FCGF for 100 epochs, which means more than one week in my configuration). The GPU memory it took is only less than 5000 MB and GPU utility is less than 10% but CPU utility is high. I wonder is it normal situation and what's the most time-consuming part ? I use RTX 2080Ti to train the model.

Thanks a lot.

Hi @XuyangBai, Have you solved the efficiency issue? I adopted MinkowskiEngine v0.4.2, but the training is still quite slow.

@XuyangBai
Copy link
Author

@tangbohu Sorry I haven't solved it, the training is still slow.

@gitouni
Copy link

gitouni commented Sep 6, 2022

I have updated the ME to v0.5 but met merge_sort CUDA error issue#67.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants