9753 nan nan 0.00 nan nan 1698.001 16280.8 #23

Closed
mtli77 opened this issue Oct 21, 2020 · 2 comments

mtli77 commented Oct 21, 2020

Hi @XuyangBai

Sorry to bother you with the following question. While training on 3DMatch, the log shows this at step 9753:

Steps  desc_loss  det_loss  train_accuracy  d_pos  d_neg  time      memory
9753   nan        nan       0.00            nan    nan    1698.001  16280.8

What does it mean when desc_loss and det_loss are nan and train_accuracy drops to 0.00? Is this some kind of error in the training process?
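To pin down where the values first go bad, it can help to scan the training log for the first row with a non-finite entry. The sketch below assumes the log rows are whitespace-separated and follow the column order of the header shown above; the helper names `parse_log_line` and `non_finite_columns` are hypothetical and are not part of the project's code.

```python
import math

# Column names copied from the log header above; "Steps" is renamed to "step".
COLUMNS = ["step", "desc_loss", "det_loss", "train_accuracy",
           "d_pos", "d_neg", "time", "memory"]


def parse_log_line(line):
    """Parse one whitespace-separated log row into a dict (hypothetical helper)."""
    # Drop a trailing sentence period, as in the row pasted into the question.
    values = line.strip().rstrip(".").split()
    if len(values) != len(COLUMNS):
        raise ValueError("expected %d fields, got %d" % (len(COLUMNS), len(values)))
    row = {name: float(value) for name, value in zip(COLUMNS, values)}
    row["step"] = int(row["step"])
    return row


def non_finite_columns(row):
    """Return the names of float columns whose value is nan or +/-inf."""
    return [name for name, value in row.items()
            if isinstance(value, float) and not math.isfinite(value)]


if __name__ == "__main__":
    row = parse_log_line("9753 nan nan 0.00 nan nan 1698.001 16280.8.")
    print(row["step"], non_finite_columns(row))
```

Running the same check over every row of a log and stopping at the first non-empty result gives the step at which the losses first became nan, which is usually more informative than the last row alone.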

Thank you very much. I look forward to your reply!

@XuyangBai
Owner

Hi @Violetit, this looks very similar to the problem in here. You may want to check your TF and CUDA versions first.
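One way to gather the version information mentioned above is sketched here. It assumes TensorFlow is importable in the training environment and that `nvcc` is on the `PATH`; the function names `describe_tf_environment` and `nvcc_version_text` are hypothetical. Note that the toolkit reported by `nvcc` is not necessarily the CUDA runtime TensorFlow loads, so treat the output as a starting point rather than a definitive answer.

```python
import shutil
import subprocess


def describe_tf_environment():
    """Report the TensorFlow version and GPU status; values stay None if TF is missing."""
    info = {"tf_version": None, "built_with_cuda": None, "gpu_available": None}
    try:
        import tensorflow as tf
    except ImportError:
        return info
    info["tf_version"] = tf.__version__
    # Both calls exist in TF 1.x; is_gpu_available is deprecated in TF 2.x.
    info["built_with_cuda"] = tf.test.is_built_with_cuda()
    info["gpu_available"] = tf.test.is_gpu_available()
    return info


def nvcc_version_text():
    """Return the output of `nvcc --version`, or None if nvcc is not on PATH."""
    nvcc = shutil.which("nvcc")
    if nvcc is None:
        return None
    result = subprocess.run([nvcc, "--version"],
                            stdout=subprocess.PIPE,
                            stderr=subprocess.STDOUT,
                            universal_newlines=True)
    return result.stdout


if __name__ == "__main__":
    print(describe_tf_environment())
    print(nvcc_version_text())
```

Posting both outputs in the thread makes it easier to compare the installed combination against the one the project was tested with.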

Author

mtli77 commented Oct 21, 2020

> Hi @Violetit It seems very similar to the problem in here. You may check your TF and CUDA version first.

My CUDA version is 10.0, and I'm using tensorflow-gpu=1.13.2.
