
Make Binary cross entropy with logit numerically stable for high logit values #2562

Open

BeneSim wants to merge 2 commits into main
Conversation

BeneSim commented Oct 14, 2024

See #2561 for details. I added detailed commentary to the commits.

The documentation of binary_cross_entropy_with_logit says that it expects the target to be of type usize, which is wrong and yields a runtime error due to a dtype mismatch in the multiplication step.

In the current implementation of binary_cross_entropy_with_logit the loss will actually be NaN due to taking log(0), which occurs for high logits after passing through a sigmoid and an affine transformation:

inp.affine(-1., 1.)?.log()?
^      ^              ^
|      |              |
1.0    |              |
       0.0            |
                      NaN
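
For concreteness, a minimal repro sketch (mine, not from this PR), assuming candle's Tensor API and candle_nn::ops::sigmoid:

use candle_core::{Device, Result, Tensor};

fn main() -> Result<()> {
    // A large logit saturates sigmoid(inp) to exactly 1.0 in f32.
    let inp = Tensor::new(&[100f32], &Device::Cpu)?;
    let sig = candle_nn::ops::sigmoid(&inp)?;
    // 1 - sigmoid(inp) is then exactly 0, and log(0) is not finite,
    // which poisons the rest of the loss computation.
    let log_one_minus = sig.affine(-1., 1.)?.log()?;
    println!("{log_one_minus}");
    Ok(())
}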

The proposed implementation is taken more or less directly from PyTorch:
https://github.com/pytorch/pytorch/blob/41977a05314bbf537e1c5d6cf5916a368d1907d9/aten/src/ATen/native/Loss.cpp#L362
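
For reference, the linked PyTorch code computes, per element, (1 - t) * x + max_val + log(exp(-max_val) + exp(-x - max_val)) with max_val = max(-x, 0), which keeps both exponents non-positive. A candle-style sketch of that formulation (the function name and the mean reduction are my assumptions):

use candle_core::{Result, Tensor};

fn bce_with_logits(inp: &Tensor, target: &Tensor) -> Result<Tensor> {
    // max_val = max(-inp, 0), i.e. relu(-inp); shifting by it keeps both
    // exponents below zero so neither exp can overflow.
    let max_val = inp.neg()?.relu()?;
    // log(exp(-max_val) + exp(-inp - max_val)) is a stable softplus(-inp).
    let log_term = max_val
        .neg()?
        .exp()?
        .add(&inp.neg()?.sub(&max_val)?.exp()?)?
        .log()?;
    // per-element loss: (1 - target) * inp + max_val + log_term
    let loss = target.affine(-1., 1.)?.mul(inp)?.add(&max_val)?.add(&log_term)?;
    loss.mean_all()
}
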
BeneSim (Author) commented Oct 14, 2024

Just a quick update: I also encountered the case where the logit was so small that the sigmoid returned 0, so I guess we need a better way. The method TensorFlow uses might make sense; they basically calculate log_sigmoid via softplus:

See log_sigmoid and softplus. This could be implemented like:

let log_sigmoid_input = (inp.neg()?.exp()? + 1.)?.log()?.neg()?;

EDIT: This, however, will also quickly overflow for small inp ...
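
A common way around that overflow (my suggestion, not something settled in this thread) is the usual max-shift rewrite: log_sigmoid(x) = min(x, 0) - log(1 + exp(-|x|)), where the exponent is always <= 0. A candle-style sketch:

use candle_core::{Result, Tensor};

fn log_sigmoid(inp: &Tensor) -> Result<Tensor> {
    // min(inp, 0) = -relu(-inp)
    let min_x0 = inp.neg()?.relu()?.neg()?;
    // exp(-|inp|) <= 1, so the softplus term cannot overflow
    let softplus = (inp.abs()?.neg()?.exp()? + 1.)?.log()?;
    min_x0.sub(&softplus)
}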
