
Update SequenceLoss for TensorFlow 2.2 compatibility #1371

Merged
gabrieldemarmiesse merged 2 commits into tensorflow:master from guillaumekln:sequence-loss-tf2.2-compat on Mar 27, 2020

Conversation

guillaumekln
Contributor

For previous versions, we removed the `reduction` attribute to force tf.keras to call the main `__call__` method. Otherwise, it calls the `call` method where we have no way of controlling the reduction logic.

In 2.2, tf.keras has a new way of dealing with loss objects (see the `LossContainer` class). It now always calls `__call__` and requires the `reduction` attribute to exist.

This new logic is only enabled for V2 execution mode, so this commit also disables the deprecated V1 graph mode for this test.

Related to #1320.

Note: this change is only compatible with TensorFlow 2.2 so I'm opening a draft PR for now.
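
For illustration, here is a minimal sketch of the mechanism described above (the class body and names are assumptions, not the actual tfa.seq2seq code):

```python
import tensorflow as tf


class SequenceLoss(tf.keras.losses.Loss):
    """Sketch of the pre-2.2 workaround; names are assumptions,
    not the actual tfa.seq2seq implementation."""

    def __init__(self):
        super().__init__(reduction=tf.keras.losses.Reduction.AUTO)
        # Without a `reduction` attribute, tf.keras fell back to the
        # public `__call__`, the only place our custom reduction logic
        # can run. TF 2.2's LossContainer always calls `__call__` and
        # requires the attribute to exist, so this deletion is what breaks.
        delattr(self, "reduction")

    def call(self, y_true, y_pred):
        # Per-timestep crossentropy; any custom reduction would be
        # applied in `__call__`, not here.
        return tf.nn.sparse_softmax_cross_entropy_with_logits(
            labels=y_true, logits=y_pred)
```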

@gabrieldemarmiesse
Member

I guess the easiest solution is to do a version check before deleting the reduction.

But having a custom reduction prevents us from using a distribution strategy (from what I remember). Would it be possible to use Keras' built-in reductions for some use cases (not all)?
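
For what it's worth, the version check mentioned above could look like this sketch (the helper name and the attribute deletion are assumptions, not the merged fix):

```python
import tensorflow as tf


def _keras_requires_reduction_attr():
    # TF >= 2.2 (the new LossContainer) requires losses to keep their
    # `reduction` attribute; earlier versions tolerated deleting it.
    major, minor = (int(v) for v in tf.__version__.split(".")[:2])
    return (major, minor) >= (2, 2)


# Hypothetical use inside SequenceLoss.__init__:
#     if not _keras_requires_reduction_attr():
#         delattr(self, "reduction")  # pre-2.2 workaround only
```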

@guillaumekln
Contributor Author

I guess the easiest solution is to do a version check before deleting the reduction.

I suppose the project will shortly move to require TensorFlow 2.2, right? Then we could merge this PR as is (after a rebase).

But having a custom reduction prevents us from using a distribution strategy (from what I remember). Would it be possible to use Keras' built-in reductions for some use cases (not all)?

I'm not sure it prevents the use of distribution strategies, but it requires the user to carefully scale the loss (or gradients) based on the global batch size.

We could indeed rely on the built-in reduction for some combinations. But these combinations are actually the ones that can be implemented with tf.keras.losses.{Sparse}CategoricalCrossentropy.
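
As a reference point, this is the kind of scaling tf.distribute expects when the reduction is custom (a generic sketch, not addons code; the global batch size is assumed):

```python
import tensorflow as tf

GLOBAL_BATCH_SIZE = 64  # assumed value for illustration

# Reduction.NONE returns per-example losses; under tf.distribute the
# user must then average over the *global* batch size rather than the
# per-replica one.
loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(
    from_logits=True, reduction=tf.keras.losses.Reduction.NONE)


def compute_loss(labels, logits):
    per_example = loss_fn(labels, logits)
    return tf.nn.compute_average_loss(
        per_example, global_batch_size=GLOBAL_BATCH_SIZE)
```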

@gabrieldemarmiesse
Member

gabrieldemarmiesse commented Mar 25, 2020

I suppose the project will shortly move to require TensorFlow 2.2, right?

To improve the UX, we'd like the Python code to be compatible with as many versions of TF as possible. The custom ops will only be compatible with a specific Python wheel. See #1317 (comment)

We could indeed rely on the built-in reduction for some combinations. But these combinations are actually the ones that can be implemented with tf.keras.losses.{Sparse}CategoricalCrossentropy.

I'll let you do what you think is best here.

@gabrieldemarmiesse gabrieldemarmiesse self-assigned this Mar 25, 2020
@seanpmorgan
Member

Thanks @guillaumekln!

Just wanted to note that this is the last component of #1320 and will block a release built with TF2.2 once it is released.

@qlzh727 Could you review when time allows and advise how best to handle this change in 2.2?

@seanpmorgan seanpmorgan requested a review from qlzh727 March 26, 2020 01:58
@qlzh727
Member

qlzh727 commented Mar 26, 2020

Adding @pavithrasv, who works on the Keras losses, for more input.

@guillaumekln
Contributor Author

guillaumekln commented Mar 27, 2020

I'll let you do what you think is best here.

@gabrieldemarmiesse I propose to just make this a bug fix PR and try not to change the behavior. Maybe @pavithrasv has more recommendations on how to deal with custom loss reduction.

@guillaumekln guillaumekln marked this pull request as ready for review March 27, 2020 08:42
@gabrieldemarmiesse
Member

Let's merge it as it's a good step forward. We can open an issue to see how to replace the `__call__` usage, which relies on a private Keras API.

@gabrieldemarmiesse (Member) left a comment

Thanks again for the fix!

@gabrieldemarmiesse gabrieldemarmiesse merged commit b8cd9bf into tensorflow:master Mar 27, 2020
@guillaumekln guillaumekln deleted the sequence-loss-tf2.2-compat branch March 27, 2020 10:34
jrruijli pushed a commit to jrruijli/addons that referenced this pull request Dec 23, 2020
* Update SequenceLoss for TensorFlow 2.2 compatibility
* Fix format