Conversation
@@ -194,6 +202,9 @@ def text_to_instance(

        fields_dict["metadata"] = MetadataField(meta_fields)

        if weight is not None:
            fields_dict["weight"] = ArrayField(np.array([float(weight)], dtype=np.single))
Just make it a TensorField, and use Torch methods to create the tensors. ArrayField is the old way.
# shape: (batch_size,)
if len(weight.shape) > 1:
    weight = weight.squeeze()
loss = -(weight * log_likelihood).sum() / batch_size
Shouldn't this be / weight.sum()?
I've gone back and forth on this. If you use weight.sum() to normalize, then the weighting is only relative to each batch, which is probably not what you want.

For example, let's say we normalize by weight.sum() and your weights range from 0.5 to 1.0. If you have a batch that contains only instances with weights of 0.5, then this will give you the same result as if they all had weights of 1.0.
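The point about the two normalizations can be checked with a toy calculation in pure Python (the log-likelihood values are made up for illustration):

```python
def batch_loss(weights, log_likelihoods, normalize_by_weight_sum):
    """Toy version of the weighted batch loss under either normalization."""
    total = sum(-w * ll for w, ll in zip(weights, log_likelihoods))
    denom = sum(weights) if normalize_by_weight_sum else len(weights)
    return total / denom

lls = [-2.0, -3.0]  # hypothetical per-instance log-likelihoods

# Normalizing by weight.sum(): a batch of all-0.5 weights is
# indistinguishable from a batch of all-1.0 weights.
a = batch_loss([0.5, 0.5], lls, normalize_by_weight_sum=True)   # 2.5
b = batch_loss([1.0, 1.0], lls, normalize_by_weight_sum=True)   # 2.5

# Normalizing by batch_size: the absolute scale of the weights survives.
c = batch_loss([0.5, 0.5], lls, normalize_by_weight_sum=False)  # 1.25
d = batch_loss([1.0, 1.0], lls, normalize_by_weight_sum=False)  # 2.5
```

This is exactly the trade-off described above: batch_size normalization keeps weights comparable across batches, at the cost of making a uniform rescaling of all weights change the loss magnitude.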
Hmm, fair enough. I would expect that setting all the weights to 1000 should be the same as setting them all to 0.001. But to get that behavior, and also the behavior you want, we would have to sum up all the weights in the dataset before processing a single batch, and then scale each batch accordingly. That's not practical. So let's leave it like this then.
This is just a minor, general improvement to CopyNet. It adds the option to weight the loss contributions of individual instances when calculating the batch loss.