🚨 [Idefics2] Update ignore index #30898

NielsRogge · 2024-05-19T09:08:09Z

What does this PR do?

This PR proposes to set the ignore index of the Idefics2 loss function to -100, the value used by the other LLMs and multimodal models in the Transformers library (and the default ignore index of PyTorch's cross-entropy loss).

It's up to users to create the labels and set any positions they don't want included in the loss calculation to -100. In the case of Idefics2, this could look like:

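# Start from a copy of the input ids, then mask what should not count toward the loss
labels = batch["input_ids"].clone()
labels[labels == processor.tokenizer.pad_token_id] = -100  # padding tokens
labels[labels == processor.image_processor.image_token_id] = -100  # image placeholder tokens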

This means the model no longer masks anything on its own: the user has to make sure padding and image tokens are ignored.
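To make the effect of the -100 labels concrete, here is a minimal pure-Python sketch of what an ignore index does to a mean-reduced cross-entropy loss. It is an illustration only, not the Idefics2 or PyTorch implementation: the helper names `mask_labels` and `masked_cross_entropy` and the token ids `PAD_ID` and `IMAGE_ID` are made up for the example.

```python
import math

IGNORE_INDEX = -100  # default ignore index of PyTorch's cross-entropy loss


def mask_labels(input_ids, ignored_token_ids, ignore_index=IGNORE_INDEX):
    """Copy input_ids, replacing every id in ignored_token_ids with ignore_index."""
    ignored = set(ignored_token_ids)
    return [ignore_index if tok in ignored else tok for tok in input_ids]


def masked_cross_entropy(logits, labels, ignore_index=IGNORE_INDEX):
    """Mean cross-entropy over the positions whose label is not ignore_index."""
    losses = []
    for row, label in zip(logits, labels):
        if label == ignore_index:
            continue  # this position does not contribute to the loss
        # Numerically stable log-sum-exp for the softmax normaliser.
        m = max(row)
        log_norm = m + math.log(sum(math.exp(x - m) for x in row))
        losses.append(log_norm - row[label])
    # With no contributing position there is nothing to average.
    return sum(losses) / len(losses) if losses else float("nan")


# Hypothetical ids, chosen only for this example.
PAD_ID, IMAGE_ID = 0, 1
input_ids = [IMAGE_ID, IMAGE_ID, 3, 2, PAD_ID, PAD_ID]

labels = mask_labels(input_ids, [PAD_ID, IMAGE_ID])

# Uniform logits over a 4-token vocabulary, one row per position.
logits = [[0.0, 0.0, 0.0, 0.0] for _ in input_ids]
loss = masked_cross_entropy(logits, labels)
```

Only the two text positions are averaged here; the image and padding positions drop out of the loss entirely, which is the behaviour users now have to request explicitly through their labels.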

HuggingFaceDocBuilderDev · 2024-05-19T09:27:23Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts

Thanks for adding!

Two things to update before merging:

Update the docstring info to match this logic
Add an example on the model doc page showing how to mask, as users will likely not know to mask the image tokens

VictorSanh

thanks!

amyeroberts

Thanks for updating and providing an example!

Added a 🚨 to flag that this is a breaking change

NielsRogge · 2024-05-21T17:38:19Z

Thanks, btw @VictorSanh could you update the DocVQA notebook to account for this change?

VictorSanh · 2024-05-22T14:03:51Z

> Thanks, btw @VictorSanh could you update the DocVQA notebook to account for this change?

good point! done!

Update ignore index

4b85007

NielsRogge requested a review from VictorSanh May 19, 2024 09:12

NielsRogge requested a review from amyeroberts May 20, 2024 09:04

Merge remote-tracking branch 'upstream/main' into feature/update_idef…

758345a

…ics_ignore_index

amyeroberts reviewed May 20, 2024

NielsRogge added 2 commits May 20, 2024 14:50

Update docs

5f56c14

Update docs

910d11b

VictorSanh approved these changes May 20, 2024

NielsRogge requested a review from amyeroberts May 21, 2024 16:01

amyeroberts changed the title ~~[Idefics2] Update ignore index~~ 🚨 [Idefics2] Update ignore index May 21, 2024

amyeroberts approved these changes May 21, 2024

NielsRogge merged commit 60bb571 into huggingface:main May 21, 2024
22 checks passed
