Docs fix: Multinomial sampling decoding needs "num_beams=1", since by default it is usually not 1. #22473
Fix error in docs: multinomial sampling decoding strategy
As indicated in the library source code:
https://github.com/huggingface/transformers/blob/228792a9dc0c36f1e82ab441e1b1991d116ee0a0/src/transformers/generation/utils.py#LL1364-L1367

Multinomial sampling needs `num_beams=1`. However, this is not indicated in the docs, which can lead users to execute beam-search multinomial sampling instead of the intended multinomial sampling. This deviation from the expected behaviour happens quite often, since many models set the parameter `num_beams` to a value higher than 1 in their `generation_config.json`. This is the case, for example, for the majority of top translation models on the Hub.

I have also included "ancestral sampling" as another name for multinomial sampling, since it is the most common name in the decoding-algorithms literature.
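For illustration, here is a minimal sketch of forcing plain multinomial sampling with `generate()`; the checkpoint name is only an assumed example of a model whose `generation_config.json` may ship with `num_beams > 1`:

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Assumed example checkpoint; many Hub translation models ship a
# generation_config.json with num_beams > 1.
model_name = "Helsinki-NLP/opus-mt-en-fr"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

inputs = tokenizer("The cat sat on the mat.", return_tensors="pt")

# do_sample=True alone is not enough: if the loaded generation config has
# num_beams > 1, generate() runs beam-search multinomial sampling instead.
# Passing num_beams=1 explicitly selects multinomial (ancestral) sampling.
outputs = model.generate(**inputs, do_sample=True, num_beams=1, max_new_tokens=40)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```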
Before submitting
Who can review?
Original authors of this piece of documentation: @gante, @sgugger, @stevhliu and @MKhalusova