Generate: nudge towards `do_sample=False` when `temperature=0.0` #25722

gante · 2023-08-24T10:05:26Z

What does this PR do?

Improves the error message when temperature=0.0, which asymptotically corresponds to greedy decoding... except that it results in numerical problems :D

test run:

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilgpt2")
model = AutoModelForCausalLM.from_pretrained("distilgpt2")

inputs = tokenizer(["The quick brown"], return_tensors="pt")
gen_out = model.generate(**inputs, do_sample=True, temperature=0.0)

yields

ValueError: `temperature` (=0.0) has to be a strictly positive float, otherwise your next token scores will be invalid. If you're looking for greedy decoding strategies, set `do_sample=False`.

HuggingFaceDocBuilderDev · 2023-08-24T10:27:06Z

The documentation is not available anymore as the PR was closed or merged.

ArthurZucker

We could also just set do_sample = False in case temperature = 0. Will let you decide !

ArthurZucker · 2023-08-24T11:13:56Z

src/transformers/generation/logits_process.py

-            raise ValueError(f"`temperature` has to be a strictly positive float, but is {temperature}")
+            except_msg = (
+                f"`temperature` (={temperature}) has to be a strictly positive float, otherwise your next token "
+                "scores will be invalid."


do you mean that it will be nan?

depends on the value the user places here (e.g. a negative float will not generate nans, but make the scores enter uncharted territory), hence the vague message

gante · 2023-08-24T13:14:29Z

We could also just set do_sample = False in case temperature = 0. Will let you decide !

I agree we should do that! But I'm going to leave that for the generate refactor, as it implies significant code changes to do it right :)

…gingface#25722)

nudge towards do_sample

5319ea8

gante requested a review from ArthurZucker August 24, 2023 10:05

gante added 2 commits August 24, 2023 10:06

120 char line

f93d2c5

missing space

9ea68e9

ArthurZucker approved these changes Aug 24, 2023

View reviewed changes

gante merged commit 0a365c3 into huggingface:main Aug 24, 2023

gante deleted the temp_msg branch August 24, 2023 13:15

parambharat pushed a commit to parambharat/transformers that referenced this pull request Sep 26, 2023

Generate: nudge towards do_sample=False when temperature=0.0 (hug…

d132a03

…gingface#25722)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generate: nudge towards `do_sample=False` when `temperature=0.0` #25722

Generate: nudge towards `do_sample=False` when `temperature=0.0` #25722

gante commented Aug 24, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 24, 2023 •

edited

Loading

ArthurZucker left a comment

ArthurZucker Aug 24, 2023

gante Aug 24, 2023

gante commented Aug 24, 2023

Generate: nudge towards do_sample=False when temperature=0.0 #25722

Generate: nudge towards do_sample=False when temperature=0.0 #25722

Conversation

gante commented Aug 24, 2023 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented Aug 24, 2023 • edited Loading

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Aug 24, 2023

Choose a reason for hiding this comment

gante Aug 24, 2023

Choose a reason for hiding this comment

gante commented Aug 24, 2023

Generate: nudge towards `do_sample=False` when `temperature=0.0` #25722

Generate: nudge towards `do_sample=False` when `temperature=0.0` #25722

gante commented Aug 24, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 24, 2023 •

edited

Loading