Support for `frequency_penalty` #275

richardliaw · 2023-11-03T18:43:50Z

`frequency_penalty` takes a value between -2.0 and 2.0 and controls how strongly the model penalizes new tokens based on how often they have already appeared in the text.

Positive values will decrease the likelihood of the model repeating the same line verbatim by penalizing new tokens that have already been used frequently.

This parameter is important for OpenAI compatibility, which is a growing standard for LLM usage.

cc @Yard1 @akshay-anyscale

juney-nvidia · 2023-11-04T14:25:22Z

@richardliaw

Thanks for reporting this. Can you elaborate a bit on the difference between the already supported repetition_penalty and frequency_penalty?

Thanks
June

richardliaw · 2023-11-05T03:44:32Z

Frequency penalty has a specific implementation here: https://platform.openai.com/docs/guides/gpt/parameter-details.

vLLM treats the two differently (repetition_penalty is multiplicative with the logits, whereas frequency penalty is additive): https://github.com/vllm-project/vllm/blob/9f669a9a7c2b2d0a7963a6e29253280e57680adb/vllm/model_executor/layers/sampler.py#L233-L236
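To make the additive versus multiplicative distinction concrete, here is a minimal pure-Python sketch. It is not the vLLM or TensorRT-LLM code: the function names are hypothetical, the frequency penalty follows the formula in the OpenAI guide (subtract `penalty * count` from the logit), and the repetition penalty assumes the common convention of dividing positive logits and multiplying negative ones by the penalty.

```python
from collections import Counter


def apply_frequency_penalty(logits, generated_ids, penalty):
    """Additive: subtract penalty * (occurrence count) from each seen token's logit."""
    out = list(logits)
    for token_id, count in Counter(generated_ids).items():
        out[token_id] -= penalty * count
    return out


def apply_repetition_penalty(logits, seen_ids, penalty):
    """Multiplicative (assumed convention): shrink positive logits, push negative
    logits further down. The occurrence count is ignored; only presence matters."""
    out = list(logits)
    for token_id in set(seen_ids):
        if out[token_id] > 0:
            out[token_id] = out[token_id] / penalty
        else:
            out[token_id] = out[token_id] * penalty
    return out


# Toy vocabulary of 4 tokens; token 0 was generated twice, token 1 once.
logits = [2.0, -1.0, 0.5, 3.0]
generated = [0, 0, 1]

freq_logits = apply_frequency_penalty(logits, generated, penalty=0.5)
rep_logits = apply_repetition_penalty(logits, generated, penalty=2.0)

print(freq_logits)  # [1.0, -1.5, 0.5, 3.0]
print(rep_logits)   # [1.0, -2.0, 0.5, 3.0]
```

In this sketch the frequency penalty scales with how many times a token has appeared, while the repetition penalty applies the same factor whether a token appeared once or many times, so one parameter cannot simply be mapped onto the other.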

juney-nvidia · 2023-11-05T04:01:25Z

Thanks for sharing this. This issue relates to the other one; both concern control of the sampling/decoding process. We will follow up and reply later.

June

kaiyux · 2023-12-27T11:50:23Z

Support should be included in the latest main branch; please see #754.

Closing. Please feel free to comment if you have any questions, thanks!

juney-nvidia self-assigned this Nov 4, 2023

juney-nvidia added the triaged label Nov 4, 2023

juney-nvidia added the feature request and sampling labels Nov 5, 2023

ncomly-nvidia mentioned this issue Dec 11, 2023

TensorRT-LLM Requests #632

Open

41 tasks

kaiyux mentioned this issue Dec 27, 2023

Update TensorRT-LLM main branch #754

Merged

kaiyux closed this as completed Dec 27, 2023
