Convert `SamplingParams.strategy` to a union #767

hardikjshah · 2025-01-15T00:16:43Z

What does this PR do?

Cleans up how we provide sampling params. Earlier, strategy was an enum and all params (top_p, temperature, top_k) across all strategies were grouped. We now have a strategy union object with each strategy (greedy, top_p, top_k) having its corresponding params.
Earlier,

class SamplingParams: 
    strategy: enum ()
    top_p, temperature, top_k and other params

However, the strategy field was not being used in any providers making it confusing to know the exact sampling behavior purely based on the params since you could pass temperature, top_p, top_k and how the provider would interpret those would not be clear.

Hence we introduced -- a union where the strategy and relevant params are all clubbed together to avoid this confusion.

Have updated all providers, tests, notebooks, readme and otehr places where sampling params was being used to use the new format.

Test Plan

pytest llama_stack/providers/tests/inference/groq/test_groq_utils.py
// inference on ollama, fireworks and together
with-proxy pytest -v -s -k "ollama" --inference-model="meta-llama/Llama-3.1-8B-Instruct" llama_stack/providers/tests/inference/test_text_inference.py
// agents on fireworks
pytest -v -s -k 'fireworks and create_agent' --inference-model="meta-llama/Llama-3.1-8B-Instruct" llama_stack/providers/tests/agents/test_agents.py --safety-shield="meta-llama/Llama-Guard-3-8B"

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Ran pre-commit to handle lint / formatting issues.
Read the contributor guideline,
Pull Request section?
Updated relevant documentation.
Wrote necessary unit or integration tests.

ashwinb

looks good to me! thank you for going through all the providers....

hardikjshah requested review from ashwinb, yanxi0830, dltn, raghotham, dineshyv, vladimirivic and sixianyi0721 as code owners January 15, 2025 00:16

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 15, 2025

hardikjshah changed the title ~~Clean up SamplingParams Strategy to be a union to avoid grouping params across all strategies~~ Convert SamplingParams.strategy to a union Jan 15, 2025

ashwinb approved these changes Jan 15, 2025

View reviewed changes

Hardik Shah added 4 commits January 15, 2025 05:38

Update Strategy in SamplingParams to be a union

dea575c

fix groq util tests

d9d827f

fix tests

0edd3ce

fix nvidia sampling logic

cb6c734

ashwinb force-pushed the sampling branch from 1fce33a to cb6c734 Compare January 15, 2025 13:38

ashwinb merged commit a51c8b4 into main Jan 15, 2025
2 checks passed

ashwinb deleted the sampling branch January 15, 2025 13:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert `SamplingParams.strategy` to a union #767

Convert `SamplingParams.strategy` to a union #767

hardikjshah commented Jan 15, 2025 •

edited

Loading

ashwinb left a comment

Convert SamplingParams.strategy to a union #767

Convert SamplingParams.strategy to a union #767

Conversation

hardikjshah commented Jan 15, 2025 • edited Loading

What does this PR do?

Test Plan

Before submitting

ashwinb left a comment

Choose a reason for hiding this comment

Convert `SamplingParams.strategy` to a union #767

Convert `SamplingParams.strategy` to a union #767

hardikjshah commented Jan 15, 2025 •

edited

Loading