
fix(reasoning): compatible with the return data of the reasoning model #8286

Closed

Conversation

kuloud commented Feb 5, 2025

Title

compatible with the return data of the reasoning model

Relevant issues

Fixes #8193

Type

🐛 Bug Fix

Changes

[REQUIRED] Testing - Attach a screenshot of any new tests passing locally

If UI changes, send a screenshot/GIF of working UI fixes
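
For context: DeepSeek-style reasoning models stream an extra reasoning_content field alongside content in each delta. A rough sketch of the chunk shape this PR is trying to pass through (field names follow the DeepSeek API; the values are illustrative):

```python
# Illustrative raw streamed chunks from a DeepSeek-style reasoning model.
# While the model is "thinking", tokens arrive in reasoning_content and
# content stays empty; once the final answer starts, the pattern flips.
reasoning_chunk = {
    "choices": [{
        "index": 0,
        "delta": {"role": "assistant", "content": None,
                  "reasoning_content": "First, light scatters..."},
        "finish_reason": None,
    }]
}

answer_chunk = {
    "choices": [{
        "index": 0,
        "delta": {"content": "Sure", "reasoning_content": None},
        "finish_reason": None,
    }]
}
```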

vercel bot commented Feb 5, 2025

The latest updates on your projects:

Name: litellm · Status: ✅ Ready · Updated (UTC): Feb 5, 2025 5:17pm

kuloud (Author) commented Feb 6, 2025

@krrishdholakia Do you think this PR needs to be merged? There aren't many customization scenarios for reasoning_content, and adding a global param doesn't seem necessary for now.
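
For anyone who just wants to read the field once it is passed through, a minimal consumer-side sketch (assuming the streamed delta exposes reasoning_content, as DeepSeek-style providers return it; the getattr fallback covers chunks where the field is absent):

```python
import litellm

response = litellm.completion(
    model="deepseek/deepseek-r1",
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
    stream=True,
)

for chunk in response:
    delta = chunk.choices[0].delta
    # reasoning_content is only present on some chunks/providers,
    # so fall back to None rather than assuming the attribute exists.
    reasoning = getattr(delta, "reasoning_content", None)
    if reasoning:
        print(f"[reasoning] {reasoning}", end="")
    if delta.content:
        print(delta.content, end="")
```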

qtnx commented Feb 7, 2025

Waiting for this one to be merged to get reasoning output in OpenRouter. It's been quite a while...
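
For OpenRouter specifically, the reasoning stream also had to be requested explicitly; a sketch of what that request might look like (include_reasoning is an OpenRouter request flag per their docs at the time, and forwarding it via extra_body is an assumption about litellm's pass-through on OpenAI-compatible routes):

```python
import litellm

# Hypothetical request: ask OpenRouter to include reasoning tokens in the
# stream. Both the flag name and the extra_body pass-through are assumptions.
response = litellm.completion(
    model="openrouter/deepseek/deepseek-r1",
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
    stream=True,
    extra_body={"include_reasoning": True},
)
```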

krrishdholakia (Contributor) commented

Hi @kuloud, can you share a screenshot of this passing your test?

Once done, it should be good to merge

kuloud (Author) commented Feb 8, 2025

> Hi @kuloud, can you share a screenshot of this passing your test?
>
> Once done, it should be good to merge

I found that I had some misunderstandings about the implementation inside LiteLLM. I tried it, and although Delta does handle the mapping of reasoning_content, I can't quite work out where in streaming_handler.py the chunk is processed in a way that causes the reasoning content to be ignored.

INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': 'Sure', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     root:test_streaming.py:4104 chunk ----: ModelResponseStream(id='chatcmpl-200e1aaf-b64a-4c3a-bc3c-9e31ca6cfadf', created=1738997695, model='deepseek/deepseek-r1', object='chat.completion.chunk', system_fingerprint=None, choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(provider_specific_fields=None, content='Sure', role='assistant', function_call=None, tool_calls=None, audio=None), logprobs=None)], stream_options=None)
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '!', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     root:test_streaming.py:4104 chunk ----: ModelResponseStream(id='chatcmpl-200e1aaf-b64a-4c3a-bc3c-9e31ca6cfadf', created=1738997695, model='deepseek/deepseek-r1', object='chat.completion.chunk', system_fingerprint=None, choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(provider_specific_fields=None, content='!', role=None, function_call=None, tool_calls=None, audio=None), logprobs=None)], stream_options=None)
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': ' Here', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     root:test_streaming.py:4104 chunk ----: ModelResponseStream(id='chatcmpl-200e1aaf-b64a-4c3a-bc3c-9e31ca6cfadf', created=1738997695, model='deepseek/deepseek-r1', object='chat.completion.chunk', system_fingerprint=None, choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(provider_specific_fields=None, content=' Here', role=None, function_call=None, tool_calls=None, audio=None), logprobs=None)], stream_options=None)
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': "'s", 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     root:test_streaming.py:4104 chunk ----: ModelResponseStream(id='chatcmpl-200e1aaf-b64a-4c3a-bc3c-9e31ca6cfadf', created=1738997695, model='deepseek/deepseek-r1', object='chat.completion.chunk', system_fingerprint=None, choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(provider_specific_fields=None, content="'s", role=None, function_call=None, tool_calls=None, audio=None), logprobs=None)], stream_options=None)
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': '', 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0, 'tool_use': None}
INFO     LiteLLM:streaming_handler.py:871 1--------{'text': ' a', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
INFO     root:test_streaming.py:4104 chunk ----: ModelResponseStream(id='chatcmpl-200e1aaf-b64a-4c3a-bc3c-9e31ca6cfadf', created=1738997695, model='deepseek/deepseek-r1', object='chat.completion.chunk', system_fingerprint=None, choices=[StreamingChoices(finish_reason=None, index=0, delta=Delta(provider_specific_fields=None, content=' a', role=None, function_call=None, tool_calls=None, audio=None), logprobs=None)], stream_options=None)

When the reasoning information is processed, execution reaches is_async_iterable (see the screenshot below), but the chunk at that point is:

[screenshot: debugger paused in streaming_handler.py at the is_async_iterable check]

{'text': ' a', 'tool_use': None, 'is_finished': False, 'finish_reason': '', 'usage': None, 'index': 0}
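
The empty-text chunks in the log are consistent with a normalization step that copies only the visible content into the generic chunk dict. A simplified, hypothetical reproduction of that failure mode (not LiteLLM's actual code):

```python
# Hypothetical sketch of the failure mode (NOT LiteLLM's actual code):
# a normalization step that copies only the visible content into the
# generic chunk dict silently drops reasoning-only deltas.

def to_generic_chunk(delta: dict) -> dict:
    return {
        "text": delta.get("content") or "",
        "tool_use": None,
        "is_finished": False,
        "finish_reason": "",
        "usage": None,
        "index": 0,
    }

reasoning_delta = {"content": None, "reasoning_content": "Let me think..."}
print(to_generic_chunk(reasoning_delta))
# -> {'text': '', 'tool_use': None, 'is_finished': False, ...}
#    reasoning_content is gone, matching the empty-text chunks logged above.
```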

kuloud (Author) commented Feb 8, 2025

This PR doesn't solve the actual problem. It still requires your support @krrishdholakia

kuloud (Author) commented Feb 10, 2025

> This PR doesn't solve the actual problem. It still requires your support @krrishdholakia

I fixed the issue, and I will probably open a new PR today or tomorrow.
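
One plausible shape for such a fix, sketched under the same simplified model as above (hypothetical; not the author's actual patch): preserve reasoning_content on the generic chunk so the stream wrapper can re-attach it to the Delta it emits.

```python
# Hypothetical fix direction (not the actual patch): carry the field
# through the generic chunk instead of dropping it during normalization.

def to_generic_chunk_fixed(delta: dict) -> dict:
    chunk = {
        "text": delta.get("content") or "",
        "tool_use": None,
        "is_finished": False,
        "finish_reason": "",
        "usage": None,
        "index": 0,
    }
    if delta.get("reasoning_content") is not None:
        chunk["reasoning_content"] = delta["reasoning_content"]
    return chunk

assert to_generic_chunk_fixed(
    {"content": None, "reasoning_content": "Let me think..."}
)["reasoning_content"] == "Let me think..."
```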

Successfully merging this pull request may close: [Bug]: reasoning_content missing from completion's response