Fix non-function calls messages #5026

enyst · 2024-11-15T14:55:31Z

End-user friendly description of the problem this fixes or functionality that this introduces

Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below
fix 400 error on HuggingFace models
fix KeyError on multiple LLMs

Give a summary of what the PR does, explaining any non-trivial design decisions

This PR proposes a refactoring of Messages and function calling. It fixes the following problems for LLMs that run without native function calling enabled:

400 error on HF for top_p=1
'content' error on many LLMs
serialization of tool-like messages
misc
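
For illustration, the first item (HF rejecting `top_p=1` with a 400) could be handled along these lines. This is only a sketch, not the code in this PR: the helper name `adjust_sampling_params`, the `huggingface` prefix check, and the `0.9` fallback value are all assumptions made for the example.

```python
from typing import Any


def adjust_sampling_params(model: str, params: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of `params` with top_p adjusted for HF-hosted models.

    Hypothetical helper: the prefix check and the 0.9 fallback are
    assumptions for illustration, not the PR's actual implementation.
    """
    adjusted = dict(params)  # copy, so the caller's dict is not mutated
    if model.startswith("huggingface") and adjusted.get("top_p") == 1:
        # Assumed fallback: any value strictly below 1 avoids the 400 described above.
        adjusted["top_p"] = 0.9
    return adjusted
```

Models that do not match the prefix keep their parameters unchanged, so the workaround stays scoped to the provider that needs it.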

HF model example that works:

Link of any specific issues this addresses
Fix #4960
Fix #5090
Fix #5142

To run this PR locally, use the following command:

docker run -it --rm \
  -p 3000:3000 \
  -v /var/run/docker.sock:/var/run/docker.sock \
  --add-host host.docker.internal:host-gateway \
  -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:f6e70f5-nikolaik \
  --name openhands-app-f6e70f5 \
  docker.all-hands.dev/all-hands-ai/openhands:f6e70f5

openhands/llm/fn_call_converter.py

xingyaoww

I have some concern about adding function_calling_enabled; otherwise LGTM!

openhands/core/message.py

xingyaoww · 2024-11-15T15:13:29Z

openhands/llm/llm.py

@@ -567,6 +567,7 @@ def format_messages_for_llm(self, messages: Message | list[Message]) -> list[dic
        for message in messages:
            message.cache_enabled = self.is_caching_prompt_active()
            message.vision_enabled = self.vision_is_active()
+            message.function_calling_enabled = self.is_function_calling_active()


This will only be True for the selected set of models, but we still send messages WITH tool_ids to the LLM.

Ahh so we need the part of the serializer that adds tool_call_id. OK, sure, I'll revert that. Sigh, this is messy. The attempt to separate things when function calling is native is because we need to send a single thing, not a list.

(I'll follow up on litellm too, they actually do this kind of conversion, so we don't have to play whack-a-mole anymore)

@xingyaoww I actually restored this check, but it does send tool ids now. It just sends the message content as a string for non-native function calling, and as a list for native (because this applies now).
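
To make the string-versus-list behavior concrete, here is a minimal sketch. It assumes the comment above refers to the message content; `ToolMessage` and its `serialize` method are hypothetical stand-ins, not the actual `Message` class in `openhands/core/message.py`.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class ToolMessage:
    """Hypothetical stand-in for a tool-result message (not the real Message class)."""

    role: str
    text: str
    tool_call_id: Optional[str] = None
    function_calling_enabled: bool = False

    def serialize(self) -> dict[str, Any]:
        content: Any
        if self.function_calling_enabled:
            # Native function calling: content is a list of typed parts.
            content = [{"type": "text", "text": self.text}]
        else:
            # Non-native function calling: content is a single string.
            content = self.text
        message: dict[str, Any] = {"role": self.role, "content": content}
        # The tool call id is included in both cases.
        if self.tool_call_id is not None:
            message["tool_call_id"] = self.tool_call_id
        return message
```

In this sketch only the shape of `content` depends on the flag; the `tool_call_id` is attached either way, which is the point raised in the review.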

I started to work on refactoring the Message class. I think it needs to include tool calls/ids conversion part (at least as a single Message), do full serialization, and become closer to litellm's Message. I started to work on it on this branch, but in the meantime, I found out that people rely on this PR, and use it via docker because it solves what's broken with non-native function calling.

So I'll make another branch to work on the refactoring, and this PR is up for review again. If it's not too terrible, I'd say let's just fix the problems first, and refactor later.

(I'll clean up some leftover comments too, but I'd prefer to do it once it's approved, because I think another commit would again break people's use of the last commit via docker.)

openhands/llm/llm.py

Co-authored-by: Xingyao Wang <[email protected]>

neubig · 2024-11-21T02:06:31Z

Updated the comment to note that this also fixes #5142.

xingyaoww

LGTM! Thanks!

fix non-function calls

da5fca6

enyst requested a review from xingyaoww November 15, 2024 14:55

xingyaoww reviewed Nov 15, 2024

openhands/llm/fn_call_converter.py

xingyaoww reviewed Nov 15, 2024

enyst marked this pull request as draft November 15, 2024 15:34

enyst and others added 4 commits November 15, 2024 18:18

Update openhands/core/message.py

ba1791a

Co-authored-by: Xingyao Wang <[email protected]>

Update openhands/llm/llm.py

4a31c6f

Co-authored-by: Xingyao Wang <[email protected]>

Update openhands/core/message.py

2ea1297

Co-authored-by: Xingyao Wang <[email protected]>

fix tool calls in non-native function calling messages

ffca2b4

enyst mentioned this pull request Nov 18, 2024

[Bug]: Error during parsing ollama return. #5090

Closed

1 task

enyst force-pushed the enyst/conversion-fixes branch from 07c3643 to ffca2b4 Compare November 20, 2024 22:51

enyst mentioned this pull request Nov 21, 2024

[Bug]: FinishTool doesn't have a tool response #5154

Open

1 task

add comment

f6e70f5

enyst marked this pull request as ready for review November 21, 2024 00:42

enyst requested a review from xingyaoww November 21, 2024 01:06

enyst mentioned this pull request Nov 21, 2024

Fix KeyError when accessing content in messages with function calls #5158

Closed

1 task

xingyaoww approved these changes Nov 21, 2024

enyst enabled auto-merge (squash) November 21, 2024 17:57

enyst merged commit d08886f into main Nov 21, 2024
13 checks passed

enyst deleted the enyst/conversion-fixes branch November 21, 2024 18:18
