Fix Agents to support code and rag simultaneously #908

hardikjshah · 2025-01-30T19:55:18Z

What does this PR do?

Fixes a bug where agents were not working when both rag and code-interpreter were added as tools.

Test Plan

Added a new client_sdk test which tests for this scenario

LLAMA_STACK_CONFIG=together pytest -s -v  tests/client-sdk -k 'test_rag_and_code_agent'

yanxi0830 · 2025-01-30T20:00:40Z

llama_stack/providers/inline/agents/meta_reference/agent_instance.py

@@ -476,9 +476,6 @@ async def _run(
                )
                span.set_attribute("output", retrieved_context)
                span.set_attribute("tool_name", MEMORY_QUERY_TOOL)
-                if retrieved_context:
-                    last_message = input_messages[-1]
-                    last_message.context = retrieved_context


is there another place where we set the context for RAG?

yeah how does RAG even work if you don't do this?

yes, all tests were passing for RAG but then i realized there was nothing getting added to the context.
I tested this change and now can see RAG work properly.

ehhuang · 2025-01-30T22:16:11Z

llama_stack/providers/inline/agents/meta_reference/agent_instance.py

+
+                # append retrieved_context to the last user message
+                for message in input_messages[::-1]:
+                    if isinstance(message, UserMessage):


What other message type could this be if it's triggering RAG?

Separately curious why for RAG results we use .context but for other tool execs we do 708 input_messages = input_messages + [message, result_message]

@ehhuang I think @hardikjshah did this so you could identify which message you added context to and then in the next turn get rid of it. we needed to keep only one context around as the turns proceeded.

I really think we should nuke that .context field completely and manage whatever state we need to manage completely within agent_instance.py (and ensure the invariant above)

What other message type could this be if it's triggering RAG?

That was the original bug , where if both code and rag are enabled, we end up with a ToolResponseMessage coming from the code_interpreter side (check out def handle_documents)

And it was erroring since ToolResponseMessage does not have any attr context.

I agree that we should find a better solution to context. May be RAG responses are also passed in the ToolResponseMessage like all other tools. Although outside the scope of this PR.

ashwinb

lg! Agree with killing .context completely.

hardikjshah and others added 2 commits January 30, 2025 10:59

drop last_message

80348cb

remove logs from test

6695627

hardikjshah requested review from ashwinb, yanxi0830, dltn, raghotham, dineshyv, vladimirivic and sixianyi0721 as code owners January 30, 2025 19:55

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 30, 2025

yanxi0830 reviewed Jan 30, 2025

View reviewed changes

add back RAG logic

e2bfaf9

ehhuang reviewed Jan 30, 2025

View reviewed changes

check for exact response in rag

fa52813

ashwinb approved these changes Jan 31, 2025

View reviewed changes

hardikjshah merged commit 97eb3ee into main Jan 31, 2025
2 checks passed

hardikjshah deleted the agent_fix branch January 31, 2025 01:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Agents to support code and rag simultaneously #908

Fix Agents to support code and rag simultaneously #908

hardikjshah commented Jan 30, 2025

yanxi0830 Jan 30, 2025

ashwinb Jan 30, 2025

hardikjshah Jan 30, 2025

ehhuang Jan 30, 2025

ashwinb Jan 30, 2025

hardikjshah Jan 30, 2025

ashwinb left a comment

Fix Agents to support code and rag simultaneously #908

Fix Agents to support code and rag simultaneously #908

Conversation

hardikjshah commented Jan 30, 2025

What does this PR do?

Test Plan

yanxi0830 Jan 30, 2025

Choose a reason for hiding this comment

ashwinb Jan 30, 2025

Choose a reason for hiding this comment

hardikjshah Jan 30, 2025

Choose a reason for hiding this comment

ehhuang Jan 30, 2025

Choose a reason for hiding this comment

ashwinb Jan 30, 2025

Choose a reason for hiding this comment

hardikjshah Jan 30, 2025

Choose a reason for hiding this comment

ashwinb left a comment

Choose a reason for hiding this comment