Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: prompting O1 #1891

Merged
merged 63 commits into from
Oct 17, 2024
Merged

feat: prompting O1 #1891

merged 63 commits into from
Oct 17, 2024

Conversation

kl2806
Copy link
Collaborator

@kl2806 kl2806 commented Oct 15, 2024

Please describe the purpose of this pull request.
Create a new agent type O1Agent, that uses prompting to get the agent to "think" more at inference time before responding by calling a think function repeatedly before the final answer function.

How to test
How can we test your PR during review? What commands should we run? What outcomes should we expect?
Added a simple test in test_o1_agent that creates an O1 agent and asks it to compare which two numbers are larger. With got-4o-mini, it improves accuracy from 0->80%. The test just checks that 3 steps to respond.

Have you tested this PR?
Have you tested the latest commit on the PR? If so please provide outputs from your tests.

Related issues or PRs
Please link any related GitHub issues or PRs.
This is based on the refactor to make Agent multi-step in #1884.

@kl2806 kl2806 mentioned this pull request Oct 15, 2024
@sarahwooders sarahwooders merged commit 0249d82 into main Oct 17, 2024
15 checks passed
@cpacker cpacker deleted the o1 branch October 20, 2024 00:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants