
Add LLMCheckerChain #281

Merged 3 commits into langchain-ai:master on Dec 9, 2022

Conversation

@andychenmagrathea-06e0a82f-34fc-48ca (Contributor) commented Dec 7, 2022

Implementation of https://github.com/jagilley/fact-checker. Works pretty well.

[Screenshot: example run of the checker chain, 2022-12-07 4:41 PM]

Verifying this manually:

  1. "Only two kinds of egg-laying mammals are left on the planet today—the duck-billed platypus and the echidna, or spiny anteater." https://www.scientificamerican.com/article/extreme-monotremes/
  2. "An [Echidna] egg weighs 1.5 to 2 grams (0.05 to 0.07 oz)[19] and is about 1.4 centimetres (0.55 in) long." https://en.wikipedia.org/wiki/Echidna#:~:text=sleep%20is%20suppressed.-,Reproduction,a%20reptile%2Dlike%20egg%20tooth.
  3. "A [platypus] lays one to three (usually two) small, leathery eggs (similar to those of reptiles), about 11 mm (7⁄16 in) in diameter and slightly rounder than bird eggs." https://en.wikipedia.org/wiki/Platypus#:~:text=It%20lays%20one%20to%20three,slightly%20rounder%20than%20bird%20eggs.
  4. Therefore, an Echidna is the mammal that lays the biggest eggs.

cc @hwchase17
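
For anyone who wants to reproduce the run in the screenshot, here is a minimal usage sketch (assuming the chain ends up importable as langchain.chains.LLMCheckerChain; the OpenAI model settings are illustrative, not taken from this PR):

```python
# Minimal sketch: run the checker chain on the example question.
# Assumes LLMCheckerChain is exported from langchain.chains and that an
# OpenAI API key is configured in the environment.
from langchain.chains import LLMCheckerChain
from langchain.llms import OpenAI

llm = OpenAI(temperature=0.7)
checker_chain = LLMCheckerChain(llm=llm, verbose=True)

question = "What type of mammal lays the biggest eggs?"
print(checker_chain.run(question))
# The chain drafts an answer, lists the assumptions behind it, checks each
# assumption, and then returns a revised answer (e.g. the echidna).
```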

@hwchase17 (Contributor) left a comment

So I'm down to check this in as-is because I think it's great, but one thing I'd like to think about before checking it in: how general could we make this? E.g., what is the most general prompt/chain that would work with this?

E.g., right now the use case here is that you ask a generic question in the first prompt. Are there other good use cases for this?

@andychenmagrathea-06e0a82f-34fc-48ca (Contributor, Author)

> So I'm down to check this in as-is because I think it's great, but one thing I'd like to think about before checking it in: how general could we make this? E.g., what is the most general prompt/chain that would work with this?

In the most general case, I think this verification becomes an agent, not a chain? You probably want to have an agent list its assumptions and then verify them with its tools, e.g. go to Wikipedia to look up a list of mammals.

This actually looks a lot like a parallel version of self-ask?
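
A rough sketch of that fan-out framing, written against a generic `llm(prompt) -> str` callable rather than any particular LangChain class (the prompts and helper names below are illustrative):

```python
# Illustrative fan-out verification: draft an answer, extract its assumptions,
# then check each assumption independently (in parallel, unlike self-ask's
# sequential follow-up questions). `llm` is any prompt -> completion callable.
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

def fact_check(llm: Callable[[str], str], question: str) -> str:
    draft = llm(f"{question}\nAnswer:")
    assertions = [
        line.strip()
        for line in llm(
            f"Here is a statement:\n{draft}\n"
            "List the assumptions you made when producing it, one per line:"
        ).splitlines()
        if line.strip()
    ]

    def check(assertion: str) -> str:
        return llm(f"Is the following true? Answer briefly with reasoning.\n{assertion}")

    with ThreadPoolExecutor() as pool:
        checks = list(pool.map(check, assertions))

    return llm(
        "\n".join(checks)
        + f"\n\nIn light of the checks above, give a revised answer to: {question}"
    )
```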

@hwchase17

@hwchase17 (Contributor)

> > So I'm down to check this in as-is because I think it's great, but one thing I'd like to think about before checking it in: how general could we make this? E.g., what is the most general prompt/chain that would work with this?
>
> In the most general case, I think this verification becomes an agent, not a chain? You probably want to have an agent list its assumptions and then verify them with its tools, e.g. go to Wikipedia to look up a list of mammals.
>
> This actually looks a lot like a parallel version of self-ask?
>
> @hwchase17

Oh, I like that idea. In that framing the checkers are the tools, right? So what part of this chain is that checker/tool? Not the whole thing, because it also contains the question. Maybe the two parts at the end?

@andychenmagrathea-06e0a82f-34fc-48ca (Contributor, Author)

Right, so I guess there are two options:

  1. If we turn LLMCheckerChain into an agent, the verification of each individual assumption is routed to the appropriate tool by the agent. I can see this as potentially being more robust, at the cost of increased compute (especially if you allow the agent to call itself; self-ask doesn't have as much of a problem because there's no fan-out).

  2. On the other hand, I do see benefits of keeping things as-is and having a fast-running verification chain as just a tool (sketched below) — it seems like a cheap step to throw in if an agent is unsure of an answer.

Maybe we do both? #1 when compute/latency isn't as much of an issue, #2 more widely used to make things a bit more correct?
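
A rough sketch of option 2, wrapping the verification chain as a tool an agent can reach for when it is unsure of an answer (the Tool/initialize_agent wiring is an assumption about the agent interface LangChain exposed around this time, not part of this PR):

```python
# Sketch of option 2: expose the fast-running verification chain as an agent
# tool. The agent setup below is assumed, not taken from this PR.
from langchain.agents import Tool, initialize_agent
from langchain.chains import LLMCheckerChain
from langchain.llms import OpenAI

llm = OpenAI(temperature=0)
checker_chain = LLMCheckerChain(llm=llm)

tools = [
    Tool(
        name="Fact Checker",
        func=checker_chain.run,
        description="Useful for double-checking a factual answer before returning it.",
    )
]

agent = initialize_agent(tools, llm, agent="zero-shot-react-description", verbose=True)
agent.run("What type of mammal lays the biggest eggs?")
```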

@hwchase17 (Contributor)

> Right, so I guess there are two options:
>
> 1. If we turn LLMCheckerChain into an agent, the verification of each individual assumption is routed to the appropriate tool by the agent. I can see this as potentially being more robust, at the cost of increased compute (especially if you allow the agent to call itself; self-ask doesn't have as much of a problem because there's no fan-out).
> 2. On the other hand, I do see benefits of keeping things as-is and having a fast-running verification chain as just a tool — it seems like a cheap step to throw in if an agent is unsure of an answer.
>
> Maybe we do both? #1 when compute/latency isn't as much of an issue, #2 more widely used to make things a bit more correct?

What would (1) look like exactly? Would we change the prompt of the agent to tell it to answer the question and then use a tool to fact-check? What would that tool look like? Would that tool basically be the prompts here?

Is there a common prerequisite for both (1) and (2), namely the tool/chain that does the verification? And in (2) it's just a simple sequence of original LLM -> verification chain, while in (1) it's a tool the agent has access to?

@andychenmagrathea-06e0a82f-34fc-48ca changed the title from [WIP] Add LLMCheckerChain to Add LLMCheckerChain on Dec 9, 2022
@andychenmagrathea-06e0a82f-34fc-48ca (Contributor, Author)

Scoping this PR to a chain for now. Ready for review!

@hwchase17 merged commit 5267ebc into langchain-ai:master on Dec 9, 2022