Combine FAQ and extractive QA Pipelines #902

F95GIT · 2021-03-18T11:48:07Z

Hi,

I understand from your docs, that it's possible to combine several retrievers and join their results.
Is it also possible to combine the extractive QA pipeline and the FAQ pipeline?
I would like to achieve something similar to this: if a certain term "XY" is included in the query use the FAQ pipeline, if not use the extractive QA pipeline.
Or even better: run the query in both pipelines, join the results and return the most relevant results.
If this is possible, could you please give me a pointer on how to do this?
Thank you!

tholor · 2021-03-29T13:35:46Z

Hey @F95GIT ,

Sure, that's possible. It should work pretty similarly to the "multiple retriever examples" in the docs.
Let me try to give you some pointers:

Route Query (Run only one branch)

    class QueryClassifier():
        outgoing_edges = 2

        def run(self, **kwargs):
            # you can put here some rule or fast classification model
            if "?" in kwargs["query"]:
                return (kwargs, "output_1")
            else:
                return (kwargs, "output_2")

    pipe = Pipeline()
    pipe.add_node(component=QueryClassifier(), name="QueryClassifier", inputs=["Query"])
    pipe.add_node(component=dpr_retriever, name="DPRRetriever", inputs=["QueryClassifier.output_1"])
    pipe.add_node(component=reader, name="QAReader", inputs=["DPRRetriever"])
    pipe.add_node(component=embedding_retriever, name="FAQRetriever", inputs=["QueryClassifier.output_2"])
    res = p.run(query="What did Einstein work on?", top_k_retriever=1)

Combine results (Run both branches)
Ideally we should be able to define some pipeline like this:

p = Pipeline()
#extractive branch
p.add_node(component=dpr_retriever, name="DPRRetriever", inputs=["Query"])
p.add_node(component=reader, name="QAReader", inputs=["DPRRetriever"])
#faq branch
p.add_node(component=embedding_retriever, name="EmbeddingRetriever", inputs=["Query"])
#TODO add conversion node "Docs2Answers"
p.add_node(component=JoinAnswers(join_mode="concatenate"), name="JoinResults", inputs=["EmbeddingRetriever", "QAReader"])
res = p.run(query="What did Einstein work on?", top_k_retriever=1)

However, I just realized, that the FAQPipeline has one extra step needed to convert the retrieved documents into proper "answers" (see code here). This is needed, as both branches in the pipeline need to pass lists of Answer to the JoinAnswers node. So we would need a new node (e.g. Document2Answer) that converts the output of EmbeddingRetriever to the needed format.

I guess you already saw those, but maybe it's helpful for others who read this: here are the exemplary code snippets for the retriever pipelines in our docs: https://haystack.deepset.ai/docs/latest/pipelinesmd#Multiple-retrievers

Let me know if you have any further questions on this!

F95GIT · 2021-04-07T19:25:26Z

@tholor thank you for your detailed answer.
I managed to run the first option (route query), which works well for my use case. If I find the time, I will experiment a bit with second option.

SasikiranJ · 2021-05-21T11:55:47Z

@tholor is Document2Answer class available?

tholor · 2021-05-21T12:51:51Z

Nope. You are right this would still be necessary for your case in #1081.

Would you be interested in raising a pull request for it? I can help you with the implementation if needed. Should be rather straight forward...

SasikiranJ · 2021-05-22T02:49:46Z

@tholor I will try to do PR for that. You have mentioned JoinAnswers class as well. Both Document2Answer and JoinAnswers should be implemented right?

F95GIT added the question label Mar 18, 2021

Timoeller assigned oryx1729 Mar 18, 2021

F95GIT closed this as completed Apr 7, 2021

sophgit mentioned this issue May 21, 2021

Combining FAQ and ExtractiveQA in a pipeline with conditions #1081

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Combine FAQ and extractive QA Pipelines #902

Combine FAQ and extractive QA Pipelines #902

F95GIT commented Mar 18, 2021

tholor commented Mar 29, 2021

F95GIT commented Apr 7, 2021

SasikiranJ commented May 21, 2021

tholor commented May 21, 2021

SasikiranJ commented May 22, 2021

Combine FAQ and extractive QA Pipelines #902

Combine FAQ and extractive QA Pipelines #902

Comments

F95GIT commented Mar 18, 2021

tholor commented Mar 29, 2021

F95GIT commented Apr 7, 2021

SasikiranJ commented May 21, 2021

tholor commented May 21, 2021

SasikiranJ commented May 22, 2021