Same larger goal as #1168 but for querying instead of indexing.
As discussed with @oryx1729 and @tholor, we'd like to design the node and Pipeline APIs so that batches of queries can be processed together. This will provide a better user experience and also allow for future batch optimizations to speed up querying. Note that here we're focusing first on designing the interfaces; the optimizations can be handled separately for each node in dedicated issues.
One design choice we decided upon is to have an explicit separation of the single and batch functions. This makes for a clear user experience and avoids ambiguity for developers reading the code. More specifically, we want to avoid any uncertainty about the input and output formats of nodes.
If we call Pipeline.run_batch(), every node should be executed via its Node.run_batch() method, not Node.run().
In particular, @bogdankostic and @julian-risch agreed in the refinement of this issue that Pipeline.run_batch() should be implemented with a signature that accepts batched inputs and should return a list of the elements usually returned by run().
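To make the intended shape concrete, here is a hypothetical sketch of what such a signature could look like. This is an illustration based on the discussion in this issue, not the final API; the class name and the exact parameter set are assumptions.

```python
from typing import Any, Dict, List, Optional, Union


class BatchQueryPipeline:
    """Illustrative stand-in for Pipeline; not the actual Haystack class."""

    def run_batch(
        self,
        queries: Optional[Union[str, List[str]]] = None,  # single query or a batch
        file_paths: Optional[List[str]] = None,           # indexing pipelines only
        labels: Optional[List[Any]] = None,
        documents: Optional[List[Any]] = None,
        meta: Optional[Union[dict, List[dict]]] = None,   # one dict, or one per query
        params: Optional[dict] = None,                    # shared by all queries
        debug: Optional[bool] = None,
    ) -> List[Dict[str, Any]]:
        # Each element of the returned list has the same shape as a
        # single Pipeline.run() result.
        raise NotImplementedError
```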
There might be use cases where the same query is executed separately on different sets of documents. Therefore, queries is still allowed to be a single query.
Note that every query in the batch needs to use the same params for now; otherwise, optimization would be hardly possible. We could lift that limitation in the future if necessary.
file_paths and meta are so far only used in indexing pipelines. If they are set, we should call run() instead of run_batch(), or raise an error if that is not possible.
Note that we should allow for flexibility in the format of the metadata passed into Pipeline.run_batch(). If it is a single meta dictionary, it should be applied to all queries. If it is a list, it should be one meta dict for each query.
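One way to handle this flexibility is to normalize the meta argument up front. This is a sketch; the helper name normalize_meta is hypothetical and not part of any existing API.

```python
from typing import List, Optional, Union


def normalize_meta(
    meta: Optional[Union[dict, List[dict]]], num_queries: int
) -> List[Optional[dict]]:
    """Broadcast a single meta dict to all queries, or validate a per-query list."""
    if meta is None:
        return [None] * num_queries
    if isinstance(meta, dict):
        # A single dict applies to every query in the batch.
        return [meta] * num_queries
    if len(meta) != num_queries:
        raise ValueError(f"Expected {num_queries} meta dicts, got {len(meta)}")
    return list(meta)
```

After normalization, downstream code can always assume one meta dict (or None) per query.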
Further, every node's predict_batch() should have a batch_size param that can be passed via params.
For now, a Node.run_batch() can be implemented in a naive, non-optimized way by simply calling Node.run() multiple times in a loop, splitting queries, labels, documents, and meta if necessary so that run() is called with a single query and list of documents. The individual results then need to be collected into a list of results.
As an alternative, BaseComponent could implement run_batch() by calling the node's run() method.
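A minimal sketch of such a naive default on a base class could look as follows. The class and method bodies here are illustrative, not the actual Haystack BaseComponent API.

```python
from typing import Any, Dict, List


class ComponentBase:
    """Illustrative base class; not the actual Haystack BaseComponent."""

    def run(self, query: str, **kwargs) -> Dict[str, Any]:
        raise NotImplementedError

    def run_batch(self, queries: List[str], **kwargs) -> List[Dict[str, Any]]:
        # Naive fallback: process each query separately and collect the
        # per-query results into a list. Nodes can override this with an
        # optimized batched implementation later.
        return [self.run(query=q, **kwargs) for q in queries]


class EchoNode(ComponentBase):
    """Toy node used only to demonstrate the fallback behavior."""

    def run(self, query: str, **kwargs) -> Dict[str, Any]:
        return {"answers": [query.upper()]}
```

With this default, `EchoNode().run_batch(["a", "b"])` yields one result dict per query without EchoNode implementing any batching itself.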
The FARMReader already has a predict_batch() method that can be used: haystack/haystack/nodes/reader/farm.py (line 528 at f33c2b9).
For transformer-based models, we could make use of the transformers library's pipeline batching mechanism: https://huggingface.co/docs/transformers/main_classes/pipelines#pipeline-batching
However, the docs caution that batching is not automatically a performance win and can even be slower in some cases, depending on hardware and data, so we should benchmark it for our use case.