Programmable pipeline for AskAsync #612

alkampfergit · 2024-02-29T17:30:50Z

alkampfergit
Feb 29, 2024

Context / Scenario

Actually if you use the AskAsync method of the memory the system will perform a vector search and use the most X relevant result to pass to LLM.

It would be interesting to customize this pipeline, so we can retrieve documents with other techniques, an example could be query expansion, re-ranking, using BM25 in parallel to vector search then rerank etc.

Ingestion pipeline is fully customizable, it would be fantastic if query part could be also customizable.

The problem

It is difficult to implmenent advanced techniques like re-reanking or query expansion.

Proposed solution

It could be nice that the AskAsync methods simply is changed to a pipeline with default component. Default solution is two stage, the first is vector search, the other takes document in order of vector search result. But we can change confguring different pipeline for more advanced techniques.

Importance

would be great to have

alkampfergit · 2024-03-27T15:44:14Z

alkampfergit
Mar 27, 2024
Author

@dluc I'm trying to lay down an example I'm starting here https://github.com/alkampfergit/SemanticKernelPlayground/blob/feature/better_search/200_CSharpSemanticMemory/KernelMemorySamples/Samples/CustomPipelineBase.cs just a simple way to decouple search/query, next step I'll add re-ranker

0 replies

OrionSeven · 2024-05-09T19:12:53Z

OrionSeven
May 9, 2024

+1 At minimum the ability to easily support rerankers would be highly beneficial.

0 replies

alkampfergit · 2024-05-11T05:35:06Z

alkampfergit
May 11, 2024
Author

I've done it here https://github.com/alkampfergit/KernelMemory.Extensions

If you want I've done a couple of video on a chain with Keyword + Vector -> reranking

You can find it here https://www.linkedin.com/feed/update/urn:li:activity:7195302978747060225/

0 replies

alkampfergit · 2024-06-05T06:31:19Z

alkampfergit
Jun 5, 2024
Author

I'm actually using for demo and planning on going in production soon with the extension, if something is interesting feel free to ask me to include in the main Kernel Memory package. I've created extensions mainly because I can move at my speed in doing and breaking things to experiment.

Actually I have good result with new openai embedding + cohere reranking + cohere Command R+ LLM deployed on azure ai studio, great quality of results, good citations (it can tell you which document used for answer)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Programmable pipeline for AskAsync #612

{{title}}

Replies: 4 comments

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

Programmable pipeline for AskAsync #612

alkampfergit Feb 29, 2024

Context / Scenario

The problem

Proposed solution

Importance

Replies: 4 comments

alkampfergit Mar 27, 2024 Author

OrionSeven May 9, 2024

alkampfergit May 11, 2024 Author

alkampfergit Jun 5, 2024 Author

alkampfergit
Feb 29, 2024

alkampfergit
Mar 27, 2024
Author

OrionSeven
May 9, 2024

alkampfergit
May 11, 2024
Author

alkampfergit
Jun 5, 2024
Author