
[FR] RAG and Ollama embedding model #30

Open

wwjCMP opened this issue Apr 29, 2024 · 4 comments

Comments

@wwjCMP commented Apr 29, 2024

Using an Ollama embedding model to implement RAG has become quite common in Obsidian plugins. I wonder whether this plugin will be extended in that direction next.
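For context, "using an Ollama embedding model" typically means posting text to a locally running Ollama server and getting a vector back. A minimal Python sketch, assuming Ollama is running on its default port with an embedding model already pulled (the model name `nomic-embed-text` is an illustrative assumption, not something Cannoli ships with):

```python
import requests

def embed(text: str, model: str = "nomic-embed-text") -> list[float]:
    # Ask the local Ollama server for an embedding vector.
    # Assumes Ollama is running on its default port with the model pulled.
    resp = requests.post(
        "http://localhost:11434/api/embeddings",
        json={"model": model, "prompt": text},
        timeout=30,
    )
    resp.raise_for_status()
    return resp.json()["embedding"]

vector = embed("Cannoli runs LLM workflows drawn on the Obsidian canvas.")
print(len(vector))  # dimensionality depends on the model (768 for nomic-embed-text)
```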

@cephalization (Member)

I'm not familiar with that; do you have any references?

@blindmansion (Member)

Yep, embeddings are something we've been thinking about implementing soon as well. We may even piggyback off of the Smart Connections embeddings, since that plugin lets other plugins use the embeddings it creates for a vault.

We're still thinking about how to implement them in a way that makes sense for Cannoli.

@oyajiru commented Nov 1, 2024

I'm not a developer, but I love using Cannoli for AI-powered workflows in Obsidian, and I'm very interested in this feature for drafting research papers and fiction.

Would it be possible to integrate a lightweight vector database like Milvus Lite into Cannoli to create on-the-fly Retrieval-Augmented Generation (RAG) databases for each workflow? This could let users work around token limits and run larger language models on consumer hardware by keeping each prompt's retrieved context small enough to fit in system memory. I believe this could also help limit hallucinations.

The idea is:

  1. On-the-fly RAG databases using Milvus Lite:
    Build a disposable vector store for each workflow at run time, so each step retrieves only the most relevant chunks rather than whole notes, even with lower token contexts (a sketch follows this list).

  2. Advanced data management features:
    a. Selective data removal: Implement a feature to remove specific obsolete information, possibly by "reversing the arrows" from the knowledge input node to the one representing the database.
    b. Information editing: Allow users to edit existing information by adding it to the reference node, updating the vector representations whenever the content of a knowledge node is changed.
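To make item 1 concrete, here is a minimal sketch of a per-workflow store using the `pymilvus` MilvusClient, which runs Milvus Lite when given a local file path. The collection name, the 768-dimension figure, and the placeholder `embed()` are assumptions for illustration; in practice the vectors would come from a real embedding model, such as the Ollama call sketched earlier in this thread:

```python
import hashlib
from pymilvus import MilvusClient

def embed(text: str) -> list[float]:
    # Placeholder so this sketch runs standalone: a deterministic fake
    # 768-dim vector. In practice, call a real embedding model (e.g. the
    # Ollama endpoint sketched earlier in this thread).
    digest = hashlib.sha256(text.encode()).digest()  # 32 bytes
    return [b / 255.0 for b in digest] * 24          # 32 * 24 = 768 dims

# One throwaway Milvus Lite file per workflow run -- an "on-the-fly" database.
client = MilvusClient("workflow_run.db")
client.create_collection(collection_name="chunks", dimension=768)

# Index a few note chunks (in practice, pulled from the vault).
notes = [
    "Cannoli runs LLM workflows drawn on the Obsidian canvas.",
    "Milvus Lite stores vectors in a single local file.",
]
client.insert(
    collection_name="chunks",
    data=[{"id": i, "vector": embed(t), "text": t} for i, t in enumerate(notes)],
)

# Retrieve only the most relevant chunk for a question, so the prompt stays
# within a local model's context window.
hits = client.search(
    collection_name="chunks",
    data=[embed("What does Cannoli do?")],
    limit=1,
    output_fields=["text"],
)
print(hits[0][0]["entity"]["text"])
```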

Benefits:

  • Enables use of larger, more capable models without requiring extensive hardware resources.
  • Allows for more dynamic and contextually rich interactions within Cannoli workflows.
  • Provides users with greater control over their knowledge base, allowing for data refinement and updates, all within the canvas.

Implementation considerations:

  1. I believe Milvus Lite supports complex delete expressions, which could be leveraged for selective data removal.
  2. The Python client for Milvus allows for data insertion, updating, and deletion, which could be used to implement the editing feature (a sketch of both operations follows this list).
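As a rough illustration of points 2a and 2b, the MilvusClient API exposes delete-by-filter and upsert. The collection name, field names, and dimension here are assumptions carried over from the earlier sketch:

```python
from pymilvus import MilvusClient

client = MilvusClient("workflow_run.db")  # same illustrative Milvus Lite file

# 2a. Selective removal: drop every chunk that came from one obsolete note,
# using a boolean filter expression instead of tracking individual IDs.
client.delete(
    collection_name="chunks",
    filter='source == "Obsolete Note.md"',
)

# 2b. Editing: upsert replaces the stored vector and text for an existing id,
# which is how re-embedding a changed knowledge node could be handled.
client.upsert(
    collection_name="chunks",
    data=[{
        "id": 0,
        "vector": [0.0] * 768,  # placeholder; use the fresh embedding here
        "text": "Updated contents of the knowledge node",
        "source": "Knowledge Node.md",
    }],
)
```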

What are your thoughts on this? Would this be feasible to implement, and do you see any potential challenges or alternative approaches?

Thank you for considering these suggestions and for your fantastic work on Cannoli!
