OllamaPDFInsight is a small RAG (Retrieval-Augmented Generation) system that leverages Ollama to create embeddings from a PDF file, stores these embeddings in a Weaviate vector store, and uses Ollama to answer questions regarding the PDF content.
- Use LangChain document loader to turn the PDF into a set of documents.
- Create a collection from these documents in the Weaviate vector store.
- Use Ollama to generate embeddings from the documents, and store the embeddings in the collection.
- Query and retrieve information from the PDF using Ollama.
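At its core, the retrieval step in a RAG system ranks stored chunks by the similarity of their embeddings to the query embedding. A minimal, dependency-free sketch of that idea (toy hand-made vectors stand in for real Ollama embeddings; all names here are illustrative, not the repo's actual code):

```python
# Rank stored chunks by cosine similarity between their embeddings and the
# query embedding. The vectors below are toy stand-ins for Ollama embeddings.
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Pretend vector store: chunk text -> embedding
store = {
    "The PDF discusses vector databases.": [0.9, 0.1, 0.0],
    "Weaviate stores embeddings for search.": [0.4, 0.6, 0.2],
    "Unrelated cooking tips.": [0.0, 0.2, 0.9],
}

query_vec = [0.85, 0.2, 0.05]  # pretend embedding of the user question
best = max(store, key=lambda text: cosine(store[text], query_vec))
print(best)  # the chunk most relevant to the question
```

A real run replaces the toy vectors with embeddings generated by Ollama and lets Weaviate do this nearest-neighbor search at scale.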
- Create a virtual environment:
pyenv install 3.8.10
pyenv virtualenv 3.8.10 ollama-pdf-insight-env
pyenv activate ollama-pdf-insight-env
- Once the virtual environment is activated, install the requirements from the
requirements.txt
file.
pip install -r requirements.txt
- Clone the repository:
git clone https://github.com/yourusername/OllamaPDFInsight.git
cd OllamaPDFInsight
- Prepare the data and the Weaviate vector store:

  ```shell
  python load_data.py
  ```
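Before documents can be embedded, the extracted PDF text is split into overlapping chunks. A hypothetical sketch of that splitting step (the repo most likely uses a LangChain text splitter; the chunk sizes and function name here are assumptions for illustration):

```python
# Hypothetical stand-in for the document-splitting step in load_data.py:
# fixed-size character windows with overlap, so context is not cut mid-idea.
def split_text(text: str, chunk_size: int = 200, overlap: int = 50):
    """Return overlapping character chunks of the extracted PDF text."""
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

pages = "some extracted PDF text " * 40  # stand-in for the loader's output
chunks = split_text(pages)
print(len(chunks), len(chunks[0]))
```

Each chunk would then be embedded via Ollama and stored as an object in the Weaviate collection.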
- Prepare the prompt template and the LLM, then run the prompt:

  ```shell
  python retrieve_context.py
  ```
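A hedged sketch of what this step looks like: fill a prompt template with the retrieved chunks, then send it to a locally running Ollama server. The template wording, model name, and helper names below are assumptions, not the repo's actual code; only the `/api/generate` endpoint and payload shape come from Ollama's documented REST API.

```python
# Hypothetical sketch of retrieve_context.py: stuff retrieved context into a
# prompt template and query a local Ollama server over its REST API.
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default endpoint

def build_prompt(question: str, context_chunks: list) -> str:
    """Fill a simple RAG template with the retrieved context."""
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

def ask_ollama(prompt: str, model: str = "llama3") -> str:
    """POST the prompt to Ollama and return the generated text."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode()
    req = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

prompt = build_prompt("What is the PDF about?", ["chunk one", "chunk two"])
# ask_ollama(prompt) would return the answer; it requires a running Ollama server.
```

In the actual script, the context chunks would come from a Weaviate similarity search rather than being hard-coded.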
I essentially followed the steps in this article and added my own touches.