Add PebbloRetrievalQA Chain with Identity & Semantic Enforcement support

daxa-ai · Mar 26, 2024 · 498bb18
1 parent c93d4ea
commit 498bb18

Showing 7 changed files with 726 additions and 0 deletions.
diff --git a/docs/docs/use_cases/question_answering/pebblo_retrieval_qa.ipynb b/docs/docs/use_cases/question_answering/pebblo_retrieval_qa.ipynb
@@ -0,0 +1,399 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "3ce451e9-f8f1-4f27-8c6b-4a93a406d504",
+   "metadata": {},
+   "source": [
+    "# Identity-enabled RAG using PebbloRetrievalQA\n",
+    "\n",
+    "> PebbloRetrievalQA is a Retrieval chain with Identity & Semantic Enforcement for question-answering\n",
+    "against a vector database.\n",
+    "\n",
+    "This notebook covers how to retrieve documents with Identity & Semantic Enforcement.\n",
+    "\n",
+    "To start, we will load documents with authorization metadata into an in-memory Qdrant vector database and use that database as the retriever in PebbloRetrievalQA. Next, we will define an \"ask\" function that builds the PebbloRetrievalQA chain from the retriever and a provided *auth_context*. Finally, we will ask it questions with the authorization context of an authorized user and of an unauthorized user."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "4ee16b6b-5dac-4b5c-bb69-3ec87398a33c",
+   "metadata": {},
+   "source": [
+    "## Setup\n",
+    "\n",
+    "### Dependencies\n",
+    "\n",
+    "We'll use an OpenAI LLM, OpenAI embeddings and a Qdrant vector store in this walkthrough.\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "e68494fa-f387-4481-9a6c-58294865d7b7",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Note: you may need to restart the kernel to use updated packages.\n"
+     ]
+    }
+   ],
+   "source": [
+    "%pip install --upgrade --quiet langchain langchain-community langchain-openai qdrant-client"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "61498d51-0c38-40e2-adcd-19dfdf4d37ef",
+   "metadata": {},
+   "source": [
+    "### Identity-aware Data Ingestion\n",
+    "\n",
+    "Here we are using Qdrant as a vector database; however, you can use any of the supported vector databases.\n",
+    "\n",
+    "**PebbloRetrievalQA chain supports the following vector databases:**\n",
+    "- Qdrant\n",
+    "- Pinecone\n",
+    "- Weaviate\n",
+    "\n",
+    "\n",
+    "**Load vector database with authorization information in metadata:**\n",
+    "\n",
+    "In this step, we capture the authorization information of the source document into the `authorized_identities` field within the metadata of the VectorDB entry for each chunk. \n",
+    "\n",
+    "Example:\n",
+    "```\n",
+    "{\n",
+    "    \"page_content\": \"Employee leave-of-absence policy ...\",\n",
+    "    \"metadata\": {\n",
+    "        \"authorized_identities\": [\"hr-support\", \"hr-leadership\"],\n",
+    "        ...\n",
+    "    }\n",
+    "    ...\n",
+    "},\n",
+    "...\n",
+    "\n",
+    "```\n",
+    "\n",
+    "\n",
+    "*NOTE: To use the PebbloRetrievalQA chain, you always need to place authorization metadata in the `authorized_identities` field, which must be a list of strings.*"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "ae4fcbc1-bdc3-40d2-b2df-8c82cad1f89c",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Loading vectordb...\n"
+     ]
+    }
+   ],
+   "source": [
+    "from langchain_core.documents import Document\n",
+    "from langchain_openai.embeddings import OpenAIEmbeddings\n",
+    "from langchain_community.vectorstores.qdrant import Qdrant\n",
+    "from langchain_openai.llms import OpenAI\n",
+    "\n",
+    "llm = OpenAI()\n",
+    "embeddings = OpenAIEmbeddings()\n",
+    "collection_name = \"pebblo-identity-rag\"\n",
+    "\n",
+    "page_content = '''\n",
+    "Performance Report: John Smith\n",
+    "Employee Information:\n",
+    "    •Name: John Smith\n",
+    "    •Employee ID: JS12345\n",
+    "    •Department: Sales\n",
+    "    •Position: Sales Representative\n",
+    "    •Review Period: January 1, 2023 - December 31, 2023\n",
+    "\n",
+    "Performance Summary: \n",
+    "John Smith has demonstrated commendable performance as a Sales Representative during the review period. \n",
+    "He consistently met and often exceeded sales targets, contributing significantly to the department's success.\n",
+    "His dedication, professionalism, and collaborative approach have been instrumental in fostering positive \n",
+    "relationships with clients and colleagues alike.\n",
+    "\n",
+    "Key Achievements:\n",
+    "•Exceeded sales targets by 20% for the fiscal year, demonstrating exceptional sales acumen and strategic planning skills.\n",
+    "•Successfully negotiated several high-value contracts, resulting in increased revenue and client satisfaction.\n",
+    "•Proactively identified opportunities for process improvement within the sales team, \n",
+    "    leading to streamlined workflows and enhanced efficiency.\n",
+    "•Received positive feedback from clients and colleagues for excellent communication skills, responsiveness, and customer service.\n",
+    "\n",
+    "Areas for Development: While John's performance has been exemplary overall,\n",
+    "there are opportunities for further development in certain areas:\n",
+    "•Continued focus on expanding product knowledge to better address client needs and provide tailored solutions.\n",
+    "•Enhancing time management skills to prioritize tasks effectively and maximize productivity during busy periods.\n",
+    "•Further development of leadership abilities to support and mentor junior team members within the sales department.\n",
+    "\n",
+    "Conclusion: In conclusion, John Smith has delivered outstanding results as a Sales Representative at ACME Corp. \n",
+    "His dedication, performance, and commitment to excellence reflect positively on the organization.\n",
+    "'''\n",
+    "\n",
+    "documents = [\n",
+    "    Document(\n",
+    "        page_content=page_content,\n",
+    "        metadata={\n",
+    "            \"authorized_identities\": [\"hr-support\", \"hr-leadership\"],\n",
+    "            \"page\": 0,\n",
+    "            \"source\": \"https://drive.google.com/file/d/xxxxxxxxxxxxx/view\",\n",
+    "            \"title\": \"Performance Report- John Smith.pdf\",\n",
+    "        },\n",
+    "    )\n",
+    "]\n",
+    "\n",
+    "print(\"Loading vectordb...\")\n",
+    "\n",
+    "vectordb = Qdrant.from_documents(\n",
+    "    documents,\n",
+    "    embeddings,\n",
+    "    location=\":memory:\",\n",
+    "    collection_name=collection_name,\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "f630bb8b-67ba-41f9-8715-76d006207e75",
+   "metadata": {},
+   "source": [
+    "## Retrieval with Identity & Semantic Enforcement\n",
+    "\n",
+    "The PebbloRetrievalQA chain uses SafeRetrieval to ensure that the snippets supplied as in-context material are retrieved only from documents the user is authorized to access.\n",
+    "To achieve this, the Gen-AI application needs to provide an authorization context for this retrieval chain.\n",
+    "This *auth_context* should be filled with the identity and authorization groups of the user accessing the Gen-AI app.\n",
+    "\n",
+    "\n",
+    "Here is sample code for PebbloRetrievalQA in which the `authorized_identities` of the user accessing the RAG application are passed in `auth_context`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "id": "e978bee6-3a8c-459f-ab82-d380d7499b36",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.chains import PebbloRetrievalQA\n",
+    "\n",
+    "def ask(question: str, auth_context: dict):\n",
+    "    \"\"\"Answer `question` using only documents that `auth_context` authorizes.\"\"\"\n",
+    "    qa_chain = PebbloRetrievalQA.from_chain_type(\n",
+    "        llm=llm,\n",
+    "        chain_type=\"stuff\",\n",
+    "        retriever=vectordb.as_retriever(),\n",
+    "        auth_context=auth_context,\n",
+    "    )\n",
+    "    return qa_chain.invoke(question)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "7a267e96-70cb-468f-b830-83b65e9b7f6f",
+   "metadata": {},
+   "source": [
+    "### Questions by Authorized User\n",
+    "\n",
+    "We ingested data for authorized identities [\"hr-support\", \"hr-leadership\"], so a user with the authorized identity/group \"hr-support\" should receive the correct answer."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "id": "2688fc18-1eac-45a5-be55-aabbe6b25af5",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Question: Please share the performance report for John Smith?\n",
+      "\n",
+      "Answer: \n",
+      "\n",
+      "The performance report for John Smith shows that he consistently met and exceeded sales targets, successfully negotiated high-value contracts, identified opportunities for process improvement, and received positive feedback from clients and colleagues. Areas for development include expanding product knowledge, enhancing time management skills, and further developing leadership abilities. Overall, John has delivered outstanding results as a Sales Representative at ACME Corp.\n",
+      "\n"
+     ]
+    }
+   ],
+   "source": [
+    "auth = {\n",
+    "    \"authorized_identities\": [\n",
+    "        \"hr-support\",\n",
+    "    ]\n",
+    "}\n",
+    "\n",
+    "question = \"Please share the performance report for John Smith?\"\n",
+    "resp = ask(question, auth)\n",
+    "print(f\"Question: {question}\\n\\nAnswer: {resp['result']}\\n\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "b4db6566-6562-4a49-b19c-6d99299b374e",
+   "metadata": {},
+   "source": [
+    "### Questions by Unauthorized User\n",
+    "\n",
+    "Since the user's authorized identity/group \"eng-support\" is not included in the authorized identities [\"hr-support\", \"hr-leadership\"], we should not receive an answer."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "id": "2d736ce3-6e05-48d3-a5e1-fb4e7cccc1ee",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Question: Please share the performance report for John Smith?\n",
+      "\n",
+      "Answer:  I don't know, do you have any specific performance reports or information about John Smith that you would like me to share?\n",
+      "\n"
+     ]
+    }
+   ],
+   "source": [
+    "auth = {\n",
+    "    \"authorized_identities\": [\n",
+    "        \"eng-support\",\n",
+    "    ]\n",
+    "}\n",
+    "\n",
+    "question = \"Please share the performance report for John Smith?\"\n",
+    "resp = ask(question, auth)\n",
+    "print(f\"Question: {question}\\n\\nAnswer: {resp['result']}\\n\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "33a8afe1-3071-4118-9714-a17cba809ee4",
+   "metadata": {},
+   "source": [
+    "### Using PromptTemplate to provide additional instructions\n",
+    "You can use PromptTemplate to provide additional instructions to the LLM for generating a custom response."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "id": "59c055ba-fdd1-48c6-9bc9-2793eb47438d",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain_core.prompts import PromptTemplate\n",
+    "prompt_template = PromptTemplate.from_template('''\n",
+    "Answer the question using the provided context. \n",
+    "If no context is provided, just say \"I'm sorry, but that information is unavailable, or Access to it is restricted.\".\n",
+    "\n",
+    "Question: {question}\n",
+    "''')\n",
+    "\n",
+    "question = \"Please share the performance summary for John Smith?\"\n",
+    "prompt = prompt_template.format(question=question)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "c4d27c00-73d9-4ce8-bc70-29535deaf0e2",
+   "metadata": {},
+   "source": [
+    "#### Questions by Authorized User"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 19,
+   "id": "e68a13a4-b735-421d-9655-2a9a087ba9e5",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Question: Please share the performance summary for John Smith?\n",
+      "\n",
+      "Answer:  \n",
+      "\n",
+      "John Smith has demonstrated commendable performance as a Sales Representative during the review period. He consistently met and often exceeded sales targets, contributing significantly to the department's success. His dedication, professionalism, and collaborative approach have been instrumental in fostering positive relationships with clients and colleagues alike.\n",
+      "\n"
+     ]
+    }
+   ],
+   "source": [
+    "auth = {\n",
+    "    \"authorized_identities\": [\n",
+    "        \"hr-support\",\n",
+    "    ]\n",
+    "}\n",
+    "resp = ask(prompt, auth)\n",
+    "print(f\"Question: {question}\\n\\nAnswer: {resp['result']}\\n\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "7b97a9ca-bdc6-400a-923d-65a8536658be",
+   "metadata": {},
+   "source": [
+    "#### Questions by Unauthorized User"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 21,
+   "id": "438e48c6-96a2-4d5e-81db-47f8c8f37739",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Question: Please share the performance summary for John Smith?\n",
+      "\n",
+      "Answer: \n",
+      "I'm sorry, but that information is unavailable, or Access to it is restricted.\n",
+      "\n"
+     ]
+    }
+   ],
+   "source": [
+    "auth = {\n",
+    "    \"authorized_identities\": [\n",
+    "        \"eng-support\",\n",
+    "    ]\n",
+    "}\n",
+    "resp = ask(prompt, auth)\n",
+    "print(f\"Question: {question}\\n\\nAnswer: {resp['result']}\\n\")"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.7"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
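
The access rule that the notebook's outputs imply (a chunk is usable when the user's `authorized_identities` overlap the chunk's `authorized_identities`) can be illustrated with a minimal, dependency-free sketch. This is not PebbloRetrievalQA's implementation. `is_authorized` and `filter_chunks` are hypothetical helper names, and any-overlap matching is an assumption inferred from the "hr-support" and "eng-support" examples in the notebook.

```python
from typing import Dict, List


def is_authorized(metadata: Dict, auth_context: Dict) -> bool:
    """Return True when the chunk and the user share at least one identity.

    Assumption: any single overlapping identity grants access.
    """
    allowed = set(metadata.get("authorized_identities", []))
    user = set(auth_context.get("authorized_identities", []))
    return bool(allowed & user)


def filter_chunks(chunks: List[Dict], auth_context: Dict) -> List[Dict]:
    """Keep only the chunks the user may see (hypothetical helper)."""
    return [c for c in chunks if is_authorized(c["metadata"], auth_context)]


# One chunk shaped like the metadata example in the notebook.
chunks = [
    {
        "page_content": "Performance Report: John Smith ...",
        "metadata": {"authorized_identities": ["hr-support", "hr-leadership"]},
    }
]

hr_view = filter_chunks(chunks, {"authorized_identities": ["hr-support"]})
eng_view = filter_chunks(chunks, {"authorized_identities": ["eng-support"]})

print(len(hr_view))   # 1
print(len(eng_view))  # 0
```

With the single ingested chunk, the "hr-support" context keeps the chunk while the "eng-support" context is left with nothing to pass to the LLM, which is consistent with the refusal-style answers shown for the unauthorized user.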