🍊 tangerine (backend)

tangerine is a slim and light-weight RAG (Retieval Augmented Generated) system used to create and manage chat bot agents.

Each agent is intended to answer questions related to a set of documents known as a knowledge base (KB).

It relies on 4 key components:

A vector database (PostgresQL with the pgvector extension)
A large language model (LLM) hosted on any OpenAI-compatible API service.
An embedding model hosted on any OpenAI-compatible API service.
(optional) An S3 bucket that you wish to sync documentation from.

The backend service manages:

Creating/updating/deleting chat bot agents
Uploading documents to be used as context to assist the agents in answering questions
Document ingestion including cleanup/conversion, chunking, and embedding into the vector database.
Document chunk retrieval from the vector database.
Interfacing with the LLM to prompt it and stream responses
(optional) Interfacing with S3 to provide continuous document sync.

tangerine will work with any deployed instance of PostgresQL+pgvector and can be configured to use any OpenAI-compliant API service that is hosting a large language model or embedding model.

This repository provides Open Shift templates for all infrastructure (except for the model hosting service) as well as a docker compose file that allows you to spin it up locally and use ollama.

The accompanying frontend service is tangerine-frontend and a related plugin for Red Hat Developer Hub can be found here

This project is currently used by Red Hat's Hybrid Cloud Management Engineering Productivity Team. It was born out of a hack-a-thon and is still a work in progress. You will find some areas of code well developed while others are in need of attention and some tweaks to make it production-ready are needed (with that said, the project is currently in good enough shape to provide a working chat bot system).

Getting started

The project can be deployed to a local development environment using ollama to host the LLM and huggingface's text-embeddings-inference server to host the embedding model.

Local Environment Setup for Linux / Intel Macs

You may require further tweaks to properly make use of your GPU. Refer to the ollama docker image documentation.

Make sure git-lfs is installed:
- Fedora: sudo dnf install git-lfs
- MacOS: brew install git-lfs
Then, activate it globally with:
```
git lfs install
```
Create the directory which will house the local environment data:
```
mkdir data
```

Create a directory to house the embedding model and download the snowflake-arctic-embed-m-long model:

mkdir data/embeddings
git clone https://huggingface.co/Snowflake/snowflake-arctic-embed-m-long \
  data/embeddings/snowflake-arctic-embed-m-long

Invoke docker compose (postgres data will persist in data/postgres):
```
docker compose up --build
```
Pull the mistral LLM (data will persist in data/ollama):
```
docker exec tangerine-ollama ollama pull mistral
```
The API can now be accessed on http://localhost:5000

Local Environment Setup for Apple Silicon Macs

Some of the images used in the docker-compose.yml are unsupported on Apple silicon. In order to develop on those systems you will need to start some of the processes manually.

You'll need to have the following installed and working before proceeding:
- brew
- pipenv
- pyenv
- docker or podman
Install ollama
```
brew install ollama
```
Start ollama
```
ollama serve
```

Pull the language and embedding models

ollama pull mistral
ollama pull nomic-embed-text

Install the C API for Postgres (libpq)

brew install libpq

For Apple Silicon Macs, you'll need to export the following environment variables to avoid C library errors:

export PATH="/opt/homebrew/opt/libpq/bin:$PATH"
export LDFLAGS="-L/opt/homebrew/opt/libpq/lib"
export CPPFLAGS="-I/opt/homebrew/opt/libpq/include"

Start the vector database

docker run \
    -e POSTGRES_PASSWORD="citrus" \
    -e POSTGRES_USER="citrus" \
    -e POSTGRES_DB="citrus" \
    -e POSTGRES_HOST_AUTH_METHOD=trust \
    -p 5432:5432 \
    pgvector/pgvector:pg16

Prepare your python virtual environment:
```
pipenv install
pipenv shell
```
Start Tangerine Backend

[!NOTE] The default tangerine port, 5000, is already claimed by Bonjour on Macs, so we need to use a different port instead.
```
flask run --host=127.0.0.1 --port=8000
```
You can now communicate with the API on port 8000
```
curl -XGET 127.0.0.1:8000/api/agents
{
    "data": []
}
```

Available API Paths

Path	Method	Description
`/api/agents`	`GET`	Get a list of all agents
`/api/agents`	`POST`	Create a new agent
`/api/agents/<id>`	`GET`	Get an agent
`/api/agents/<id>`	`PUT`	Update an agent
`/api/agents/<id>`	`DELETE`	Delete an agent
`/api/agents/<id>/chat`	`POST`	Chat with an agent
`/api/agents/<id>/documents`	`POST`	Agent document uploads
`/api/agents/<id>/documents`	`DELETE`	Delete agent documents
`/api/agentDefaults`	`GET`	Get agent default settings
`/ping`

Syncrhonizing Documents from S3

You can configure a set of agents and continually sync their knowledge base via documents stored in an S3 bucket.

To do so you'll need to do the following:

Export environment variables that contain your S3 bucket auth info:

export AWS_ACCESS_KEY_ID="MYKEYID"
export AWS_DEFAULT_REGION="us-east-1"
export AWS_ENDPOINT_URL_S3="https://s3.us-east-1.amazonaws.com"
export AWS_SECRET_ACCESS_KEY="MYACCESSKEY"
export BUCKET="mybucket"

Create an s3.yaml file that describes your agents and the documents they should ingest. See s3-example.yaml for an example.
Run the S3 sync job:
```
flask s3sync
```

The sync creates agents and ingests the configured documents for each agent. After initial creation, when the task is run it checks the S3 bucket for updates and will only re-ingest files into the vector DB when it detects file changes.

The OpenShift templates contain a CronJob configuration that is used to run this document sync repeatedly.

Run Tangerine Frontend Locally

The API can be used to create/manage/update agents, upload documents, and to chat with each agent. However, the frontend provides a simpler interface to manage the service with. To run the UI in a development environment, see tangerine-frontend

Name		Name	Last commit message	Last commit date
Latest commit History 147 Commits
.tekton		.tekton
connectors		connectors
openshift		openshift
resources		resources
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
app.py		app.py
build.sh		build.sh
docker-compose.yml		docker-compose.yml
file_upload_cli.py		file_upload_cli.py
pytest.ini		pytest.ini
s3-example.yaml		s3-example.yaml
setup.cfg		setup.cfg
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🍊 tangerine (backend)

Getting started

Local Environment Setup for Linux / Intel Macs

Local Environment Setup for Apple Silicon Macs

Available API Paths

Syncrhonizing Documents from S3

Run Tangerine Frontend Locally

About

Releases

Packages

Contributors 10

Languages

RedHatInsights/tangerine-backend

Folders and files

Latest commit

History

Repository files navigation

🍊 tangerine (backend)

Getting started

Local Environment Setup for Linux / Intel Macs

Local Environment Setup for Apple Silicon Macs

Available API Paths

Syncrhonizing Documents from S3

Run Tangerine Frontend Locally

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 10

Languages

Packages