Telco-RAG: Retrieval-Augmented Generation for Telecommunications (on device inference)

Telco-RAG is a specialized Retrieval-Augmented Generation (RAG) framework designed to address the unique challenges of the telecommunications industry, particularly in handling the complexity and rapid evolution of 3GPP documents.

References

Bornea, A.-L., Ayed, F., De Domenico, A., Piovesan, N., & Maatouk, A. (2024). Telco-RAG: Navigating the Challenges of Retrieval-Augmented Language Models for Telecommunications. arXiv preprint arXiv:2404.15939. DOI | Read the paper

Features

Custom RAG Pipeline: Specifically tailored to handle the complexities of telecommunications standards.
Enhanced Query Processing: Implements a dual-stage query enhancement and retrieval process, improving the accuracy and relevance of generated responses.
Hyperparameter Optimization: Optimized for the best performance by fine-tuning chunk sizes, context length, and embedding models.
NN Router: A neural network-based router that enhances document retrieval efficiency while significantly reducing RAM usage.
Open-Source: Freely available for the community to use, adapt, and improve.

Presentation Video

The video is presented at 1.5x speed.

Getting Started

To get started with Telco-RAG, clone the repository and set up the environment:

git clone https://github.com/netop-team/telco-rag.git
cd telco-rag

Prerequisites

Python 3.11
Node.js (npm version 10.8.2 and node version 20.17.0 are validated, please use install instructions here if you're versions are different: https://linuxize.com/post/how-to-install-node-js-on-ubuntu-22-04/)

Other dependencies are listed in requirements.txt.

Installation

Install the necessary Python packages and download the 3GPP knowledge database:

cd ./Telco-RAG_api
pip install -r requirements.txt
python setup.py

Get access to llama2-7b

Follow initial steps to create HF token and gain access to gated llama repo: https://medium.com/@lucnguyen_61589/llama-2-using-huggingface-part-1-3a29fdbaa9ed

Use HF cli to login: https://huggingface.co/docs/huggingface_hub/en/guides/cli

Running the Full Pipeline

To run the full pipeline, open the first terminal window and execute the following commands:

npm install
npm run dev

This will open two terminals: one for the frontend and one for the Telco-RAG backend. You can access the frontend via your browser at http://localhost:3000/.

Running Only the API Server

Open a second terminal window and run the API server using this command:

cd ./Telco-RAG_api
uvicorn api.deploy_api:app --reload

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Telco-RAG_api		Telco-RAG_api
components		components
pages		pages
styles		styles
types		types
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
README.md		README.md
license		license
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
video_720p.gif		video_720p.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Telco-RAG: Retrieval-Augmented Generation for Telecommunications (on device inference)

References

Features

Presentation Video

Getting Started

Prerequisites

Installation

Get access to llama2-7b

Running the Full Pipeline

Running Only the API Server

License

About

Releases

Packages

Languages

License

plischwe/Telco-RAG

Folders and files

Latest commit

History

Repository files navigation

Telco-RAG: Retrieval-Augmented Generation for Telecommunications (on device inference)

References

Features

Presentation Video

Getting Started

Prerequisites

Installation

Get access to llama2-7b

Running the Full Pipeline

Running Only the API Server

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages