5x Error Reduction in RAG with gpt-3.5-turbo-0613 Finetuning #678
Conversation
* feat(fine-tuned-RAG): add DatasetPrep.ipynb file for dataset preparation
* feat(DatasetPrep.ipynb): add code for downloading validation.json file
* docs(ModelFinetune.ipynb): add comments to code
* …Qdrant to improve RAG model
* feat(fine-tuned-RAG): use Qdrant for finetuning and inference changes
* fix(ModelFinetune.ipynb): update generated answer for few-shot question
* fix(ModelFinetune.ipynb): update count percentages for different answer types
* feat(ModelFinetune.ipynb): add new sections for Qdrant integration and Few-Shot Learning
* fix(ModelFinetune.ipynb): fix typo in section title
* … notebook
* fix(ModelFinetune.ipynb): fix heading for the setting up section
* fix(ModelFinetune.ipynb): fix heading for the data preparation section
* …to the blog post
* feat(ModelFinetune.ipynb): add section on why to read the blog post
* …n section
* feat(ModelFinetune.ipynb): add information about reducing hallucinations in the introduction section
* …f the notebook and target audience
* …ss for fine-tuning the OpenAI model
* …valuation section
* …ne-tuning chat models
* …nd add more details in the comment
Love this cookbook, it's going to be a great addition. However, the messages aren't entirely clear once you get to the evaluation, especially the bottom "Comparison & Results" section, and the method you're using in the Few-Shot Learning section is similarly unclear.
Can you clear these up and ask for another review? The other change is a housekeeping one: can you please move this notebook into the existing "fine-tuned-qa" directory in the parent "examples" folder?
* …ding for plotting the results
* …tent
* docs(ft_retrieval_augmented_generation.ipynb): add instructions and insights to the results breakdown
* … and update section numbering
* feat(ft_retrieval_augmented_generation.ipynb): add new section for evaluation
* fix(ft_retrieval_augmented_generation.ipynb): fix section numbering and update section
* …ft_retrieval_augmented_generation_qdrant.ipynb
* …all command
* feat(ft_retrieval_augmented_generation_qdrant.ipynb): add cell to set OpenAI and Qdrant keys
I like the updated version; it's clearer what the trade-offs are between each approach and how you can optimize them. I have some remaining non-blocking comments, which I'll raise via a separate PR.
Happy to merge this one.
Outline
Data Preparation: We use a subset of SQuADv2 and generate answers using OpenAI's GPT-3.5-Turbo model. This serves as our baseline for performance comparison.
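As a rough illustration of this baseline step, the snippet below assembles a RAG-style chat prompt from a SQuAD-style question/context pair; the function name, system prompt, and fields are hypothetical, not the notebook's exact code.

```python
# Hypothetical sketch: build chat messages that ground the model in retrieved
# context, so the baseline GPT-3.5-Turbo answer can be compared against SQuADv2.
def build_rag_prompt(question: str, context: str) -> list[dict]:
    """Assemble chat messages for a context-grounded question."""
    system = (
        "Answer the question using only the provided context. "
        "If the answer is not in the context, say \"I don't know\"."
    )
    user = f"Context:\n{context}\n\nQuestion: {question}"
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": user},
    ]

messages = build_rag_prompt(
    "When was the Eiffel Tower completed?",
    "The Eiffel Tower was completed in 1889 for the World's Fair in Paris.",
)
# These messages can then be sent to the chat completions endpoint, e.g.
# openai.ChatCompletion.create(model="gpt-3.5-turbo-0613", messages=messages)
```

The instruction to say "I don't know" matters for SQuADv2, which deliberately includes unanswerable questions.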
Evaluation Metrics: We write an Evaluation class to assess the performance of the initial RAG model on our dataset. This sets the stage for fine-tuning by providing a quantitative measure of the model's initial capabilities.
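A minimal sketch of what such an Evaluation class might look like, bucketing each generated answer as correct, a hallucination, or an honest "I don't know" (the bucket names and string-matching logic here are assumptions, not the notebook's exact implementation):

```python
from collections import Counter

class Evaluator:
    """Hypothetical evaluator: bucket generated answers against gold answers."""
    IDK_MARKERS = ("i don't know", "i do not know")

    def categorize(self, generated: str, gold_answers: list[str]) -> str:
        g = generated.strip().lower()
        if any(m in g for m in self.IDK_MARKERS):
            # SQuADv2 includes unanswerable questions, so "I don't know"
            # is the right answer when there are no gold answers.
            return "idk_correct" if not gold_answers else "idk_wrong"
        if any(ans.lower() in g for ans in gold_answers):
            return "correct"
        return "hallucination"

    def evaluate(self, pairs) -> dict:
        """pairs: iterable of (generated, gold_answers); returns % per bucket."""
        counts = Counter(self.categorize(gen, gold) for gen, gold in pairs)
        total = sum(counts.values())
        return {k: 100 * v / total for k, v in counts.items()}

ev = Evaluator()
print(ev.categorize("It was completed in 1889.", ["1889"]))  # → correct
```

Percentages per bucket are what make before/after fine-tuning runs directly comparable.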
Fine-Tuning Setup: We convert the dataset into a JSONL format that's compatible with OpenAI's fine-tuning process and create a fine-tuning job, targeting improvements in the model's answer-generating capabilities.
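The JSONL format for chat fine-tuning is one JSON object per line with a "messages" list; a hedged sketch of the conversion (record fields and prompts are illustrative):

```python
import json

def to_finetune_record(question: str, context: str, answer: str) -> dict:
    """Illustrative: one training example in OpenAI's chat fine-tuning format."""
    return {
        "messages": [
            {"role": "system",
             "content": "Answer from the context; otherwise say \"I don't know\"."},
            {"role": "user",
             "content": f"Context:\n{context}\n\nQuestion: {question}"},
            {"role": "assistant", "content": answer},
        ]
    }

rows = [
    to_finetune_record(
        "Who built the tower?",
        "Gustave Eiffel's company built the tower.",
        "Gustave Eiffel's company",
    )
]
jsonl = "\n".join(json.dumps(r) for r in rows)
# Write jsonl to e.g. train.jsonl, upload it, and create the job, roughly:
#   file = openai.File.create(file=open("train.jsonl", "rb"), purpose="fine-tune")
#   openai.FineTuningJob.create(training_file=file.id, model="gpt-3.5-turbo")
```

Including the same system prompt used at inference time in the training records keeps the fine-tuned model's behavior consistent with how it will be called.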
Performance Comparison: After fine-tuning, we run the model on the same dataset and use the Evaluator again to quantify the improvements gained from fine-tuning. We see the error rate fall from ~50% of questions to ~10%.
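The "5x" headline follows directly from those two approximate figures:

```python
# Both rates are the approximate figures stated above, not newly measured.
baseline_error = 0.50   # share of questions answered incorrectly before fine-tuning
finetuned_error = 0.10  # share after fine-tuning

reduction_factor = baseline_error / finetuned_error
print(f"{reduction_factor:.0f}x error reduction")  # → 5x error reduction
```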
Documentation: The entire process is documented with comments, aimed at aiding anyone looking to fine-tune OpenAI models for RAG.