llm-finetuning-2

Finetuning Mistral-7B using LoRA and DeepSpeed

In this demo, we finetune Mistral-7B using LoRA and DeepSpeed. We ran LoRA on two 80 GB A100 GPUs, and DeepSpeed on two, four, and eight 80 GB A100 GPUs.

To get started, first install Determined on your local machine:

pip install determined
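
The det CLI submits experiments to a Determined cluster, so it needs to know where your master is running. As a hedged example (assuming a master is already up and reachable on the default port 8080; the hostname below is a placeholder):

export DET_MASTER=http://<master-hostname>:8080  # point the CLI at your Determined master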

Then finetune with LoRA:

det e create lora.yaml . 

Or finetune with DeepSpeed:

det e create deepspeed.yaml . 

You can view the actual training code in finetune.py.

Configuration

Change configuration options in lora.yaml or deepspeed.yaml. Some important options are:

  • slots_per_trial: the number of GPUs to use.
  • dataset_subset: the difficulty subset to train on.
  • per_device_train_batch_size: the batch size per GPU.

The results in our blog post were obtained using per_device_train_batch_size: 1 and per_device_eval_batch_size: 4.

DeepSpeed configuration files are in the ds_configs folder.
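
For orientation, here is a minimal sketch of how these options are typically laid out in a Determined experiment config. The exact nesting in lora.yaml and deepspeed.yaml may differ, so treat this as illustrative rather than verbatim:

resources:
  slots_per_trial: 2              # number of GPUs to use
hyperparameters:
  dataset_subset: easy            # one of "easy", "medium", or "hard"
  per_device_train_batch_size: 1  # batch size per GPU
  per_device_eval_batch_size: 4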

Testing

Test your model's generation capabilities:

python inference.py --exp_id <exp_id> --dataset_subset <dataset_subset>

Where:

  • <exp_id> is the ID of your finetuning experiment in the Determined UI.
  • <dataset_subset> is one of "easy", "medium", or "hard".

If you're testing a LoRA model, then add --lora to the above command.

To use CPU instead of GPU, add --device cpu.
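
Putting these flags together, evaluating a LoRA checkpoint on CPU looks like:

python inference.py --exp_id <exp_id> --dataset_subset medium --lora --device cpu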

To test the pretrained model (not finetuned), leave out --exp_id. For example:

python inference.py --dataset_subset easy

Validating the tokenizer

Plot the distribution of dataset sample lengths, and see how many samples will be truncated by the tokenizer:

python validate_tokenizer.py
