Skip to content

Commit

Permalink
instruction finetune README improvement (opea-project#897)
Browse files Browse the repository at this point in the history
Signed-off-by: rbrugaro <[email protected]>
  • Loading branch information
rbrugaro authored and ctao456 committed Oct 9, 2024
1 parent 400dffb commit 935f589
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion InstructionTuning/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# Instruction Tuning

Instruction tuning is the process of further training LLMs on a dataset consisting of (instruction, output) pairs in a supervised fashion, which bridges the gap between the next-word prediction objective of LLMs and the users' objective of having LLMs adhere to human instructions.
Instruction tuning is the process of further training LLMs on a dataset consisting of (instruction, output) pairs in a supervised fashion, which bridges the gap between the next-word prediction objective of LLMs and the users' objective of having LLMs adhere to human instructions. This implementation deploys a Ray cluster for the task.

## Deploy Instruction Tuning Service

Expand Down Expand Up @@ -38,6 +38,8 @@ curl http://${your_ip}:8015/v1/fine_tuning/jobs \
}'
```

The outputs of the finetune job (adapter_model.safetensors, adapter_config,json... ) are stored in `/home/user/comps/finetuning/output` and other execution logs are stored in `/home/user/ray_results`

### 3. Manage fine-tuning job

Below commands show how to list finetuning jobs, retrieve a finetuning job, cancel a finetuning job and list checkpoints of a finetuning job.
Expand Down

0 comments on commit 935f589

Please sign in to comment.