Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
bachvudinh authored Nov 25, 2024
1 parent b0bf9ce commit c94d66d
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -144,9 +144,7 @@ accelerate launch --config_file ./accelerate_config.yaml train.py
```bash
tune download homebrewltd/llama3.1-s-whispervq-init --output-dir ../model_zoo/llama3.1-s-whispervq-init --ignore-patterns "original/consolidated*"
```

2. **Pretraining Multi GPU (1-8GPUs Supported)**
[NOTE] : Before start training, make sure you uploaded the resized embedding model to Hugging Face Hub:
[NOTE] : In case you want to use different base model, you can uploaded your own resized embedding model to Hugging Face Hub:
```python
# folder containing the checkpoint files
model_name = "meta-llama/Llama-3.2-3B-Instruct"
Expand All @@ -160,6 +158,8 @@ accelerate launch --config_file ./accelerate_config.yaml train.py
model.push_to_hub("<your_hf>/Llama3.1-s-whispervq-init")
tokenizer.push_to_hub("<your_hf>/Llama3.1-s-whispervq-init")
```
2. **Pretraining Multi GPU (1-8GPUs Supported)**
```
tune run --nproc_per_node <no-gpu> full_finetune_fsdp2 --config recipes/configs/jan-llama3-1-s/pretrain/8B_full.yaml
```
Expand Down

0 comments on commit c94d66d

Please sign in to comment.