Skip to content

Commit

Permalink
update readme for sdpa
Browse files Browse the repository at this point in the history
  • Loading branch information
shang-mt committed Dec 19, 2023
1 parent 5fae218 commit a1c98e3
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion training/mthreads/llama2_7b-deepspeed/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@

- ##### 优化策略

-
- scaled dot product attention

### 运行情况

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -3,5 +3,5 @@
datafilename = "openwebtext_llama2_100M.npy"
epochs = 1
theoryflops = 98000000000000.0
flashattn = True
flashattn = True # sdpa
gradient_checkpointing_enable = True

0 comments on commit a1c98e3

Please sign in to comment.