add links from fsdp and deepspeed docs. fix deepspeed imports
fabianlim committed May 1, 2024
1 parent 7183706 commit 3b1948f
Showing 3 changed files with 19 additions and 5 deletions.
10 changes: 5 additions & 5 deletions docs/source/package_reference/deepspeed.md
@@ -17,12 +17,12 @@ rendered properly in your Markdown viewer.

[[autodoc]] utils.DeepSpeedPlugin

-[[autodoc]] utils.DummyOptim
+[[autodoc]] utils.deepspeed.DummyOptim

-[[autodoc]] utils.DummyScheduler
+[[autodoc]] utils.deepspeed.DummyScheduler

-[[autodoc]] utils.DeepSpeedEngineWrapper
+[[autodoc]] utils.deepspeed.DeepSpeedEngineWrapper

-[[autodoc]] utils.DeepSpeedOptimizerWrapper
+[[autodoc]] utils.deepspeed.DeepSpeedOptimizerWrapper

-[[autodoc]] utils.DeepSpeedSchedulerWrapper
+[[autodoc]] utils.deepspeed.DeepSpeedSchedulerWrapper
7 changes: 7 additions & 0 deletions docs/source/usage_guides/deepspeed.md
@@ -729,3 +729,10 @@ Papers:

Finally, please, remember that 🤗 `Accelerate` only integrates DeepSpeed, therefore if you
have any problems or questions with regards to DeepSpeed usage, please, file an issue with [DeepSpeed GitHub](https://github.com/microsoft/DeepSpeed/issues).


+<Tip>
+
+For those interested in the similarities and differences between FSDP and DeepSpeed, please check out the [concept guide here](../concept_guides/fsdp_and_deepspeed.md)!
+
+</Tip>
7 changes: 7 additions & 0 deletions docs/source/usage_guides/fsdp.md
@@ -175,3 +175,10 @@ You can then pass `state` into the `save_pretrained` method. There are several

For more control, users can leverage the `FullyShardedDataParallelPlugin`. After creating an instance of this class, users can pass it to the Accelerator class instantiation.
For more information on these options, please refer to the PyTorch [FullyShardedDataParallel](https://github.com/pytorch/pytorch/blob/0df2e863fbd5993a7b9e652910792bd21a516ff3/torch/distributed/fsdp/fully_sharded_data_parallel.py#L236) code.


+<Tip>
+
+For those interested in the similarities and differences between FSDP and DeepSpeed, please check out the [concept guide here](../concept_guides/fsdp_and_deepspeed.md)!
+
+</Tip>
