diff --git a/docs/_tutorials/pipeline.md b/docs/_tutorials/pipeline.md index afe68f0b1112..35865d24fcca 100644 --- a/docs/_tutorials/pipeline.md +++ b/docs/_tutorials/pipeline.md @@ -82,7 +82,7 @@ are present, DeepSpeed will also use hybrid data parallelism. stages. {: .notice--info} -**Note:** For large model training, see [memory-efficient model construction](#memory-efficient-module-initialization). +**Note:** For large model training, see [memory-efficient model construction](#memory-efficient-model-initialization). {: .notice--info} ### AlexNet