You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
"CPU Checkpointing/Contiguous Checkpointing is only available with partitioned activations."
It seems to me like one could be implemented without the other? Or am I missing something.
This is not an issue with the code, just a question to firm up my understanding.
Thanks in advance!
I am trying to better understand offloading activations to CPU (
PA_TO_CPU
) here https://github.com/microsoft/DeepSpeed/blob/master/deepspeed/runtime/activation_checkpointing/checkpointing.py#L730 and wondering whyIt seems to me like one could be implemented without the other? Or am I missing something.
This is not an issue with the code, just a question to firm up my understanding.
Thanks in advance!
Tagging @jeffra based on git-blame.
The text was updated successfully, but these errors were encountered: