-
Notifications
You must be signed in to change notification settings - Fork 9.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FSDP T5 Example not working #1210
Comments
and i have this problem on save and load sharded too... |
I am also facing the same issue...Any solution to this? |
Facing the same issue. |
facing the same issue |
facing the same issue, Any solution to this? thanks. |
I fixed and merged this on main by disabling activation checkpointing #1273 By changing the below line in - fsdp_activation_checkpointing: bool=True
+ fsdp_activation_checkpointing: bool=False Will look for a proper fix next |
Context
Your Environment
Expected Behavior
training well
Current Behavior
error raised and training stop
Possible Solution
Steps to Reproduce
TypeError: T5Block.forward() got an unexpected keyword argument 'offload_to_cpu'
...
Failure Logs [if any]
The text was updated successfully, but these errors were encountered: