Allow DeepSpeed init to accept dictionary instead of args #81
Comments
+1
While implementing this enhancement, we will consider removing the --deepspeed flag, since it is not used internally in DeepSpeed code. It should be up to users whether they implement DeepSpeed and non-DeepSpeed versions of their code controlled by a flag.
* Sparse attention Co-authored-by: Arash Ashari <[email protected]>
* fix transformer kernel call parameters * remove post-ln modeling
We should remove the need for args to be passed to deepspeed init. Ideally, it should accept a dictionary that represents the entire configuration, so that we can write smaller tests.
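A minimal sketch of what this could look like, not the actual DeepSpeed implementation. The helper name `resolve_config` is hypothetical, and the `deepspeed_config` attribute and `train_batch_size` key are assumed here for illustration. It shows init logic accepting either a dictionary or an args object that points to a JSON file:

```python
import argparse
import json
import os
import tempfile


def resolve_config(args=None, config=None):
    """Hypothetical helper: return the settings as a plain dict.

    Accepts either a dict directly, or an args object carrying a
    `deepspeed_config` path to a JSON file (assumed attribute name).
    """
    if config is not None:
        if not isinstance(config, dict):
            raise TypeError("config must be a dict")
        return dict(config)  # shallow copy so the caller's dict is not shared
    path = getattr(args, "deepspeed_config", None)
    if path is None:
        raise ValueError("pass a config dict or args.deepspeed_config")
    with open(path) as f:
        return json.load(f)


# A test can build its settings inline, with no argparse and no temp file.
inline = resolve_config(config={"train_batch_size": 8})

# The args-based path keeps working for existing scripts.
with tempfile.TemporaryDirectory() as tmp:
    cfg_path = os.path.join(tmp, "ds_config.json")
    with open(cfg_path, "w") as f:
        json.dump({"train_batch_size": 8}, f)
    from_args = resolve_config(args=argparse.Namespace(deepspeed_config=cfg_path))
```

With something along these lines, a unit test would only need a dict literal, which is the "smaller tests" benefit described above.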