Add tool to easily assess model compatibility #178
Comments
The goal here is to predict whether or not .train() is going to complete successfully, given
I see two ways to accomplish this, so if I am off base feel free to advise:
I don't see a way to predict whether I do have several bash shell scripts that handle the overhead of setting up a training session (fetching the model, public or 'bring your own', setting up the output, etc.). I do not know how much help that would be, but I could look into parameterizing them further.
@olson-ibm let's start with 2, i.e.
With train, we can set 1 epoch so that it doesn't keep going, and not save the model at the end. Later on I imagine we can also add a
Can't wait to see what your thinking is here :) PR on the rest of the above will be out shortly...
Signed-off-by: Joe Olson <[email protected]>
Gaurav says to leverage the 'estimate' module for compatibility testing without doing actual training.
Moving this back to ToDo for now since Joe is working on a more pressing task in the internal repo.
Description
As we explore support for more models, more tuning techniques, larger models, and multi-GPU vs. single-GPU setups with various context sizes, we often need to run tests to figure out whether a model is compatible and under which configuration (single-GPU vs. multi-GPU).
This story is to create a script that automates some of the above exploration and produces output that is easier to document (or that automatically generates a markdown file).
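To make the idea concrete, here is a minimal sketch of what such a script could look like. It is not the project's implementation: `run_trials`, `to_markdown`, `TrialResult`, and the `train_fn(model=..., technique=..., num_gpus=...)` signature are all hypothetical names chosen for illustration. The caller supplies whatever function actually launches a short training run (for example, a 1-epoch run that does not save the model), and the sketch only records whether that call raised.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class TrialResult:
    """Outcome of one compatibility trial (hypothetical structure)."""
    model: str
    technique: str
    num_gpus: int
    ok: bool
    detail: str


def run_trials(
    train_fn: Callable[..., object],
    configs: Iterable[Tuple[str, str, int]],
) -> List[TrialResult]:
    """Call train_fn once per (model, technique, num_gpus) and record pass/fail.

    train_fn is an assumed, caller-provided wrapper around a short training
    run; any exception it raises is treated as "not compatible".
    """
    results = []
    for model, technique, num_gpus in configs:
        try:
            train_fn(model=model, technique=technique, num_gpus=num_gpus)
            results.append(TrialResult(model, technique, num_gpus, True, ""))
        except Exception as exc:  # record the failure instead of aborting the sweep
            detail = f"{type(exc).__name__}: {exc}"
            results.append(TrialResult(model, technique, num_gpus, False, detail))
    return results


def to_markdown(results: Iterable[TrialResult]) -> str:
    """Render trial results as a markdown table suitable for documentation."""
    lines = [
        "| Model | Technique | GPUs | Compatible | Detail |",
        "| --- | --- | --- | --- | --- |",
    ]
    for r in results:
        status = "yes" if r.ok else "no"
        # Escape pipes and flatten newlines so error text cannot break the table.
        detail = r.detail.replace("|", "\\|").replace("\n", " ")
        lines.append(
            f"| {r.model} | {r.technique} | {r.num_gpus} | {status} | {detail} |"
        )
    return "\n".join(lines)
```

Passing a list of `(model, technique, num_gpus)` tuples to `run_trials` and writing the string returned by `to_markdown` to a file would give a compatibility table that can be checked into the docs. Catching a broad `Exception` is deliberate here so that one incompatible configuration does not stop the rest of the sweep.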
Discussion
Provide detailed discussion here
Acceptance Criteria