Add model ladder building blocks #114

Merged
epwalsh merged 18 commits into main from epwalsh/ladder
Nov 27, 2024


epwalsh (Member) commented Nov 22, 2024

Porting over from https://github.com/allenai/OLMo/blob/ladder-1xC/scripts/ladder_peteish.py.

Example

You can run a model size (e.g. the 190M) for the peteish ladder on Beaker as follows:

python src/scripts/train/OLMo2-ladder.py launch 190M ai2/jupiter-cirrascale-2
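
To launch a different size or cluster, the same command can be assembled programmatically. This is only a minimal sketch: it assumes the positional order shown above (subcommand, then size, then cluster) holds for other arguments too, and `ladder_launch_command` is a hypothetical helper, not something defined in this PR.

```python
import shlex

# Script path copied from the example command in this PR description.
SCRIPT = "src/scripts/train/OLMo2-ladder.py"


def ladder_launch_command(size: str, cluster: str) -> list[str]:
    """Hypothetical helper: build the argv for launching one ladder size.

    Assumes the CLI takes `launch <size> <cluster>` positionally, as in the
    example above; no other flags are known from the PR text.
    """
    return ["python", SCRIPT, "launch", size, cluster]


# Reproduce the example from the PR description as a shell-safe string.
cmd = ladder_launch_command("190M", "ai2/jupiter-cirrascale-2")
print(shlex.join(cmd))
```

Building the command as a list and joining it with `shlex.join` keeps the arguments correctly quoted if it is later passed to a shell or to `subprocess.run`.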

Notes for reviewers

The key file to review is src/olmo_core/model_ladder.py.

@epwalsh epwalsh marked this pull request as ready for review November 26, 2024 00:17
@epwalsh epwalsh requested review from dirkgr and AkshitaB November 26, 2024 00:24
epwalsh (Member, Author) commented Nov 27, 2024

Merging now because I need some of these changes for other things, but plz leave comments anyway.

@epwalsh epwalsh merged commit 8e716b5 into main Nov 27, 2024
14 checks passed
@epwalsh epwalsh deleted the epwalsh/ladder branch November 27, 2024 19:55