Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add DS-Chat CI workflow #4127

Merged
merged 15 commits into from
Aug 17, 2023
Merged

Add DS-Chat CI workflow #4127

merged 15 commits into from
Aug 17, 2023

Conversation

lekurile
Copy link
Contributor

@lekurile lekurile commented Aug 10, 2023

This PR adds a DS-Chat CI workflow for testing Step 3 in the DS-Chat pipeline.

This test sweeps across the following parameters:

Zero Stage:    2, 3
Hybrid Engine: True, False
Offload:       True, False
LoRA:          True, False

The models used for this test are:

Actor:  facebook/opt-125m
Critic: facebook/opt-125m (trained in DS-Chat Step 2)

Each configuration (16 total) runs through a limited number of Step 3 non-overflow training steps (i.e. steps where neither actor nor critic overflow) and saves the actor/critic models. Assertions are used to check if the training pipeline executed correctly and if the actor and critic models were saved properly.

The test is scheduled to run nightly at midnight.

@lekurile lekurile changed the title Add DS Chat CI workflow Add DS-Chat CI workflow Aug 16, 2023
@lekurile lekurile marked this pull request as ready for review August 16, 2023 23:31
@lekurile lekurile enabled auto-merge August 17, 2023 01:03
@lekurile lekurile added this pull request to the merge queue Aug 17, 2023
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Aug 17, 2023
@loadams loadams added this pull request to the merge queue Aug 17, 2023
Merged via the queue into master with commit 64c670e Aug 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants