Fix GAIL with SAC learner on GPU #660
Codecov Report
@@           Coverage Diff           @@
##           master     #660   +/-   ##
=======================================
  Coverage   97.58%   97.58%
=======================================
  Files          87       87
  Lines        8566     8568    +2
=======================================
+ Hits         8359     8361    +2
  Misses        207      207
This now fixes the issue and makes the test pass again (also on a GPU device). I tried to use the … Another option would be to switch to a CUDA-enabled runner, but the supported images seem to be quite restricted, and it would probably be costly and wasteful to run all of our tests on such a runner. I would propose introducing a GPU flag for our tests and running only the flagged tests on a dedicated GPU runner. What do you think, @AdamGleave?
Sorry for only just getting around to reviewing this. Overall the changes look good; I had a few minor comments on the tests.
At a high level, this makes sense. We don't actually have access to the CircleCI GPU executors (they require a minimum spend of $6k/year to enable). We have self-hosted GPU test runners in the past (which lets us use any Docker image), but they are a bit of a pain to maintain. So I might lean towards adding GPU tests with the flag but putting enabling them on CI on the back burner; we can always run the GPU tests manually before a release as a sanity check.
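A minimal sketch of what such an opt-in GPU flag could look like, so that GPU tests are skipped on CI but can be run manually before a release. The environment variable name `RUN_GPU_TESTS` and the helper names are hypothetical and not taken from this PR. The sketch uses the standard-library `unittest` module to stay self-contained; in a pytest-based suite, a custom marker selected with `-m` would play the same role.

```python
import os
import unittest

# Hypothetical environment variable name; not taken from the project.
GPU_TESTS_ENV_VAR = "RUN_GPU_TESTS"


def gpu_tests_enabled(environ=None):
    """Return True if the opt-in flag for GPU tests is set to a truthy value."""
    environ = os.environ if environ is None else environ
    value = environ.get(GPU_TESTS_ENV_VAR, "")
    return value.strip().lower() in {"1", "true", "yes"}


# Decorator that skips a test unless GPU tests were explicitly requested.
requires_gpu_flag = unittest.skipUnless(
    gpu_tests_enabled(),
    f"set {GPU_TESTS_ENV_VAR}=1 to run GPU tests",
)


class ExampleGpuTest(unittest.TestCase):
    @requires_gpu_flag
    def test_runs_only_when_flag_is_set(self):
        # Placeholder body; a real test would exercise GPU code here.
        self.assertTrue(gpu_tests_enabled())
```

With this layout, a regular CI run reports the flagged tests as skipped, while a manual pre-release run on a GPU machine would set the variable before invoking the test runner.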
Thanks for the review! I implemented all of your suggestions. The test now runs only when a GPU is available (otherwise it is pointless), and I opened #671 so that we remember to automate GPU tests someday.
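For illustration, a test that runs only when a GPU is available might be guarded roughly as follows. This is a sketch under stated assumptions, not the test added in this PR: the helper `cuda_available` and the test body are hypothetical, and the standard-library `unittest` is used for self-containment. In pytest, `pytest.mark.skipif(not torch.cuda.is_available(), reason=...)` expresses the same condition.

```python
import unittest


def cuda_available():
    """Return True only if PyTorch is installed and reports a CUDA device."""
    try:
        import torch
    except ImportError:
        # Without PyTorch there is no GPU code path to test.
        return False
    return bool(torch.cuda.is_available())


class GpuRegressionTest(unittest.TestCase):
    @unittest.skipUnless(cuda_available(), "requires a CUDA device")
    def test_tensor_lands_on_gpu(self):
        # Hypothetical stand-in for the real regression test.
        import torch

        x = torch.zeros(1, device="cuda")
        self.assertEqual(x.device.type, "cuda")


def run_suite():
    """Run the example test case and return the unittest result object."""
    loader = unittest.defaultTestLoader
    suite = loader.loadTestsFromTestCase(GpuRegressionTest)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

On a CPU-only machine the test is reported as skipped rather than passed, which also means its body no longer counts towards coverage there.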
Note: the reduction in coverage is intentional. We lost some lines of coverage since I disabled the regression test for CPU-only instances.
LGTM
@ernestum can you add |
Description
Right now, GAIL with a SAC learner is broken on the GPU. See #655 for details.
Testing
This PR includes a test to reproduce the issue.
Note: the test does not fail in the CI pipeline, since the pipeline runs without a GPU.