Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added Deep Reinforcement Learning example #2090

Closed
wants to merge 1 commit into from
Closed

Added Deep Reinforcement Learning example #2090

wants to merge 1 commit into from

Conversation

Guillem96
Copy link
Contributor

Add a Deep Reinforcement Learning example to demonstrate the JAX versability across multiple domains.
The example implements a simplified version of the algorithm described in Playing Atari with Deep Reinforcement Learning.
For simplicity, the agent is trained in the CartPole gym environment.
To demonstrate that the example works properly I attach a plot reporting the timesteps that agent holds the pole along with the different episodes.

Report

We can see as the number of training episodes increases the agent is capable to hold the pole for longer times.

For the example, I tried to apply the best JAX good practices. as I could

@googlebot
Copy link
Collaborator

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.


What to do if you already signed the CLA

Individual signers
Corporate signers

ℹ️ Googlers: Go here for more info.

@Guillem96
Copy link
Contributor Author

@googlebot I signed it!

@googlebot
Copy link
Collaborator

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants