Added Deep Reinforcement Learning example #2090
Closed
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Add a Deep Reinforcement Learning example to demonstrate the JAX versability across multiple domains.
The example implements a simplified version of the algorithm described in Playing Atari with Deep Reinforcement Learning.
For simplicity, the agent is trained in the CartPole gym environment.
To demonstrate that the example works properly I attach a plot reporting the timesteps that agent holds the pole along with the different episodes.
We can see as the number of training episodes increases the agent is capable to hold the pole for longer times.
For the example, I tried to apply the best JAX good practices. as I could