Skip to content

Commit

Permalink
Merge pull request openai#90 from rootulp/patch-4
Browse files Browse the repository at this point in the history
SOTA -> state of the art
  • Loading branch information
bchess authored Jun 28, 2019
2 parents 61dc7c3 + 96df49f commit 79a6d5d
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/user/algorithms.rst
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ They are all implemented with `MLP`_ (non-recurrent) actor-critics, making them
Why These Algorithms?
=====================

We chose the core deep RL algorithms in this package to reflect useful progressions of ideas from the recent history of the field, culminating in two algorithms in particular---PPO and SAC---which are close to SOTA on reliability and sample efficiency among policy-learning algorithms. They also expose some of the trade-offs that get made in designing and using algorithms in deep RL.
We chose the core deep RL algorithms in this package to reflect useful progressions of ideas from the recent history of the field, culminating in two algorithms in particular---PPO and SAC---which are close to state of the art on reliability and sample efficiency among policy-learning algorithms. They also expose some of the trade-offs that get made in designing and using algorithms in deep RL.

The On-Policy Algorithms
------------------------
Expand Down

0 comments on commit 79a6d5d

Please sign in to comment.