Commit
Merge pull request openai#90 from rootulp/patch-4
SOTA -> state of the art
bchess authored Jun 28, 2019
2 parents 2523ee9 + 1595953 commit f8cff4f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/user/algorithms.rst
@@ -31,7 +31,7 @@ They are all implemented with `MLP`_ (non-recurrent) actor-critics, making them
Why These Algorithms?
=====================

-We chose the core deep RL algorithms in this package to reflect useful progressions of ideas from the recent history of the field, culminating in two algorithms in particular---PPO and SAC---which are close to SOTA on reliability and sample efficiency among policy-learning algorithms. They also expose some of the trade-offs that get made in designing and using algorithms in deep RL.
+We chose the core deep RL algorithms in this package to reflect useful progressions of ideas from the recent history of the field, culminating in two algorithms in particular---PPO and SAC---which are close to state of the art on reliability and sample efficiency among policy-learning algorithms. They also expose some of the trade-offs that get made in designing and using algorithms in deep RL.

The On-Policy Algorithms
------------------------