Skip to content

Commit

Permalink
Merge pull request openai#54 from marko-bast/master
Browse files Browse the repository at this point in the history
Add link for MLP. Had to Google it
  • Loading branch information
jachiam authored Nov 24, 2018
2 parents 52e7ec3 + 858c57a commit 5eade94
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion docs/user/algorithms.rst
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@ The following algorithms are implemented in the Spinning Up package:
- `Twin Delayed DDPG`_ (TD3)
- `Soft Actor-Critic`_ (SAC)

They are all implemented with MLP (non-recurrent) actor-critics, making them suitable for fully-observed, non-image-based RL environments, eg the `Gym Mujoco`_ environments.
They are all implemented with `MLP`_ (non-recurrent) actor-critics, making them suitable for fully-observed, non-image-based RL environments, eg the `Gym Mujoco`_ environments.

.. _`Gym Mujoco`: https://gym.openai.com/envs/#mujoco
.. _`Vanilla Policy Gradient`: ../algorithms/vpg.html
Expand All @@ -25,6 +25,7 @@ They are all implemented with MLP (non-recurrent) actor-critics, making them sui
.. _`Deep Deterministic Policy Gradient`: ../algorithms/ddpg.html
.. _`Twin Delayed DDPG`: ../algorithms/td3.html
.. _`Soft Actor-Critic`: ../algorithms/sac.html
.. _`MLP`: https://en.wikipedia.org/wiki/Multilayer_perceptron


Why These Algorithms?
Expand Down

0 comments on commit 5eade94

Please sign in to comment.