Skip to content

Commit

Permalink
Browse files Browse the repository at this point in the history
  • Loading branch information
alexis-jacq committed Feb 27, 2018
2 parents 3020ae4 + 66327be commit ff74235
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ Using PPO with clip loss (from https://arxiv.org/pdf/1707.06347.pdf).

I finally fixed what was wrong with the gradient descent step, using previous log-prob from rollout batches. At least ppo.py is fixed, the rest is going to be corrected as well very soon.

On the following example I was not patient enough to wait for million iterations, I just wanted to check if the model is properly learning:
In the following example I was not patient enough to wait for million iterations, I just wanted to check if the model is properly learning:

Progress of single PPO:
-----------------------
Expand Down

0 comments on commit ff74235

Please sign in to comment.