Merge pull request openai#63 from marcinic/patch-1
Changed use to using
jachiam authored Nov 29, 2018
2 parents f67828a + 4cb53a2 commit 8b92b8a
Showing 1 changed file with 1 addition and 1 deletion.
docs/algorithms/vpg.rst: 1 addition, 1 deletion
@@ -40,7 +40,7 @@ The policy gradient algorithm works by updating policy parameters via stochastic

     \theta_{k+1} = \theta_k + \alpha \nabla_{\theta} J(\pi_{\theta_k})

-Policy gradient implementations typically compute advantage function estimates based on the infinite-horizon discounted return, despite otherwise use the finite-horizon undiscounted policy gradient formula.
+Policy gradient implementations typically compute advantage function estimates based on the infinite-horizon discounted return, despite otherwise using the finite-horizon undiscounted policy gradient formula.

 Exploration vs. Exploitation
 ----------------------------
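The corrected sentence describes a real quirk of practical implementations: the update rule above ascends the finite-horizon undiscounted objective, while the advantage weights that enter the gradient estimate are usually computed from infinite-horizon discounted returns. As a minimal sketch of that update step for a discrete-action policy, assuming PyTorch; policy, optimizer, obs, acts, and advs are hypothetical stand-ins for a policy network, its optimizer, and batched rollout data, not names from the Spinning Up code:

import torch
from torch.distributions import Categorical

def vpg_update(policy, optimizer, obs, acts, advs):
    # One stochastic gradient ascent step on J(pi_theta):
    #   theta_{k+1} = theta_k + alpha * grad_theta J(pi_{theta_k})
    logits = policy(obs)                               # action logits per state
    logp = Categorical(logits=logits).log_prob(acts)   # log pi_theta(a|s)
    # advs: advantage estimates; in practice these come from the
    # infinite-horizon discounted return (the mismatch the diff notes).
    loss = -(logp * advs).mean()                       # minimizing -J ascends J
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

The step size alpha lives inside the optimizer, e.g. torch.optim.Adam(policy.parameters(), lr=alpha).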
