[rllib] mountaincarcontinous-ddpg regression #5604

Closed
kifarid opened this issue Aug 31, 2019 · 7 comments · Fixed by #5626

@kifarid

kifarid commented Aug 31, 2019

System information

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Ubuntu 16.04
  • Ray version: 0.7.3
  • Python version: 3.7.4
  • Exact command to reproduce: rllib train -f mountaincarcontinuous-ddpg.yaml
    This is the tuned example at:

project/ray/blob/747daff2cb73deae7b8a6755e70e550476c09d71/rllib/tuned_examples/mountaincarcontinuous-ddpg.yaml#L1

Describe the problem

Running the tuned example causes an error. I think the cause may be in the target update frequency or in the handling of multi-step returns.

Error traceback

Traceback (most recent call last):
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/tune/trial_runner.py", line 498, in _process_trial
result = self.trial_executor.fetch_result(trial)
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/tune/ray_trial_executor.py", line 347, in fetch_result
result = ray.get(trial_future[0])
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/worker.py", line 2332, in get
raise value
ray.exceptions.RayTaskError: ray_DDPG:train() (pid=5200, host=karimy)
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/agents/trainer.py", line 402, in train
raise e
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/agents/trainer.py", line 388, in train
result = Trainable.train(self)
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/tune/trainable.py", line 171, in train
result = self._train()
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/agents/trainer_template.py", line 126, in _train
fetches = self.optimizer.step()
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/optimizers/sync_replay_optimizer.py", line 123, in step
batch = self.workers.local_worker().sample()
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/evaluation/rollout_worker.py", line 467, in sample
batches = [self.input_reader.next()]
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/evaluation/sampler.py", line 56, in next
batches = [self.get_data()]
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/evaluation/sampler.py", line 99, in get_data
item = next(self.rollout_provider)
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/evaluation/sampler.py", line 340, in _env_runner
base_env.send_actions(actions_to_send)
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/env/base_env.py", line 332, in send_actions
self.vector_env.vector_step(action_vector)
File "/home/karimy/tensorflow/venv/lib/python3.5/site-packages/ray/rllib/env/vector_env.py", line 114, in vector_step
r, type(r)))
ValueError: Reward should be finite scalar, got nan (<class 'float'>)
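
The final frame shows the environment step rejecting a reward that is not a finite scalar. Below is a minimal sketch of that kind of check. `check_reward` and `toy_reward` are hypothetical names used for illustration only, not RLlib's or Gym's actual code. The sketch also shows one plausible way a NaN could reach the reward: if the reward is computed from the action, a NaN action (for example from a diverged policy network) propagates through the arithmetic.

```python
import math
import numbers


def check_reward(r):
    """Raise ValueError unless r is a finite real scalar (illustrative only)."""
    if not isinstance(r, numbers.Real) or not math.isfinite(r):
        raise ValueError(
            "Reward should be finite scalar, got {} ({})".format(r, type(r)))
    return r


def toy_reward(action):
    """Hypothetical action-dependent reward: a small penalty on action size."""
    return -0.1 * action ** 2


# A finite action gives a finite reward and passes the check.
check_reward(toy_reward(0.5))

# A NaN action propagates into the reward and is rejected.
try:
    check_reward(toy_reward(float("nan")))
except ValueError as err:
    print(err)
```

If this is what happens here, the NaN would originate upstream of the environment (in the policy's output), and the environment check is only where it first becomes visible.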

@ericl
Contributor

ericl commented Sep 1, 2019

What TensorFlow version is this? It works on 1.14

@kifarid
Author

kifarid commented Sep 2, 2019

It's on 1.14, actually, but the error doesn't happen consistently.

ericl changed the title from "running tuned examples" to "[rllib] mountaincarcontinous-ddpg regression" on Sep 2, 2019
ericl self-assigned this on Sep 2, 2019
@ericl
Contributor

ericl commented Sep 2, 2019

Ok, this seems to be an issue on master as well.

@ericl
Contributor

ericl commented Sep 3, 2019

This seems to be caused by the recent DDPG refactoring: #5242

@ninafiona

I am getting the same error with the pendulum-v0 example. Does this bug still persist?

@devinbarry

Getting this same issue with IMPALA on MountainCarContinuous-v0

@ericl
Contributor

ericl commented Mar 8, 2020

Please file a new bug with a reproduction script instead of commenting on old issues. There are many root causes that can lead to the same error message.
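
Since the same error message can have many root causes, a new report is more useful if it records where the non-finite value first appears. The following is a minimal sketch of a recorder that wraps an environment-like object and notes the first step whose reward is not finite. `NonFiniteStepRecorder` and `ToyEnv` are hypothetical names, and the `reset()` / `step(action)` interface returning `(obs, reward, done, info)` is an assumption of the sketch rather than anything taken from this thread.

```python
import math


class NonFiniteStepRecorder:
    """Wrap an env-like object and record the first non-finite reward.

    Assumes the wrapped object exposes reset() and step(action) returning
    (obs, reward, done, info); this interface is an assumption of the sketch.
    """

    def __init__(self, env):
        self.env = env
        self.steps = 0
        self.first_bad_step = None  # details of the first offending step

    def reset(self):
        self.steps = 0
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self.steps += 1
        if self.first_bad_step is None and not math.isfinite(reward):
            self.first_bad_step = {
                "step": self.steps,
                "action": action,
                "reward": reward,
            }
        return obs, reward, done, info


class ToyEnv:
    """Hypothetical stand-in env whose reward depends on the action."""

    def reset(self):
        return 0.0

    def step(self, action):
        return 0.0, -0.1 * action ** 2, False, {}


env = NonFiniteStepRecorder(ToyEnv())
env.reset()
for a in (0.1, 0.3, float("nan"), 0.2):
    env.step(a)
print(env.first_bad_step)
```

Including the recorded step index and action in a report helps distinguish a NaN produced by the policy from one produced inside the environment.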
