[RLlib] Add metrics to buffers. #49822
Conversation
…odeReplayBuffer'. Signed-off-by: simonsays1980 <[email protected]>
…chanism in 'DQN'. Signed-off-by: simonsays1980 <[email protected]>
Signed-off-by: simonsays1980 <[email protected]>
…ermore, added a further key argument for the initialization of the buffer to get the number of iterations for smoothing. Signed-off-by: simonsays1980 <[email protected]>
@@ -51,6 +51,7 @@
     NUM_ENV_STEPS_SAMPLED_LIFETIME,
     NUM_TARGET_UPDATES,
     REPLAY_BUFFER_ADD_DATA_TIMER,
+    REPLAY_BUFFER_RESULTS,
Cool!
@@ -18,11 +18,15 @@
         lr=0.0005 * (args.num_learners or 1) ** 0.5,
         train_batch_size_per_learner=32,
         replay_buffer_config={
-            "type": "PrioritizedEpisodeReplayBuffer",
+            "type": "EpisodeReplayBuffer",
is this just for testing?
Yeah, I wanted to check with you whether we proceed like this; then all buffers get the metrics and I can test with any of them.
@@ -660,6 +661,11 @@ def _training_step_new_api_stack(self):
            sample_episodes=True,
        )

+       replay_buffer_results = self.local_replay_buffer.get_metrics()
Nice. Unified API name get_metrics, analogous to EnvRunners.
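The unified accessor discussed above can be sketched as a plain-Python toy. Everything below is illustrative: the class name and the counter keys (num_episodes_added, num_timesteps_sampled) are stand-ins, not RLlib's actual metric constants.

```python
class ToyEpisodeBuffer:
    """Minimal stand-in for a replay buffer that tracks its own metrics."""

    def __init__(self):
        self._episodes = []
        self._num_episodes_added = 0
        self._num_timesteps_sampled = 0

    def add(self, episode):
        self._episodes.append(episode)
        self._num_episodes_added += 1

    def sample(self, num_timesteps):
        # Actual sampling logic elided; only the metric update is shown.
        self._num_timesteps_sampled += num_timesteps

    def get_metrics(self):
        # Same accessor name as on EnvRunners, so the algorithm's
        # training step can treat all subcomponents uniformly.
        return {
            "num_episodes_added": self._num_episodes_added,
            "num_timesteps_sampled": self._num_timesteps_sampled,
        }


buf = ToyEpisodeBuffer()
buf.add(["obs_0", "obs_1"])
buf.sample(32)
print(buf.get_metrics())  # {'num_episodes_added': 1, 'num_timesteps_sampled': 32}
```

The design point is the shared method name: the training step can call get_metrics() on any subcomponent without caring whether it is a buffer or an EnvRunner.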
@@ -660,6 +661,11 @@ def _training_step_new_api_stack(self):
            sample_episodes=True,
        )

+       replay_buffer_results = self.local_replay_buffer.get_metrics()
+       self.metrics.merge_and_log_n_dicts(
Yeah, I wonder why log_dict doesn't work here. It should be the better choice b/c we don't have more than one buffer:

self.metrics.log_dict(
    replay_buffer_results,
    key=REPLAY_BUFFER_RESULTS,
)

Maybe b/c replay_buffer_results already contains Stats objects with their individual settings? ...
I need to check it.
So, basically, the lifetime metrics are somehow wrongly accumulated and grow exponentially. They probably need to be reduced before being passed to the log_dict method.
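The exponential growth described here can be reproduced with a toy calculation. The names and numbers below are illustrative, not RLlib's: the point is that if the incoming dict carries a *lifetime* (cumulative) value that already includes everything logged so far, and the logger sums it into its own running total instead of reducing first, the total roughly doubles every iteration.

```python
# Toy reproduction of double-counting a lifetime counter.
logger_total = 0  # the logger's running lifetime sum
history = []

for it in range(5):
    delta = 100  # new timesteps added this iteration
    # BUG variant: the incoming value is itself a lifetime total that
    # already contains everything the logger has accumulated, so summing
    # it back in adds the logger's own total a second time.
    incoming_lifetime = logger_total + delta
    logger_total += incoming_lifetime
    history.append(logger_total)

print(history)  # [100, 300, 700, 1500, 3100] -- roughly doubles each step

# Reducing first (logging only the per-iteration delta) gives the
# expected linear growth: 100, 200, 300, 400, 500.
```

This matches the symptom in the comment above: reducing the Stats before handing them to log_dict would break the feedback loop.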
@@ -646,6 +896,10 @@ def get_added_timesteps(self) -> int:
        """Returns number of timesteps that have been added in buffer's lifetime."""
        return self._num_timesteps_added

+   def get_metrics(self) -> ResultDict:
nit: Add docstring.
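One possible shape for the requested docstring, as a sketch only: the stub MetricsLogger below stands in for RLlib's real one, and the exact wording, the ResultDict contents, and the use of reduce() are assumptions, not the PR's actual implementation.

```python
class _StubMetricsLogger:
    """Stand-in for RLlib's MetricsLogger (assumption for this sketch)."""

    def __init__(self):
        self._data = {"num_episodes_added": 3}

    def reduce(self):
        # Return a plain-dict snapshot of the reduced metric values.
        return dict(self._data)


class BufferSketch:
    def __init__(self):
        self.metrics = _StubMetricsLogger()

    def get_metrics(self):
        """Returns the metrics logged in this buffer.

        Collects this buffer's MetricsLogger state (e.g. counts of added
        and sampled episodes/timesteps), analogous to
        `EnvRunner.get_metrics()`.

        Returns:
            A ResultDict with the reduced metric values of this buffer.
        """
        return self.metrics.reduce()


print(BufferSketch().get_metrics())  # {'num_episodes_added': 3}
```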
LGTM! Thanks for this really cool PR. A handful of nits and one important question on the usage of log_dict vs merge_and_log_n_dicts. Let's take a look at log_dict and figure out why it doesn't work here (according to our offline discussion). log_dict should be the better choice here, b/c we are NOT merging > 1 dicts from parallel subcomponents.
…ics to the 'PrioritizedEpisodeReplayBuffer'. Added also docstrings. Signed-off-by: simonsays1980 <[email protected]>
Signed-off-by: simonsays1980 <[email protected]>
Signed-off-by: simonsays1980 <[email protected]>
Signed-off-by: Anson Qian <[email protected]>
Signed-off-by: Puyuan Yao <[email protected]>
Why are these changes needed?
This PR proposes the following changes:
- Adds a MetricsLogger to the EpisodeReplayBuffers.
- Adds new metrics keys to ray.rllib.utils.metrics.__init__.
- Records metrics in EpisodeReplayBuffers during add and sample operations.
- Wires these metrics into the off-policy algorithms in RLlib, namely DQN and SAC.
.Related issue number
Checks
- I've signed off every commit (git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- If I'm introducing a new method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.