[RLlib; Offline RL] Offline RL example that shows how to customize an offline data pipeline. #49046

simonsays1980 · 2024-12-03T19:14:16Z

Why are these changes needed?

RLlib's new Offline RL was built to be more powerful as its old stack one and competitive to the SOTA standards out in the market. Part of it is its highly customizable pipeline that can be used to literally read any data from
local or remote storage and transform data from raw format to SingleAgentEpisode instances to feed the learner connector pipeline and further to MultiAgentBatch format ready-to-train on the Learner.

This example is the first of multiple ones to show users the cacabilities of RLlib's new Offline RL API. It shows

how to read in raw image data from a cloud bucket
how to transform this data via byte streams to numpy arrays
how to further transform these arrays into SingleAgentEpisode format to be readable by ConnectorV2 pieces
how to convert further into MultiAgentBatch format to pass to a Learner's update method.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

…eLearner._map_to_episodes' methods. Signed-off-by: simonsays1980 <[email protected]>

… for it. In addition added keyword arguments to the 'OfflinePreLearner' mmethods such that overriding offers more customization. Signed-off-by: simonsays1980 <[email protected]>

…data'. Signed-off-by: simonsays1980 <[email protected]>

… how to customize an offline data pipeline. Signed-off-by: simonsays1980 <[email protected]>

rllib/offline/offline_data.py

rllib/examples/offline_rl/offline_rl_with_image_data.py

sven1977

Looks great! Thanks for the quick turnaround @simonsays1980 !

Just the one question on the commented-out line. ??

Signed-off-by: simonsays1980 <[email protected]>

…ample

… offline data pipeline. (ray-project#49046) Signed-off-by: ujjawal-khare <[email protected]>

simonsays1980 added 5 commits December 2, 2024 13:10

Added keyword arguments to 'OfflinePreLearner.__init_' and 'OfflinePr…

a9bbc60

…eLearner._map_to_episodes' methods. Signed-off-by: simonsays1980 <[email protected]>

Added customization option for 'OfflineData' classes and wrote a test…

d0040f0

… for it. In addition added keyword arguments to the 'OfflinePreLearner' mmethods such that overriding offers more customization. Signed-off-by: simonsays1980 <[email protected]>

Added docstring for 'offline_data_class' in ÄALgorithmConfig.offline_…

6a13961

…data'. Signed-off-by: simonsays1980 <[email protected]>

Merge branch 'master' into offline-rl-make-offline-data-overrideable

e8a978b

Carved out example from Offline RL docs. Simple example that explains…

2805c04

… how to customize an offline data pipeline. Signed-off-by: simonsays1980 <[email protected]>

simonsays1980 marked this pull request as ready for review December 3, 2024 19:14

simonsays1980 requested a review from sven1977 as a code owner December 3, 2024 19:14

sven1977 changed the title ~~[RLlib; Offline RL] - Offline RL example that shows how to customize an offline data pipeline.~~ [RLlib; Offline RL] Offline RL example that shows how to customize an offline data pipeline. Dec 3, 2024

sven1977 reviewed Dec 3, 2024

View reviewed changes

rllib/offline/offline_data.py Outdated Show resolved Hide resolved

sven1977 reviewed Dec 3, 2024

View reviewed changes

rllib/examples/offline_rl/offline_rl_with_image_data.py Show resolved Hide resolved

sven1977 approved these changes Dec 3, 2024

View reviewed changes

sven1977 enabled auto-merge (squash) December 3, 2024 20:57

github-actions bot added the go add ONLY when ready to merge, run all tests label Dec 3, 2024

sven1977 self-assigned this Dec 3, 2024

sven1977 added rllib RLlib related issues rllib-offline-rl Offline RL problems rllib-docs-or-examples Issues related to RLlib documentation or rllib/examples rllib-newstack labels Dec 3, 2024

sven1977 disabled auto-merge December 3, 2024 20:58

simonsays1980 added 2 commits December 4, 2024 13:04

Added @sven1977's review.

9e7c6e0

Signed-off-by: simonsays1980 <[email protected]>

Merge branch 'master' into offline-rl-overriding-offlinedata-class-ex…

3070977

…ample

sven1977 enabled auto-merge (squash) December 4, 2024 12:23

sven1977 merged commit 10daade into ray-project:master Dec 4, 2024
6 checks passed

ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Dec 17, 2024

[RLlib; Offline RL] Offline RL example that shows how to customize an…

02c2a28

… offline data pipeline. (ray-project#49046) Signed-off-by: ujjawal-khare <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib; Offline RL] Offline RL example that shows how to customize an offline data pipeline. #49046

[RLlib; Offline RL] Offline RL example that shows how to customize an offline data pipeline. #49046

simonsays1980 commented Dec 3, 2024 •

edited

Loading

sven1977 left a comment

[RLlib; Offline RL] Offline RL example that shows how to customize an offline data pipeline. #49046

[RLlib; Offline RL] Offline RL example that shows how to customize an offline data pipeline. #49046

Conversation

simonsays1980 commented Dec 3, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

sven1977 left a comment

Choose a reason for hiding this comment

simonsays1980 commented Dec 3, 2024 •

edited

Loading