-
Notifications
You must be signed in to change notification settings - Fork 6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RLlib] APPO (new API stack) enhancements vol 05: Auto-sleep time AND thread-safety for MetricsLogger. #48868
Merged
sven1977
merged 93 commits into
ray-project:master
from
sven1977:appo_enhancements_05_auto_sleep_and_thread_safety_for_metrics
Dec 3, 2024
Merged
Changes from all commits
Commits
Show all changes
93 commits
Select commit
Hold shift + click to select a range
2b09969
wip
sven1977 5089e12
wip
sven1977 490e254
wip
sven1977 0c8fb9e
wip
sven1977 8d46658
Merge branch 'master' of https://github.com/ray-project/ray into fix_…
sven1977 12185ca
wip
sven1977 6e4652b
ppo reporting everything ok now.
sven1977 ef549b8
fix episodes/episodes-lifetime in env runners.
sven1977 1171ccf
wip
sven1977 85c48e8
wip
sven1977 bd5a884
wip
sven1977 937ff49
wip
sven1977 4673c96
Merge branch 'master' of https://github.com/ray-project/ray into fix_…
sven1977 a686fc7
wip
sven1977 a6fcc37
wip
sven1977 b6ef29e
wip
sven1977 666ba01
wip
sven1977 70939e7
wip
sven1977 18fbb91
wip
sven1977 dbf2d07
fix
sven1977 75f761f
fix
sven1977 6c1aa7a
Merge branch 'master' of https://github.com/ray-project/ray into fix_…
sven1977 b164f31
Merge branch 'master' of https://github.com/ray-project/ray into impa…
sven1977 b65d49c
fix
sven1977 2342bc9
wip
sven1977 3707a13
Merge branch 'master' of https://github.com/ray-project/ray into add_…
sven1977 4d1a4ec
Merge branch 'add_off_policyness_metric_to_new_api_stack' into impala…
sven1977 ec7159c
wip
sven1977 7327a4e
wip
sven1977 64c09e4
wip
sven1977 d0969d6
wip
sven1977 29fa4ed
Merge branch 'fix_accumulation_of_results_in_algorithm' into impala_a…
sven1977 2754c9e
wip
sven1977 fd45de5
wip
sven1977 4301721
Merge branch 'master' of https://github.com/ray-project/ray into impa…
sven1977 4c98e7e
wip
sven1977 fcfff7b
merge
sven1977 d2ee136
merge
sven1977 6e48131
wip
sven1977 cc6b753
wip
sven1977 f9af97a
wip
sven1977 68515d2
fix
sven1977 5cbbb96
Merge branch 'master' of https://github.com/ray-project/ray into impa…
sven1977 339ff6e
wip
sven1977 5a2ce74
learns Pong in some time (>>10min). Not great, but does learn.
sven1977 bc374d0
test copying dummy batches to circumvent spending time on learner con…
sven1977 e0844c3
wip
sven1977 9e2a755
learning Pong in 700sec (R>20.0) on 31 EnvRunners and 1 local L4 GPU.
sven1977 f330ff3
wip
sven1977 6a59c81
wip
sven1977 4866467
merge
sven1977 901fbc8
wip
sven1977 1a53331
wip
sven1977 1aacee7
wip
sven1977 308ebc9
wip
sven1977 1540d1f
Learns Pong-v5 in <8min (20.0+) with 31 ER and 1 local GPU.
sven1977 3a12aba
wip
sven1977 df18c93
various fixes and enhancements:
sven1977 329b9de
Merge branch 'master' of https://github.com/ray-project/ray into impa…
sven1977 3ddb9bb
deadlock and deepcopy fix
sven1977 abf8f67
fix problem with tensor found in reduced stats -> have to unlock tens…
sven1977 d213174
wip
sven1977 f8d6f7b
wip
sven1977 2b3a468
wip
sven1977 c0ee159
merge
sven1977 9c1b7c1
Merge branch 'master' of https://github.com/ray-project/ray into impa…
sven1977 cc56e55
Merge branch 'master' of https://github.com/ray-project/ray into impa…
sven1977 d121103
wip
sven1977 bd87e61
merge
sven1977 a4f0dff
some bug fixes related to metrics (was not thread safe) and throughpu…
sven1977 8eeb1fe
Merge branch 'master' of https://github.com/ray-project/ray into impa…
sven1977 39dc133
wip
sven1977 540d488
LEARNING!!!
sven1977 200b200
wip
sven1977 321d8dc
wip
sven1977 93a6f7f
wip
sven1977 3ec5a54
wip
sven1977 ed24c39
Merge branch 'master' of https://github.com/ray-project/ray into appo…
sven1977 4128993
wip
sven1977 0996f0e
wip
sven1977 122bb07
Merge branch 'master' of https://github.com/ray-project/ray into appo…
sven1977 56b4854
wip
sven1977 a87ec84
LINT
sven1977 942a372
test: old APPO loss variable names and no "x lambda".
sven1977 66bd406
wip
sven1977 7ecac19
wip
sven1977 8e12918
LINT; learns Pong in ~12min (maybe less) on 1 local A10 GPU and 31 En…
sven1977 eef1302
wip
sven1977 180305f
Merge branch 'master' of https://github.com/ray-project/ray into appo…
sven1977 00a8fef
wip
sven1977 ecb4b37
wip
sven1977 a9eb5da
LINT
sven1977 d40b0c2
fix
sven1977 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -1813,9 +1813,11 @@ def env_runners( | |
fill up, causing spilling of objects to disk. This can cause any | ||
asynchronous requests to become very slow, making your experiment run | ||
slowly as well. You can inspect the object store during your experiment | ||
via a call to `ray memory` on your head node, and by using the Ray | ||
through a call to `ray memory` on your head node, and by using the Ray | ||
dashboard. If you're seeing that the object store is filling up, | ||
turn down the number of remote requests in flight or enable compression. | ||
turn down the number of remote requests in flight or enable compression | ||
or increase the object store memory through, for example: | ||
`ray.init(object_store_memory=10 * 1024 * 1024 * 1024) # =10 GB` | ||
sample_collector: For the old API stack only. The SampleCollector class to | ||
be used to collect and retrieve environment-, model-, and sampler data. | ||
Override the SampleCollector base class to implement your own | ||
|
@@ -2144,9 +2146,14 @@ def learners( | |
CUDA devices. For example if `os.environ["CUDA_VISIBLE_DEVICES"] = "1"` | ||
and `local_gpu_idx=0`, RLlib uses the GPU with ID=1 on the node. | ||
max_requests_in_flight_per_learner: Max number of in-flight requests | ||
to each Learner (actor)). See the | ||
`ray.rllib.utils.actor_manager.FaultTolerantActorManager` class for more | ||
details. | ||
to each Learner (actor). You normally do not have to tune this setting | ||
(default is 3), however, for asynchronous algorithms, this determines | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Great description! |
||
the "queue" size for incoming batches (or lists of episodes) into each | ||
Learner worker, thus also determining, how much off-policy'ness would be | ||
acceptable. The off-policy'ness is the difference between the numbers of | ||
updates a policy has undergone on the Learner vs the EnvRunners. | ||
See the `ray.rllib.utils.actor_manager.FaultTolerantActorManager` class | ||
for more details. | ||
|
||
Returns: | ||
This updated AlgorithmConfig object. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome!