Memory scheduling 2 DFS with Priority #3

jaewan · 2021-11-11T02:17:54Z

Why are these changes needed?

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://ray.readthedocs.io/en/latest/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failure rates at https://ray-travis-tracker.herokuapp.com/.

…' into memory-scheduling-2

stephanie-wang

Can you also remove the extra files (package-lock.json, etc) and split out the benchmark changes to a separate PR? I only reviewed the C++ changes here.

stephanie-wang · 2021-11-11T02:58:20Z

src/ray/core_worker/reference_count.cc

+void ReferenceCounter::UpdateObjectPriority(
+		const ObjectID &object_id,
+		const Priority &priority){
+  object_id_priority_.emplace(object_id, priority);


.emplace won't update the object ID if there is already a priority set. You can use object_id_priority_[object_id] = priority instead.

stephanie-wang · 2021-11-11T02:58:32Z

src/ray/core_worker/reference_count.cc

+      // This happens if a large argument is transparently passed by reference
+      // because we don't hold a Python reference to its ObjectID.
+	  // When an object is made with Put() Priority is not set. Should to this Jae
+      it = object_id_priority_.emplace(object_id, Priority()).first;


Probably fine to just return Priority() instead of modifying the map.

src/ray/core_worker/reference_count.cc

stephanie-wang · 2021-11-11T02:59:30Z

src/ray/core_worker/task_manager.cc

+  Priority &max_priority = dummy_pri;
+  for (const ObjectID &argument_id : task_deps) {
+    Priority &p = reference_counter_->GetObjectPriority(argument_id);
+    if(max_priority > p){


Don't we want min_priority, not max?

It's little messed up.
Btree_map picks the lowest value and from priority queue perspective it is the highest priority.
Not sure how to name it to make it clear max or min

stephanie-wang · 2021-11-11T03:01:02Z

src/ray/core_worker/task_manager.cc

  for (size_t i = 0; i < num_returns; i++) {
    if (!spec.IsActorCreationTask()) {
      // We pass an empty vector for inner IDs because we do not know the return
      // value of the task yet. If the task returns an ID(s), the worker will
      // publish the WaitForRefRemoved message that we are now a borrower for
      // the inner IDs. Note that this message can be received *before* the
      // PushTaskReply.
+      // TODO: Set the priority of this task's return Refs.


Can you remove the TODOs that are done now?

src/ray/raylet/scheduling/cluster_task_manager.cc

To make it in different branch

…ng/ray into memory-scheduling-2

This reverts commit ba57bb2.

… dispatched if priority is too low

…cheduling-2

…style) #3 (ray-project#21652)

…ray-project#23821) This PR refactors `LazyBlockList` in service of out-of-band serialization (see [mono-PR](ray-project#22616)) and is a precursor to an execution plan refactor (PR #2) and adding the actual out-of-band serialization APIs (PR #3). The following is included in this refactor: 1. `ReadTask`s are now a first-class concept, replacing calls; 2. read stage progress tracking is consolidated into `LazyBlockList._get_blocks_with_metadta()` and more of the read task complexity, e.g. the read remote function, was pushed into `LazyBlockList` to make `ray.data.read_datasource()` simpler; 3. we are a bit smarter with how we progressively launch tasks and fetch and cache metadata, including fetching the metadata for read tasks in `.iter_blocks_with_metadata()` instead of relying on the pre-read task metadata (which will be less accurate), and we also fix some small bugs in the lazy ramp-up around progressive metadata fetching. (1) is the most important item for supporting out-of-band serialization and fundamentally changes the `LazyBlockList` data model. This is required since we need to be able to reference the underlying read tasks when rewriting read stages during optimization and when serializing the lineage of the Dataset. See the [mono-PR](ray-project#22616) for more context. Other changes: 1. Changed stats actor to a global named actor singleton in order to obviate the need for serializing the actor handle with the Dataset stats; without this, we were encountering serialization failures.

We encountered SIGSEGV when running Python test `python/ray/tests/test_failure_2.py::test_list_named_actors_timeout`. The stack is: ``` #0 0x00007fffed30f393 in std::basic_string<char, std::char_traits<char>, std::allocator<char> >::basic_string(std::string const&) () from /lib64/libstdc++.so.6 #1 0x00007fffee707649 in ray::RayLog::GetLoggerName() () from /home/admin/dev/Arc/merge/ray/python/ray/_raylet.so #2 0x00007fffee70aa90 in ray::SpdLogMessage::Flush() () from /home/admin/dev/Arc/merge/ray/python/ray/_raylet.so #3 0x00007fffee70af28 in ray::RayLog::~RayLog() () from /home/admin/dev/Arc/merge/ray/python/ray/_raylet.so #4 0x00007fffee2b570d in ray::asio::testing::(anonymous namespace)::DelayManager::Init() [clone .constprop.0] () from /home/admin/dev/Arc/merge/ray/python/ray/_raylet.so #5 0x00007fffedd0d95a in _GLOBAL__sub_I_asio_chaos.cc () from /home/admin/dev/Arc/merge/ray/python/ray/_raylet.so #6 0x00007ffff7fe282a in call_init.part () from /lib64/ld-linux-x86-64.so.2 #7 0x00007ffff7fe2931 in _dl_init () from /lib64/ld-linux-x86-64.so.2 #8 0x00007ffff7fe674c in dl_open_worker () from /lib64/ld-linux-x86-64.so.2 #9 0x00007ffff7b82e79 in _dl_catch_exception () from /lib64/libc.so.6 #10 0x00007ffff7fe5ffe in _dl_open () from /lib64/ld-linux-x86-64.so.2 #11 0x00007ffff7d5f39c in dlopen_doit () from /lib64/libdl.so.2 #12 0x00007ffff7b82e79 in _dl_catch_exception () from /lib64/libc.so.6 #13 0x00007ffff7b82f13 in _dl_catch_error () from /lib64/libc.so.6 #14 0x00007ffff7d5fb09 in _dlerror_run () from /lib64/libdl.so.2 #15 0x00007ffff7d5f42a in dlopen@@GLIBC_2.2.5 () from /lib64/libdl.so.2 #16 0x00007fffef04d330 in py_dl_open (self=<optimized out>, args=<optimized out>) at /tmp/python-build.20220507135524.257789/Python-3.7.11/Modules/_ctypes/callproc.c:1369 ``` The root cause is that when loading `_raylet.so`, `static DelayManager _delay_manager` is initialized and `RAY_LOG(ERROR) << "RAY_testing_asio_delay_us is set to " << delay_env;` is executed. However, the static variables declared in `logging.cc` are not initialized yet (in this case, `std::string RayLog::logger_name_ = "ray_log_sink"`). It's better not to rely on the initialization order of static variables in different compilation units because it's not guaranteed. I propose to change all `RAY_LOG`s to `std::cerr` in `DelayManager::Init()`. The crash happens in Ant's internal codebase. Not sure why this test case passes in the community version though. BTW, I've tried different approaches: 1. Using a static local variable in `get_delay_us` and remove the global variable. This doesn't work because `init()` needs to access the variable as well. 2. Defining the global variable as type `std::unique_ptr<DelayManager>` and initialize it in `get_delay_us`. This works but it requires a lock to be thread-safe.

…ay-project#41074) (ray-project#41212)

jaewan and others added 8 commits November 2, 2021 06:30

BlockTask + test_pipeline

099bad3

Merge remote-tracking branch 'refs/remotes/origin/memory-scheduling-2…

dc7b468

…' into memory-scheduling-2

Pipeline Benchmark Initiate from the Driver

c1f64f4

pipeline test with stages

546796c

TODOs

e28ae48

Add priority from SubmitTask only explicitly. This gives error on tests

3f8fab1

Priority Assigning Complete

00a3a4d

DFS checked with correct priority

8bb0cb1

stephanie-wang reviewed Nov 11, 2021

View reviewed changes

jaewan and others added 11 commits November 11, 2021 08:12

Fixed bugs on emplace -> []

4506300

blocktasks implemented #1

b099e84

Delete OSDI22 directory

82556f3

To make it in different branch

threshold version

ba57bb2

Merge branch 'memory-scheduling-2' of https://github.com/stephanie-wa…

40f0e1c

…ng/ray into memory-scheduling-2

remove random files

30ba93e

Check object priority in map during Get

de4eb4f

Revert "threshold version"

3442138

This reverts commit ba57bb2.

Avoid modifying leased_workers during iterate, block tasks from being…

c646427

… dispatched if priority is too low

Add debug script

7c413a2

Merge remote-tracking branch 'origin/memory-scheduling' into memory-s…

704fab1

…cheduling-2

stephanie-wang merged commit 00cfdef into memory-scheduling Nov 16, 2021

stephanie-wang deleted the memory-scheduling-2 branch November 16, 2021 19:07

stephanie-wang pushed a commit that referenced this pull request Jan 27, 2022

[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star …

d5bfb7b

…style) #3 (ray-project#21652)

stephanie-wang pushed a commit that referenced this pull request Jun 21, 2023

[Datasets] Streaming executor fixes #3 (ray-project#32836)

4cc3a53

stephanie-wang pushed a commit that referenced this pull request Dec 21, 2023

[RLlib] New ConnectorV2 API #3: Introduce actual ConnectorV2 API. (r…

bd555a0

…ay-project#41074) (ray-project#41212)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Memory scheduling 2 DFS with Priority #3

Memory scheduling 2 DFS with Priority #3

jaewan commented Nov 11, 2021

stephanie-wang left a comment

stephanie-wang Nov 11, 2021

stephanie-wang Nov 11, 2021

stephanie-wang Nov 11, 2021

jaewan Nov 11, 2021

stephanie-wang Nov 11, 2021

Memory scheduling 2 DFS with Priority #3

Memory scheduling 2 DFS with Priority #3

Conversation

jaewan commented Nov 11, 2021

Why are these changes needed?

Related issue number

Checks

stephanie-wang left a comment

Choose a reason for hiding this comment

stephanie-wang Nov 11, 2021

Choose a reason for hiding this comment

stephanie-wang Nov 11, 2021

Choose a reason for hiding this comment

stephanie-wang Nov 11, 2021

Choose a reason for hiding this comment

jaewan Nov 11, 2021

Choose a reason for hiding this comment

stephanie-wang Nov 11, 2021

Choose a reason for hiding this comment