
Clean up job data on both Scheduler and Executor #188

Merged (3 commits) on Oct 12, 2022

Conversation

mingmwang
Contributor

Which issue does this PR close?

Closes #9 and #185.

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

@mingmwang
Contributor Author

Keyspace::CompletedJobs
};

let lock = state.lock(keyspace.clone(), "").await?;
Member

I think this will lock the whole keyspace in standalone mode 🤔

    async fn lock(&self, keyspace: Keyspace, key: &str) -> Result<Box<dyn Lock>> {
        let mut mlock = self.locks.lock().await;
        let lock_key = format!("/{:?}/{}", keyspace, key);
        if let Some(lock) = mlock.get(&lock_key) {
            // An entry already exists for this keyspace/key: lock the shared
            // mutex and return the owned guard
            Ok(Box::new(lock.clone().lock_owned().await))
        } else {
            // First lock request for this key: create the mutex, register it, and lock it
            let new_lock = Arc::new(Mutex::new(()));
            mlock.insert(lock_key, new_lock.clone());
            Ok(Box::new(new_lock.lock_owned().await))
        }
    }

Member

Oh, this lock is only taken inside async fn lock(&self, keyspace: Keyspace, key: &str), and it will be released quickly after that call.
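
For reference, lock_owned() on a tokio::sync::Mutex returns an owned guard, so the mutex stays locked until that guard is dropped rather than ending when the lock() helper returns. A minimal standalone sketch of that behaviour (not Ballista code):

    use std::sync::Arc;
    use tokio::sync::Mutex;

    #[tokio::main]
    async fn main() {
        let m = Arc::new(Mutex::new(()));

        // lock_owned() returns an OwnedMutexGuard; the mutex stays locked
        // until this guard is dropped, even after the acquiring function returns.
        let guard = m.clone().lock_owned().await;

        // The mutex is held while `guard` is alive; a second lock_owned()
        // call would wait here. Drop the guard to release the lock.
        drop(guard);
        let _second = m.lock_owned().await; // acquires immediately now
    }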

Comment on lines 618 to 621
let alive_executors = executor_manager.get_alive_executors_within_one_minute();
for executor in alive_executors {
let job_id_clone = job_id.to_owned();
let executor_manager_clone = executor_manager.clone();
Member

I think having each SQL query send an RPC to all executors is not a good idea (for interactive queries that finish in milliseconds) 🤔

Member

Maybe we can make some improvements later!

Contributor Author

Yes, in the current code base the TaskManager/ExecutionGraph does not track which executors the tasks were executed on.
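
For illustration only, a rough sketch (not Ballista's actual types) of the improvement discussed above: record which executors ran tasks for each job, so that clean-up RPCs can target just those executors instead of broadcasting to every alive executor.

    use std::collections::{HashMap, HashSet};

    // Hypothetical index: job_id -> executor_ids that ran at least one task for the job
    #[derive(Default)]
    struct JobExecutorIndex {
        executors_by_job: HashMap<String, HashSet<String>>,
    }

    impl JobExecutorIndex {
        // Called whenever a task status update arrives from an executor
        fn record_task(&mut self, job_id: &str, executor_id: &str) {
            self.executors_by_job
                .entry(job_id.to_string())
                .or_default()
                .insert(executor_id.to_string());
        }

        // Executors that need a clean-up RPC when the job finishes
        fn executors_for(&self, job_id: &str) -> Vec<String> {
            self.executors_by_job
                .get(job_id)
                .map(|ids| ids.iter().cloned().collect())
                .unwrap_or_default()
        }
    }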

let job_id_str = job_id.to_owned();
let active_job_cache = self.active_job_cache.clone();
tokio::spawn(async move {
tokio::time::sleep(Duration::from_secs(CLEANUP_FINISHED_JOB_DELAY_SECS))
Member

(For interactive queries that finish in milliseconds) I think this will keep a lot of futures alive on the heap. Just my opinion.
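
For context, a minimal sketch of the spawn-per-job pattern being discussed here (names simplified, not the PR's exact code): each finished job spawns a task that sleeps for the whole delay before cleaning up, which is what the concern about many futures on the heap refers to.

    use std::time::Duration;

    const CLEANUP_FINISHED_JOB_DELAY_SECS: u64 = 300;

    // One such task is spawned per finished job and stays alive for the full delay
    fn schedule_cleanup(job_id: String) {
        tokio::spawn(async move {
            tokio::time::sleep(Duration::from_secs(CLEANUP_FINISHED_JOB_DELAY_SECS)).await;
            // ...delete the job's state and shuffle data here...
            println!("cleaning up data for job {}", job_id);
        });
    }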

error!("{}", msg);
Status::internal(msg)
})?
.post_event(QueryStageSchedulerEvent::JobCancel(job_id))
Contributor

Nice refinement. It's better to have only one entry point for modifying the scheduler state; every state change should have a corresponding event and be handled in the event loop.
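
For reference, a generic sketch of the single-entry-point pattern praised here (illustrative names, not Ballista's actual event types): every state change is expressed as an event posted to one channel and handled in a single loop.

    use tokio::sync::mpsc;

    #[derive(Debug)]
    enum SchedulerEvent {
        JobCancel(String),
        JobDataClean(String),
    }

    // The only place that mutates scheduler state: one loop draining one channel
    async fn run_event_loop(mut rx: mpsc::Receiver<SchedulerEvent>) {
        while let Some(event) = rx.recv().await {
            match event {
                SchedulerEvent::JobCancel(job_id) => {
                    println!("cancelling job {}", job_id);
                }
                SchedulerEvent::JobDataClean(job_id) => {
                    println!("cleaning up data for job {}", job_id);
                }
            }
        }
    }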

@andygrove
Member

andygrove commented Sep 25, 2022

I am constantly filling my disk up with shuffle files so would love to see us get this merged before the 0.9.0 release.

@mingmwang Could you rebase when you get a chance and I will test this out and review the PR as well.

@mingmwang
Contributor Author

I am constantly filling my disk up with shuffle files so would love to see us get this merged before the 0.9.0 release.

@mingmwang Could you rebase when you get a chance and I will test this out and review the PR as well.

Sure, working on it.

@mingmwang
Contributor Author

@andygrove @yahoNanJing @Ted-Jiang

BTW, in this PR the job data in the state store will also be deleted after 300s.
I think we need a follow-up PR to move the completed (Success or Failed) job data from the state store to the ObjectStore
for long-term storage, so that the Scheduler UI can read it from the ObjectStore.

Please share your thoughts.

const CLEANUP_FINISHED_JOB_DELAY_SECS: u64 = 300;

    async fn clean_up_job_data(
        state: Arc<dyn StateBackendClient>,
        active_job_cache: ExecutionGraphCache,
        failed: bool,
        job_id: String,
        executor_manager: Option<ExecutorManager>,
    ) -> Result<()> {
        // Drop the job's execution graph from the in-memory cache
        let mut active_graph_cache = active_job_cache.write().await;
        active_graph_cache.remove(&job_id);

        // Finished jobs live in either the FailedJobs or CompletedJobs keyspace
        let keyspace = if failed {
            Keyspace::FailedJobs
        } else {
            Keyspace::CompletedJobs
        };

        // Delete the job's entry from the state backend under the keyspace lock
        let lock = state.lock(keyspace.clone(), "").await?;
        with_lock(lock, state.delete(keyspace, &job_id)).await?;

        // Also clean up the job's data on the executors, if an ExecutorManager was provided
        if let Some(em) = executor_manager {
            Self::clean_up_executors_data(job_id.clone(), em).await;
        }
        Ok(())
    }

type ExecutionGraphCache = Arc<RwLock<HashMap<String, Arc<RwLock<ExecutionGraph>>>>>;

const CLEANUP_FINISHED_JOB_DELAY_SECS: u64 = 300;
Member

We should make this configurable. Some of the queries I am testing take much longer than 300 seconds. We already have the ability to set configs on the context.

Member

Never mind, this is a delay after the job completes. I would still like to see this configurable, but we could do that in a follow-on PR.
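
As a rough illustration of the follow-up suggested here (the review prefers the existing context configs; the environment-variable name below is purely hypothetical), the delay could be read from configuration with the current constant as the default:

    use std::time::Duration;

    const DEFAULT_CLEANUP_FINISHED_JOB_DELAY_SECS: u64 = 300;

    // Hypothetical: read the delay from an env var and fall back to the default
    fn cleanup_finished_job_delay() -> Duration {
        let secs = std::env::var("BALLISTA_FINISHED_JOB_DATA_CLEAN_UP_DELAY_SECS")
            .ok()
            .and_then(|v| v.parse::<u64>().ok())
            .unwrap_or(DEFAULT_CLEANUP_FINISHED_JOB_DELAY_SECS);
        Duration::from_secs(secs)
    }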

Member

@andygrove andygrove left a comment

LGTM. Thanks @mingmwang

@andygrove
Member

@mingmwang could you fix the conflicts here when you have the time so that we can merge this?

@mingmwang
Contributor Author

@mingmwang could you fix the conflicts here when you have the time so that we can merge this?

Sure, I will fix the conflicts tomorrow.

@mingmwang
Contributor Author

Resolved conflicts.

@andygrove
Member

Thanks again @mingmwang

@andygrove andygrove merged commit e42a6c9 into apache:master Oct 12, 2022
Successfully merging this pull request may close these issues.

Need clean up intermediate data in Ballista
4 participants