Move slow db interactions to tokio blocking pool #905

lutter · 2019-05-02T15:45:22Z

All database interactions going through Diesel are synchronous, even though most of them happen within tokio's cooperative multitasking framework. That means that long-running db interactions will block the tokio worker they are running on; if enough workers (by default, one per CPU core) block, the whole system grinds to a halt. At that point, no work can be done, including work that doesn't even involve the database, but simply gets scheduled on the tokio runtime.

Upstream Diesel is not going to add async database interactions anytime soon so we need to come up with our own approach. One possible way to address this is with tokio's tokio_threadpool::blocking where slow work gets pushed to a separate pool of slow tasks, freeing up the main workers.

Open questions around this:

how to best integrate that and how to change our internal API's (should most Store functions return Future?)
how to determine which db work is slow and needs to be treated that way (might be that we should do all db work on the blocking pool)

The text was updated successfully, but these errors were encountered:

lutter · 2020-03-04T19:04:45Z

What we should do is change the store to move actual work to the blocking pool itself by having some internal function Store.with_connection(f: Fn(Connection) -> Result) -> impl Future<Result> which first acquires a semaphore sized according to the max number of connections in the pool and then executes f on the blocking pool. We'd then change all Store methods that right now just get a connection to use with_connection and do their work inside that.

Once we do that, we should also remove the semaphore introduced in #1522

leoyvens · 2021-04-05T23:38:57Z

We have with_conn now so the API question is solved. Maybe we don't use it everywhere we should but keeping this issue open isn't what's going to solve that.

leoyvens · 2021-04-06T21:06:06Z

Actually we have way too much spawn_blocking around our code to consider this done. This will be done when graph::task_spawn::spawn_blocking is no longer in use, since it assumes that a future may block.

neysofu · 2022-09-14T09:56:46Z

cc @mangas because we discussed this during yesterday's standup.

This was referenced May 10, 2019

Measure effect of tokio contention in event processing #926

Merged

Faster mapping execution #856

Closed

Reduce thread contention in subgraph indexing #937

Closed

Audit uses of Mutex or RwLock for performance #186

Closed

That3Percent mentioned this issue Jan 15, 2020

Update to tokio 0.2 and futures 0.3 #1448

Merged

lutter mentioned this issue Mar 4, 2020

Spawn websocket connections in the core pool #1522

Merged

leoyvens closed this as completed Apr 5, 2021

leoyvens reopened this Apr 6, 2021

leoyvens mentioned this issue Apr 7, 2021

Fix hanging when starting over 512 subgraphs #2354

Merged

mangas self-assigned this Dec 6, 2022

fordN added the Stale label Apr 9, 2024

mangas removed their assignment Apr 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move slow db interactions to tokio blocking pool #905

Move slow db interactions to tokio blocking pool #905

lutter commented May 2, 2019 •

edited

Loading

lutter commented Mar 4, 2020 •

edited

Loading

leoyvens commented Apr 5, 2021

leoyvens commented Apr 6, 2021

neysofu commented Sep 14, 2022

Move slow db interactions to tokio blocking pool #905

Move slow db interactions to tokio blocking pool #905

Comments

lutter commented May 2, 2019 • edited Loading

lutter commented Mar 4, 2020 • edited Loading

leoyvens commented Apr 5, 2021

leoyvens commented Apr 6, 2021

neysofu commented Sep 14, 2022

lutter commented May 2, 2019 •

edited

Loading

lutter commented Mar 4, 2020 •

edited

Loading