
fix(iroh-dns-server): remove accidental blocking from store #2985

Open
wants to merge 2 commits into main

Conversation

dignifiedquire
Contributor

Description

I found this during investigation of #2972. It turned out that both blocking locks and blocking IO calls are being made in the DNS server's store implementation.
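
For illustration, this is the general shape of such a fix (not necessarily the exact patch in this PR): hold the redb Database in an Arc and run the blocking transaction work on tokio's blocking thread pool. Key and method names here are illustrative, not copied from the iroh-dns-server code.

```rust
use std::sync::Arc;

use anyhow::Result;
use redb::{Database, TableDefinition};

// Illustrative table definition; the real store keys on a SignedPacketsKey.
const SIGNED_PACKETS_TABLE: TableDefinition<&[u8; 32], &[u8]> =
    TableDefinition::new("signed-packets-v1");

#[derive(Debug, Clone)]
pub struct SignedPacketStore {
    // Arc so the handle can be cloned into spawn_blocking closures.
    db: Arc<Database>,
}

impl SignedPacketStore {
    pub async fn upsert(&self, key: [u8; 32], packet: Vec<u8>) -> Result<()> {
        let db = self.db.clone();
        // The blocking redb write transaction runs on the blocking thread pool,
        // not on the async executor threads.
        tokio::task::spawn_blocking(move || -> Result<()> {
            let tx = db.begin_write()?;
            {
                let mut table = tx.open_table(SIGNED_PACKETS_TABLE)?;
                table.insert(&key, packet.as_slice())?;
            }
            tx.commit()?;
            Ok(())
        })
        .await??;
        Ok(())
    }
}
```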

Breaking Changes

None

Notes & open questions

This might not be optimal, but for now it is the safest way to stop blocking the whole runtime.

Change checklist

  • Self-review.
  • Documentation updates following the style guide, if relevant.
  • Tests if relevant.
  • All breaking changes documented.

@dignifiedquire dignifiedquire added this to the v0.29.0 milestone Dec 2, 2024

github-actions bot commented Dec 2, 2024

Documentation for this PR has been generated and is available at: https://n0-computer.github.io/iroh/pr/2985/docs/iroh/

Last updated: 2024-12-02T10:26:25Z


github-actions bot commented Dec 2, 2024

Netsim report & logs for this PR have been generated and are available at: LOGS
This report will remain available for 3 days.

Last updated for commit: d80ba1a

@rklaehn
Contributor

rklaehn commented Dec 2, 2024

Does this lead to a measurable improvement?

I would say the big performance problem is that we are creating a write transaction per op, so we are limited to a small number of txns per second (~1000) no matter how beefy the machine is.

If we really want to speed this up we need a design similar to what is done in the blob store: batch write transactions and sync at regular intervals. In the case of the discovery server we can even be more sloppy, e.g. losing the last second of updates in a crash is really no big deal, since discovery information gets republished.
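
For illustration, a rough sketch of what such batching could look like (this is not the blob store's actual implementation; channel sizes, the table definition, and the 100 ms pacing are made up). Writes are queued on a channel and drained into a single redb write transaction, so one commit (and one fsync) covers a whole batch.

```rust
use std::{sync::Arc, time::Duration};

use anyhow::Result;
use redb::{Database, TableDefinition};
use tokio::sync::mpsc;

// Illustrative table definition, matching the sketch above.
const SIGNED_PACKETS_TABLE: TableDefinition<&[u8; 32], &[u8]> =
    TableDefinition::new("signed-packets-v1");

pub fn spawn_batch_writer(db: Arc<Database>) -> mpsc::Sender<([u8; 32], Vec<u8>)> {
    let (tx, mut rx) = mpsc::channel::<([u8; 32], Vec<u8>)>(1024);
    tokio::task::spawn_blocking(move || -> Result<()> {
        loop {
            // Block until at least one write arrives; exit when all senders are gone.
            let Some(first) = rx.blocking_recv() else { break };
            let write_tx = db.begin_write()?;
            {
                let mut table = write_tx.open_table(SIGNED_PACKETS_TABLE)?;
                table.insert(&first.0, first.1.as_slice())?;
                // Drain whatever else arrived in the meantime into the same transaction.
                while let Ok((key, value)) = rx.try_recv() {
                    table.insert(&key, value.as_slice())?;
                }
            }
            // One fsync for the whole batch instead of one per operation.
            write_tx.commit()?;
            // Pace commits; anything queued during the sleep lands in the next batch.
            std::thread::sleep(Duration::from_millis(100));
        }
        Ok(())
    });
    tx
}
```

The trade-off is exactly the one described above: a crash loses at most the last un-committed batch of updates, which is acceptable for discovery data that gets republished anyway.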

```diff
@@ -14,7 +14,7 @@ const SIGNED_PACKETS_TABLE: TableDefinition<&SignedPacketsKey, &[u8]> =

 #[derive(Debug)]
 pub struct SignedPacketStore {
-    db: Database,
+    db: Arc<Database>,
```
Contributor

@matheus23 matheus23 Dec 2, 2024

redb should really implement Clone for Database.

Contributor

You mean Clone? This is how we do it with the protocols, they don't (need to) impl Clone, and people have to wrap them in an Arc... 🤣

Contributor

Yeah sorry, I edited my comment :D

@Arqu
Collaborator

Arqu commented Dec 2, 2024

I really should try to set up a benchmark for this.

Contributor

@matheus23 matheus23 left a comment

Yeah good fixes.

I also like @rklaehn's suggestion (happy to quickly add another PR for that if needed).

I'm also slightly confused by ZoneCache. Why is one cache an LRU, and the other a TTL?
We wrap them both with a mutex in the end, even though we don't need the mutex for the TTL.
And I don't think we need ZoneCache::cache to be an LruCache, do we? They're both caches; we could use TtlCache for both of them, avoiding the top-level mutex entirely.

@dignifiedquire
Contributor Author

If we really want to speed this up we need a design similar to what is done in the blob store: batch write transactions and sync at regular intervals. In the case of the discovery server we can even be more sloppy, e.g. losing the last second of updates in a crash is really no big deal, since discovery information gets republished.

I don't think we should do this until we have figured out the write amplification issues. Current write amplification is around 10x, which means we might need to drop redb anyway. This is just a stopgap until we have done so.

@dignifiedquire
Contributor Author

Does this lead to a measurable improvement?

Unclear as yet, since this hasn't been deployed. But we have seen reports that both read and write calls are quite slow under heavy load.

Labels: None yet
Projects: Status: 🏗 In progress
4 participants