Convert to synchronous GraphQL execution and data loading #1531
Conversation
@bharath2020 Since you mentioned slow data loading performance in #1519, it would be interesting to see what the numbers look like with this PR. There is much more we can do to improve performance, but this may already make a difference.
@RolandasRazma @AnthonyMDev This PR reverts the lock that was added to …
Thanks for the info, @martijnwalraven
@martijnwalraven Will take a look at it and share the results soon.
The store is the lowest possible level to share, and it is thread-safe, but the normalized cache isn't designed to be used independently. Individual operations on the cache cannot offer any transaction guarantees, which is why you always have to go through the store. I hope this makes sense. The main issue is that multiple batches of records will be loaded in the course of loading a single GraphQL query from the store, so you want to make sure there are no intervening writes. That's why …
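To make that concrete, here is a minimal sketch of reading through the store rather than touching the cache directly, assuming the withinReadTransaction API as it looks after this PR (HeroNameQuery stands in for any generated query type, and store is an existing ApolloStore):

// Reads go through the store's transaction API so an entire query is
// loaded without intervening writes; never hit the NormalizedCache directly.
store.withinReadTransaction({ transaction in
  try transaction.read(query: HeroNameQuery())
}, callbackQueue: .main) { result in
  switch result {
  case .success(let data):
    print("hero: \(String(describing: data.hero?.name))")
  case .failure(let error):
    print("read failed: \(error)")
  }
}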
We should probably take a swing at updating the docs to a) differentiate what a store is vs. what a cache is, and b) make sure we're crystal clear that you can share a store between multiple clients, but not a cache between multiple stores (with the attendant multiple clients).
Been a while since I've looked at this code. Do you think there is a simple way to add some asserts that fail if you attempt to use a cache with multiple stores? Make this fail in DEBUG builds and shoot off a warning in prod builds? |
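Purely as a hypothetical sketch (none of these names exist in the library), the kind of guard that could catch this:

import Foundation

// Hypothetical guard: a cache remembers the first store that claims it and
// complains if a second one shows up. assertionFailure() traps in DEBUG
// builds only; the log line is what release builds would see.
final class CacheOwnershipGuard {
  private weak var owner: AnyObject?
  private let lock = NSLock()

  func register(store: AnyObject) {
    lock.lock()
    defer { lock.unlock() }
    if let existing = owner, existing !== store {
      assertionFailure("This NormalizedCache is already attached to another ApolloStore")
      NSLog("warning: NormalizedCache attached to more than one ApolloStore")
    }
    owner = store
  }
}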
Cannot tell you how delighted I am to have you back to help rip this out; I'd have made such a hash of this 😄
I think it's also important for others reading this to note that while the cache itself is synchronous, it's being read on a background queue via transactions in the store, so it isn't blocking the thread it's called from. The wording "synchronous GraphQL execution" may give some people a heart attack if they don't realize that.
<key>com.apple.XCTPerformanceMetric_WallClockTime</key>
<dict>
  <key>baselineAverage</key>
  <real>0.075663</real>
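For context, baselines like the one above are recorded by XCTest's measure API; a minimal sketch (class and test names are hypothetical):

import XCTest

final class LoadQueryPerformanceTests: XCTestCase {
  func testLoadQueryFromStorePerformance() {
    // Xcode stores the measured wall-clock average for this block as the
    // baselineAverage in an .xcbaseline plist like the one above.
    measure {
      // ... synchronously load a query from the store ...
    }
  }
}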
Are you setting up the perf tests to run in this PR? If so, we should probably get a baseline from CI rather than from one of our (almost certainly faster) local machines.
If it's not in this PR, keep that in mind for when we turn it on.
No, I haven't looked into establishing a baseline on CI. That would have to somehow remember the results and make a commit; I'm not sure how other projects handle that. I also don't think we should run performance tests on CI by default, because they are pretty costly (they need a whole separate optimized build). So it might be ok to only run them locally for now.
Maybe we can set up a separate job to do the perf tests after merges - I'd definitely like to have them automated and running frequently, but it does make sense not to have them be a blocker to merging PRs.
(this should be handled in a separate PR though)
self.queue.async {
  do {
    let returnValue = try body(ReadTransaction(store: self))
Looks like an extra newline snuck in here and below.
Not sure what happened there; I can't seem to find it?
Weird, I'm still seeing it in the diff - lines 107 and 131
Ah, those were intentional. Do you think it's too much?
Great, thanks! I should also mention it's important to run performance tests on a release build, because compiler optimizations like inlining seem to make a huge difference for this code.
@martijnwalraven I have the results for reading from SQLiteDatabase. For the same set of records (200) and device (iPhone XS) that I used to measure the read timings in #1519, the read timing with the changes in this PR is brought down to 26ms from the 80ms I had in PR #1519.
func testLazilyEvaluateAllIsDeferred() throws {
  let possiblyDeferreds: [PossiblyDeferred<String>] = [.deferred { "foo" }, .deferred { "bar" }]

  var numberOfInvocations = 0
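The diff cuts off here; a hedged sketch of how such a test might continue, assuming the map and lazilyEvaluateAll helpers discussed in this PR:

  // Wrap each element so evaluations can be counted, combine lazily, then
  // force the combined value once and check that each thunk ran exactly once.
  let counted = possiblyDeferreds.map { deferred in
    deferred.map { value -> String in
      numberOfInvocations += 1
      return value
    }
  }
  let combined = lazilyEvaluateAll(counted)
  XCTAssertEqual(numberOfInvocations, 0)  // still deferred, nothing has run yet
  _ = try combined.get()
  XCTAssertEqual(numberOfInvocations, 2)  // both thunks ran when forced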
All these numberOfInvocations tests make me wonder: are we doing anything to limit things to one invocation, or is this something where the number of invocations could be arbitrary?
The number of invocations could be arbitrary, so any caching you want to do would happen downstream (in DataLoader, for example). I thought about making PossiblyDeferred cache the result by changing itself from deferred to immediate the first time the deferred result gets resolved. But that won't really work because it is a value type, and changing it to a class makes these more heavyweight than I wanted.
True - I think I'm more thinking about what happens if you accidentally call get twice - you'd get foobarbar, right? I guess at that point the format of the incorrect data should be a giveaway, but if this is in a big chain it could be difficult to tell where you accidentally called something twice.
Ah, you're talking specifically about what happens with the test. We also have assertions for numberOfInvocations, so that should tell you what happened?
No - I was using the test setup as an example. I'm wondering what happens if you call it twice in general. It's clear that in the test it's only getting called once.
I'm not sure I know what you mean. A PossiblyDeferred that's .deferred can be invoked multiple times; that's by design and should not be an issue. It's similar to how you could call result.get() multiple times.
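A quick illustration of that point, using the same deferred/immediate shape (see the sketch under the PR description at the bottom of this thread):

// Each get() on a .deferred value re-runs its thunk, just like calling the
// thunk directly; nothing memoizes the result.
var invocations = 0
let deferred = PossiblyDeferred<String>.deferred {
  invocations += 1
  return "foo"
}
_ = try deferred.get()  // invocations == 1
_ = try deferred.get()  // invocations == 2, by design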
Thanks for checking, that's great news! Do you mean 260ms and 800ms by any chance, since #1519 mentions 0.8 seconds?
Ah, typo. Yes, it is 260ms vs. 800ms.
@@ -1,4 +1,5 @@
 import Foundation
+import ApolloCore
For CocoaPods purposes this needs to be wrapped in an #if !COCOAPODS gate because of how we have the lib set up. I'll make a ticket to deal with actually getting the libs to reflect the proper names for CocoaPods, but for now just slapping the wrapper on should do; see the sketch below.
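A minimal sketch of the wrapper, assuming the COCOAPODS compilation condition that CocoaPods defines automatically for pod targets:

// CocoaPods builds Apollo as a single module, so the separate ApolloCore
// module doesn't exist there and the import must be skipped.
#if !COCOAPODS
import ApolloCore
#endif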
Ok, done. We should talk about this separately, but I'm wondering why ApolloCore is a separate module, and what should or shouldn't go in there.
The short answer is that some stuff is used in both the codegen lib and the main lib; the long answer is that eventually we'd like to separate the types needed for codegen from the runtime to reduce what needs to be imported.
Fix that merge conflict and you're good to go!
This PR removes our promise implementation and modifies GraphQLExecutor to execute GraphQL operations synchronously. It also changes the NormalizedCache protocol to expose synchronous methods instead of callbacks. Perhaps surprisingly, this should actually improve our ability to execute operations concurrently.

ApolloStore still supports concurrent reads and blocking writes through its existing transaction model. Operations within a transaction will execute synchronously, however, which avoids the threading issues we were seeing before as a result of over-relying on fine-grained asynchronous execution. In the existing model, the overhead of constant context switches and blocking threads has been shown to negatively impact performance. It also runs the danger of thread explosion and starvation, which we've seen mostly in tests but could also become an issue under real-world conditions.
This is a pretty substantial change, so it would be great if people could try this out and report back. I've seen some performance improvements in tests, but real-world usage may show different results.
Because we rely on lazy evaluation to perform efficient batched data loading, this PR adds a PossiblyDeferred wrapper that allows us to keep the same data loading semantics as before, while avoiding asynchronous execution. Instead, deferred values invoke a thunk when they are accessed, which is used to load the next batch of records from the cache.
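For readers skimming the thread, here is a minimal sketch of the shape being described; the real type in this PR carries more operations (mapping, combining), so treat this as illustrative only:

// Either a value we already have, or a thunk that produces one on demand.
enum PossiblyDeferred<Value> {
  case immediate(Result<Value, Error>)
  case deferred(() throws -> Value)

  // Forcing a .deferred value runs its thunk every time it is called;
  // as discussed above, the value type deliberately does not memoize.
  func get() throws -> Value {
    switch self {
    case .immediate(let result):
      return try result.get()
    case .deferred(let thunk):
      return try thunk()
    }
  }
}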