(graphcache) - Fix API results from being reused incorrectly for queries #1196

kitten · 2020-12-04T19:59:53Z

This was originally reported by @sarmeyer.

Tracking down the root culprit and the intended fix was difficult, since the effects of this bug were subtle, the cause was hard to find, and it can be poorly patched over by several smaller fixes. Ultimately, tl;dr, the bug can be seen here: https://github.com/FormidableLabs/urql/blob/eee828914fb6083b2aaa3203484ed6b36b7beff5/exchanges/graphcache/src/operations/query.ts#L93-L97

Summary

Fix original query data from APIs being reused accidentally, which can lead to subtle mismatches in results when the cacheExchange updates the API's incoming query results to apply resolvers. Specifically, this may lead to relations being set back to null when the resolver returns a different list of links than the result, since the result may contain null relations that belong to unrelated items. If you're using relayPagination then this fix is critical.

Investigation

When a result comes back from the API it is written to the cache, but the cacheExchange also queries it again from the cache, to update its data using resolvers. This is a great technique to make any API result look the same as if it was directly queried from the cache.
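The write-then-read flow described above can be sketched roughly as follows. This is a toy model, not Graphcache's code: `cache`, `write`, `read` and the `Resolver` type are hypothetical names made up for illustration.

```typescript
type Data = Record<string, unknown>;
type Resolver = (parent: Data) => unknown;

// Hypothetical normalised cache: entity key -> stored fields.
const cache = new Map<string, Data>();

// Step 1: write an API entity into the cache under "Typename:id".
function write(entity: Data): string {
  const entityKey = String(entity.__typename) + ':' + String(entity.id);
  cache.set(entityKey, { ...cache.get(entityKey), ...entity });
  return entityKey;
}

// Step 2: read the entity back from the cache, letting resolvers
// replace the stored value of individual fields.
function read(entityKey: string, resolvers: Record<string, Resolver>): Data | null {
  const stored = cache.get(entityKey);
  if (!stored) return null;
  const result: Data = {}; // a new object, never the API object itself
  for (const field of Object.keys(stored)) {
    const resolver = resolvers[field];
    result[field] = resolver ? resolver(stored) : stored[field];
  }
  return result;
}

const apiTodo: Data = { __typename: 'Todo', id: '1', text: 'write tests' };
const todoKey = write(apiTodo);
const fromCache = read(todoKey, {
  text: parent => String(parent.text).toUpperCase(),
});
```

Here `fromCache` carries the resolver's version of `text`, while `apiTodo` is left exactly as the API returned it.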

For mutations and subscriptions this is also nice, because we have a special readRoot case that copies values over from the originalData, since not all fields on "root results" are cached, as they're not normalised. This is easily confused with readSelection's concept of data, which is the target that we write results to. At some point we must've either gotten confused or assumed that it'd be great for readSelection (which is for normalised data) to also use the original API data as its data input.
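To make the distinction concrete, here is a minimal sketch of the two kinds of read, assuming a simplified model. `readRootSketch`, `readSelectionSketch` and `entityCache` are hypothetical stand-ins and do not mirror the real signatures of readRoot or readSelection.

```typescript
type Data = Record<string, unknown>;

// Hypothetical cache that only holds normalised entities.
const entityCache: Record<string, Data> = {
  'Todo:1': { __typename: 'Todo', id: '1', text: 'cached text' },
};

// Toy "root" read: root fields are not normalised and therefore not cached,
// so their values are copied over from the original API data.
function readRootSketch(fields: string[], originalData: Data): Data {
  const output: Data = {};
  for (const field of fields) output[field] = originalData[field];
  return output;
}

// Toy "selection" read: normalised data is fully queryable from the cache,
// so no original data is needed and the write target starts out empty.
function readSelectionSketch(entityKey: string, fields: string[]): Data | null {
  const entity = entityCache[entityKey];
  if (!entity) return null;
  const output: Data = {};
  for (const field of fields) {
    if (!(field in entity)) return null; // uncached field: treat as a miss
    output[field] = entity[field];
  }
  return output;
}

const mutationResult: Data = { ok: true, elapsedMs: 12 };
const rootData = readRootSketch(['ok', 'elapsedMs'], mutationResult);
const todoData = readSelectionSketch('Todo:1', ['id', 'text']);
const missing = readSelectionSketch('Todo:1', ['id', 'author']);
```

In this sketch only the root read has any reason to look at the original data; the selection read either resolves everything from the cache or reports a miss.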

This becomes a problem because it's not necessary and can cause bugs. It's not necessary because we're dealing with normalised data, so the data should be completely queryable from the cache, and reusing the original data will just cause confusion. It also causes bugs (which is how @sarmeyer discovered this), because if a resolver returns a list of items that in turn have links (i.e. relations) to other entities, then the list in the result's original data may have a different order or length. The original data will still be used as the target, and if one of its fields is set to null then another special case causes that field to be considered null again, whether that's actually correct or not, see: https://github.com/FormidableLabs/urql/blob/eee828914fb6083b2aaa3203484ed6b36b7beff5/exchanges/graphcache/src/operations/query.ts#L499

In this line we check whether prevData === null. This is important because a past selection set may have contained uncached fields but a future one for the same path in the query may not, which would mean that the data would be returned even though it contained uncached fields. This test illustrates why this check exists: https://github.com/FormidableLabs/urql/blob/eee828914fb6083b2aaa3203484ed6b36b7beff5/exchanges/graphcache/src/operations/query.test.ts#L225-L233

But when we use the API result as an input then prevData isn't just the data that Graphcache built up from its cache, it's also the data from the API result that's being reused. And when a list contains unrelated items with null fields then this check will instead cause this relation to become null although the cached data exists.
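The failure mode can be reproduced in a small toy model. The sketch below is not Graphcache's implementation: `readTodo`, `todoLinks` and `authorRecords` are hypothetical, and the null check is only a simplified analogue of the `prevData === null` case.

```typescript
type Data = Record<string, unknown>;

// Hypothetical normalised cache: each todo stores a link (key) to its author.
const todoLinks: Record<string, { id: string; author: string | null }> = {
  'Todo:1': { id: '1', author: null },
  'Todo:2': { id: '2', author: 'Author:1' },
};
const authorRecords: Record<string, Data> = {
  'Author:1': { id: '1', name: 'Ada' },
};

// Reads one todo into `target`. A field that is already null on the
// target stays null (toy analogue of the prevData === null check).
function readTodo(entityKey: string, target: Data): Data {
  const todo = todoLinks[entityKey];
  if (!todo) throw new Error('Unknown entity: ' + entityKey);
  const prevAuthor = target.author;
  target.id = todo.id;
  if (prevAuthor === null) {
    target.author = null;
  } else {
    target.author = todo.author ? { ...authorRecords[todo.author] } : null;
  }
  return target;
}

// A resolver returned the list in a different order than the API did.
const resolvedKeys = ['Todo:2', 'Todo:1'];

// The API result: the first item has a null author, the second does not.
const apiResult: Data[] = [
  { id: '1', author: null },
  { id: '2', author: { id: '1', name: 'Ada' } },
];

// Buggy variant: API objects are reused as write targets, matched by index.
const reused = resolvedKeys.map((k, i) => readTodo(k, apiResult[i] ?? {}));

// Fixed variant: every read gets a new, empty object to write to.
const fresh = resolvedKeys.map(k => readTodo(k, {}));
```

In the buggy variant, `Todo:2` is written onto the API object that belonged to `Todo:1`, inherits its null `author`, and the API object itself is modified. In the fixed variant, `Todo:2` keeps its cached author.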

Another side-effect of this bug is that because API results are reused, some references/objects/instances are overwritten by Graphcache, which is bad. It should never mutate data it gets from the API, but just write it to its cache then create a new query result.

Set of changes

Add a reproduction test case
Fix some tests that should've warned us about this being a problem 🤦
Ensure that readSelection in read always gets a new, empty object to write to.

changeset-bot · 2020-12-04T19:59:57Z

🦋 Changeset detected

Latest commit: 554cbcc

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
@urql/exchange-graphcache	Patch


kitten · 2020-12-04T20:23:52Z

exchanges/graphcache/src/operations/query.ts

-      ? readRoot(ctx, rootKey, rootSelect, data)
-      : readSelection(ctx, rootKey, rootSelect, data);
+      ? readRoot(ctx, rootKey, rootSelect, input || ({} as Data))
+      : readSelection(ctx, rootKey, rootSelect, {} as Data);


This is the relevant line for review :)

@JoviDeCroock Merging as a hotfix; the review should be simple, but @sarmeyer has tested this and it works just fine 🙌

kitten added 5 commits December 4, 2020 19:41

Add test case demonstrating original data affecting results

7a99c55

Remove unexpected __typename fields from test fixtures

26a5e1c

Assert in test that result data shouldn't be the same reference

ba0ab0d

Prevent query original result data from being reused

48dd3cf

Add changeset

1438102

kitten requested a review from JoviDeCroock December 4, 2020 19:59

kitten changed the title ~~Fix/graphcache reused original data~~ (graphcache) - Fix API results from being reused incorrectly for queries Dec 4, 2020

Simplify some checks from undefined to truthiness

554cbcc


kitten merged commit 1168694 into main Dec 4, 2020

kitten deleted the fix/graphcache-reused-original-data branch December 4, 2020 20:54

github-actions bot mentioned this pull request Dec 4, 2020

Version Packages #1197

Merged

kitten mentioned this pull request Feb 5, 2021

(graphcache) - Fix uncached GraphQLError fields making it impossible to get first API result #1367

Merged
