[graphql] query costing #14463

wlmyng · 2023-10-26T21:56:22Z

Description

Now that we've split out query building from execution, we can run a cost analysis on each query through the db's EXPLAIN, and stop early if the estimated cost exceeds the query cost limit set in config.
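As a rough illustration of the idea (not the code in this PR): assuming the db is Postgres and EXPLAIN is run in its default text format, the first plan line carries a `cost=startup..total` estimate that can be compared against the configured limit. The function names and the `f64` type for `max_db_query_cost` are assumptions made for this sketch.

```rust
/// Extract the planner's estimated total cost from the first line of
/// text-format EXPLAIN output, e.g.
/// "Seq Scan on objects  (cost=0.00..155.00 rows=10000 width=4)".
/// Returns None if the line does not contain a cost estimate.
fn parse_total_cost(plan_line: &str) -> Option<f64> {
    // Everything after "cost=", e.g. "0.00..155.00 rows=10000 width=4)".
    let after = plan_line.split("cost=").nth(1)?;
    // The "startup..total" pair ends at the next whitespace.
    let range = after.split_whitespace().next()?;
    let (_startup, total) = range.split_once("..")?;
    total.parse().ok()
}

/// Stop early if the estimate exceeds the limit; otherwise return the cost.
/// `max_db_query_cost` is assumed to be a float here for illustration.
fn check_cost(plan_line: &str, max_db_query_cost: f64) -> Result<f64, String> {
    let cost = parse_total_cost(plan_line)
        .ok_or_else(|| "could not read a cost estimate from the plan".to_string())?;
    if cost > max_db_query_cost {
        Err(format!(
            "estimated cost {cost} exceeds limit {max_db_query_cost}"
        ))
    } else {
        Ok(cost)
    }
}
```

The estimate is in the planner's own arbitrary units, so the limit has to be tuned against real queries rather than read as a time or row count.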

Test Plan

Manual testing, making sure existing tests pass

If your changes are not user-facing and not a breaking change, you can skip the following section. Otherwise, please indicate what changed, and then add to the Release Notes section as highlighted during the release process.

Type of Change (Check all that apply)

protocol change
user-visible impact
breaking change for a client SDKs
breaking change for FNs (FN binary must upgrade)
breaking change for validators or node operators (must upgrade binaries)
breaking change for on-chain data layout
necessitate either a data wipe or data migration

Release notes


amnn · 2023-10-27T12:21:40Z

crates/sui-graphql-rpc/src/context_data/db_query_cost.rs

+    let re = Regex::new(r"(LIMIT\s+)\$(\d+)")
+        .map_err(|e| crate::error::Error::Internal(format!("Failed create valid regex: {}", e)))?;


If we need Regexes like this, we should be able to create them once (statically) and then re-use them.

Setting the limit to DEFAULT_PAGE_SIZE kind of defeats the point of this check because it means that someone can create a query that requests 1000x the default page size, and we'll think it costs as much as a default page. I'm fine with us incrementally building towards the costing algorithm, but do we have a plan for how to get the query with all the parameters in place? Without that, this form of costing is not going to be effective.

Ah, thanks. I'm not very familiar with regex, but I will look into that.

And yes, a follow-up PR will apply the correct bindings. On that note, does it make sense for us to cap the limit at the default page size, or should we allow users to provide an arbitrary limit? I assumed the former (although the code doesn't do the capping today).

But yes, the true cost and the estimated cost here are a bit off right now, because the placeholder is replaced with these arbitrary values.
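One possible shape for the follow-up, sketched under assumptions: cap whatever limit the client asked for, then substitute that value (rather than DEFAULT_PAGE_SIZE) for the `LIMIT $n` placeholder before costing, so a large page is costed as a large page. The constant values, `MAX_PAGE_SIZE`, and both function names are hypothetical, and the substitution uses plain string handling instead of the regex in this PR to keep the sketch dependency-free.

```rust
// Hypothetical values; the real ones would come from the service config.
const DEFAULT_PAGE_SIZE: u64 = 50;
const MAX_PAGE_SIZE: u64 = 500;

/// The limit to both cost and execute with: the caller's value capped at
/// MAX_PAGE_SIZE, or DEFAULT_PAGE_SIZE when none was given.
fn effective_limit(requested: Option<u64>) -> u64 {
    requested.unwrap_or(DEFAULT_PAGE_SIZE).min(MAX_PAGE_SIZE)
}

/// Replace the bind placeholder after the first `LIMIT` (e.g. `LIMIT $3`)
/// with a concrete value. Returns the input unchanged if no such
/// placeholder is found. Only the first `LIMIT` is considered.
fn bind_limit(sql: &str, limit: u64) -> String {
    const KEYWORD: &str = "LIMIT";
    let Some(start) = sql.find(KEYWORD) else {
        return sql.to_string();
    };
    let after_keyword = &sql[start + KEYWORD.len()..];
    let trimmed = after_keyword.trim_start();
    // Require whitespace between LIMIT and the placeholder (like `\s+`).
    if trimmed.len() == after_keyword.len() {
        return sql.to_string();
    }
    let Some(rest) = trimmed.strip_prefix('$') else {
        return sql.to_string();
    };
    // Length of the placeholder index, e.g. 1 for "$3".
    let digits = rest.chars().take_while(|c| c.is_ascii_digit()).count();
    if digits == 0 {
        return sql.to_string();
    }
    format!("{}{} {}{}", &sql[..start], KEYWORD, limit, &rest[digits..])
}
```

With this, a request for 50,000 rows would be costed and executed at 500, and a request with no limit at 50, so the estimate and the executed query agree on page size.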

amnn · 2023-10-27T12:25:59Z

crates/sui-graphql-rpc/src/context_data/db_data_provider.rs

+        E: From<diesel::result::Error> + std::error::Error + Send + 'static,
+        T: Send + 'static,
+    {
+        let max_db_query_cost = self.limits.max_db_query_cost;


This limit applies to all queries that are run as part of a single request, but currently it looks like we're judging the cost of each transaction individually.

This was a question I had posed to @oxade, and I think I was convinced by the explanation (unless I misunderstood it) that the node and complexity limits already cover the whole request, so the query cost limit is just there to make sure that each individual db operation is not overly complex.

Ah okay, my recollection of past discussions was that this step was also intended to cover the whole request, so that each cost check (node + depth, explain, timeout) is more rigorous than the one before it. Otherwise, a bad actor can take advantage of the discrepancy between our least and most expensive nodes to compute:

If we set a local explain limit (like in this PR), it has to be generous enough to run a sensible query involving our cheaper nodes, but then the bad actor can fill up a query with really expensive nodes to query (like the analytics queries for explorer, or the APY query), and overload the system. If we set a global limit, we don't have that problem.
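To make the difference concrete, here is a minimal sketch of a global limit: a per-request budget that every db query charges its estimated cost against, so the request fails once the running total passes the limit. The `RequestCostBudget` type, its methods, and the integer cost unit are hypothetical and not part of this PR.

```rust
use std::sync::atomic::{AtomicU64, Ordering};

/// One budget per GraphQL request, shared by all resolvers in that request.
struct RequestCostBudget {
    max_db_query_cost: u64,
    spent: AtomicU64,
}

impl RequestCostBudget {
    fn new(max_db_query_cost: u64) -> Self {
        Self {
            max_db_query_cost,
            spent: AtomicU64::new(0),
        }
    }

    /// Add one query's estimated cost to the running total. Returns the new
    /// total, or an error once the whole request is over budget.
    fn charge(&self, estimated_cost: u64) -> Result<u64, String> {
        let total = self
            .spent
            .fetch_add(estimated_cost, Ordering::Relaxed)
            .saturating_add(estimated_cost);
        if total > self.max_db_query_cost {
            Err(format!(
                "request cost {total} exceeds limit {}",
                self.max_db_query_cost
            ))
        } else {
            Ok(total)
        }
    }
}
```

With a limit of 100, three queries costing 40 each would all pass a local per-query check, but the third `charge(40)` fails here because the request total reaches 120.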

wlmyng force-pushed the graphql-query-costing branch from 1a2c95f to 1be43e1 Compare October 26, 2023 22:03

wlmyng added 5 commits October 26, 2023 15:13

lord i hope this works

de0e849

wahoo

1ae5b98

push this out for look

25f88a5

ok

d42db04

??

45b69af

wlmyng force-pushed the graphql-query-costing branch from 1be43e1 to 45b69af Compare October 26, 2023 22:26

wlmyng added 2 commits October 26, 2023 17:47

ok

7482c95

formatting

7ac64c5

wlmyng marked this pull request as ready for review October 27, 2023 00:48

wlmyng requested review from oxade, amnn and stefan-mysten as code owners October 27, 2023 00:48

wlmyng added 2 commits October 26, 2023 17:48

ugh

9152df5

some more cleanup

3347964

oxade approved these changes Oct 27, 2023

wlmyng merged commit 8904957 into main Oct 27, 2023
32 checks passed

wlmyng deleted the graphql-query-costing branch October 27, 2023 02:41

amnn reviewed Oct 27, 2023

jonas-lj pushed a commit to jonas-lj/sui that referenced this pull request Nov 2, 2023

[graphql] query costing (MystenLabs#14463)

e67ed7c
