perf(robot-server): Store one command per row #14348

SyntaxColoring · 2024-01-24T19:54:00Z

Overview

Closes RSS-132. See that ticket for background.

Test Plan

I think we're sufficiently covered by existing automated integration tests.

I've reviewed some of the new queries with SQLite's EXPLAIN QUERY PLAN. They look like they're all using fast index-based searches instead of slow full table scans.

Changelog

Remove the run.commands column, where each value was a large list of commands.
In its place, add a run_commands table, where each row holds just a single command.

run_id index_in_run command_id command

run1 0 abcd [blob]

run1 1 efgh [blob]

run2 0 ijkl [blob]

run2 1 mnop [blob]

... ... ... ...
Update RunStore to use the new table.
Add a migration.

Review requests

This new table will be our biggest, by far. Tens of thousands of records, as opposed to ~20 in our existing tables. One of SQL's traps is that it's easy to accidentally write a very inefficient query. If the right indexes aren't set up, SQLite will degrade to a full O(n) table scan. So scrutinize my queries to make sure we're not doing that.

Also see my inline review comments.

Risk assessment

Medium. See review requests above.

robot-server/robot_server/persistence/_tables/schema_3.py

robot-server/robot_server/persistence/_migrations/up_to_3.py

SyntaxColoring · 2024-01-25T18:36:28Z

robot-server/robot_server/runs/run_store.py

    def _clear_caches(self) -> None:
        self.has.cache_clear()
        self.get.cache_clear()
        self.get_all.cache_clear()
        self.get_state_summary.cache_clear()
        self.get_command.cache_clear()
-        self._get_all_unparsed_commands.cache_clear()


This cache was mostly saving the computational work of repeatedly parsing the whole list of commands. It's removed here "accidentally," because we never parse the whole list of commands anymore: we only parse the slices that are requested, when they're requested.

If we wanted to retain the cache, I guess the spiritual sequel would be a pair of caches whose keys are (run_id, command_index) and (run_id, command_id). It wouldn't be as simple as an @lru_cache anymore. We could do this, I guess, but I'm skeptical that it's worthwhile. Our app no longer requests big batches of run commands. And if it did, I'd urge us to find ways to speed up the actual underlying processing, like what we did in #13425.

I suppose for consistency, we should remove the cache on get_command(id) too.

sfoster1

comments on db stuff

robot-server/robot_server/persistence/_migrations/up_to_3.py

robot-server/robot_server/persistence/_tables/schema_3.py

robot-server/robot_server/runs/run_store.py

codecov · 2024-01-30T17:26:04Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (ffcc548) 68.29% compared to head (98624e3) 68.25%.
Report is 1 commits behind head on edge.

❗ Current head 98624e3 differs from pull request most recent head a8156e5. Consider uploading reports for the commit a8156e5 to get more accurate results

Additional details and impacted files

@@            Coverage Diff             @@
##             edge   #14348      +/-   ##
==========================================
- Coverage   68.29%   68.25%   -0.05%     
==========================================
  Files        1623     2512     +889     
  Lines       54858    71910   +17052     
  Branches     4115     9174    +5059     
==========================================
+ Hits        37466    49080   +11614     
- Misses      16705    20664    +3959     
- Partials      687     2166    +1479

Flag	Coverage Δ
app	`64.83% <ø> (+29.99%)`	⬆️
components	`49.62% <ø> (ø)`
labware-library	`41.10% <ø> (ø)`
protocol-designer	`38.01% <ø> (ø)`
react-api-client	`66.16% <ø> (ø)`
step-generation	`86.90% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Files	Coverage Δ
robot-server/robot_server/runs/run_store.py	`100.00% <ø> (ø)`

... and 897 files with indirect coverage changes

CaseyBatten

Looks good overall, some comments below with questions and clarifications.

robot-server/robot_server/runs/run_store.py

robot-server/robot_server/persistence/_migrations/up_to_3.py

SyntaxColoring requested a review from a team January 24, 2024 19:54

SyntaxColoring changed the base branch from edge to db_draft_migration January 24, 2024 19:54

SyntaxColoring commented Jan 25, 2024

View reviewed changes

robot-server/robot_server/persistence/_tables/schema_3.py Outdated Show resolved Hide resolved

SyntaxColoring force-pushed the db_run_commands_as_rows branch from e82a168 to d6d0713 Compare January 25, 2024 17:40

SyntaxColoring commented Jan 25, 2024

View reviewed changes

robot-server/robot_server/persistence/_migrations/up_to_3.py Show resolved Hide resolved

SyntaxColoring commented Jan 25, 2024

View reviewed changes

SyntaxColoring marked this pull request as ready for review January 25, 2024 18:46

SyntaxColoring requested a review from a team as a code owner January 25, 2024 18:46

sfoster1 reviewed Jan 25, 2024

View reviewed changes

SyntaxColoring mentioned this pull request Jan 26, 2024

refactor(robot-server): Store Pydantic objects as JSON instead of pickles, take 2 #14355

Merged

SyntaxColoring force-pushed the db_draft_migration branch from f6d285c to 32f0eb1 Compare January 30, 2024 17:14

SyntaxColoring force-pushed the db_run_commands_as_rows branch from 0e8d735 to 98624e3 Compare January 30, 2024 17:21

CaseyBatten approved these changes Jan 30, 2024

View reviewed changes

robot-server/robot_server/runs/run_store.py Show resolved Hide resolved

robot-server/robot_server/persistence/_migrations/up_to_3.py Show resolved Hide resolved

Base automatically changed from db_draft_migration to edge January 30, 2024 22:42

SyntaxColoring added 8 commits January 30, 2024 17:44

Add run_command table to replace run.commmand column.

dd9f37d

Update RunStore to use new table.

a31c437

Implement migration.

e307b33

Consolidate loops.

aedbbe4

Update todo comment with ticket reference.

016bc2d

Add missing ORDER BY to run migration.

ffced5e

Start a comment for the migration summary.

6a227df

Use a synthetic primary key.

a8156e5

SyntaxColoring force-pushed the db_run_commands_as_rows branch from 98624e3 to a8156e5 Compare January 30, 2024 22:45

SyntaxColoring merged commit 063a435 into edge Jan 30, 2024
15 checks passed

SyntaxColoring deleted the db_run_commands_as_rows branch January 30, 2024 22:48

ncdiehl11 pushed a commit that referenced this pull request Feb 1, 2024

perf(robot-server): Store one command per row (#14348)

927c049

SyntaxColoring mentioned this pull request Feb 1, 2024

chore(release): Add release notes for v7.2.0 database migrations #14409

Merged

SyntaxColoring mentioned this pull request Feb 9, 2024

test(robot-server): Check the number of returned commands in persistence snapshot tests #14466

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(robot-server): Store one command per row #14348

perf(robot-server): Store one command per row #14348

SyntaxColoring commented Jan 24, 2024 •

edited

Loading

SyntaxColoring Jan 25, 2024 •

edited

Loading

SyntaxColoring Jan 25, 2024 •

edited

Loading

sfoster1 left a comment

codecov bot commented Jan 30, 2024 •

edited

Loading

CaseyBatten left a comment

run_id	index_in_run	command_id	command
run1	0	abcd	[blob]
run1	1	efgh	[blob]
run2	0	ijkl	[blob]
run2	1	mnop	[blob]
...	...	...	...

perf(robot-server): Store one command per row #14348

perf(robot-server): Store one command per row #14348

Conversation

SyntaxColoring commented Jan 24, 2024 • edited Loading

Overview

Test Plan

Changelog

Review requests

Risk assessment

SyntaxColoring Jan 25, 2024 • edited Loading

Choose a reason for hiding this comment

SyntaxColoring Jan 25, 2024 • edited Loading

Choose a reason for hiding this comment

sfoster1 left a comment

Choose a reason for hiding this comment

codecov bot commented Jan 30, 2024 • edited Loading

Codecov Report

CaseyBatten left a comment

Choose a reason for hiding this comment

SyntaxColoring commented Jan 24, 2024 •

edited

Loading

SyntaxColoring Jan 25, 2024 •

edited

Loading

SyntaxColoring Jan 25, 2024 •

edited

Loading

codecov bot commented Jan 30, 2024 •

edited

Loading