alter_table: Support adding columns to tables #30470

ParkMyCar · 2024-11-13T20:51:52Z

This PR implements the SQL feature ALTER TABLE ... ADD COLUMN ....

Note: There are a lot of lines changed but the majority are new tests!

Specifically it:

Uses VersionedRelationDesc on Tables to track new columns
Adds a CatalogCollectionEntry which adds some typing around getting the current RelationDesc for an entry.
Updates the storage-controller to create new Persist WriteHandles and pass them to the TxnsTableWorker
Updates the storage controller's user of Persist's CriticalSinceHandle to open one per-version of a table. This proved necessary to get the proper read handles for Mat Views on top of tables.

Otherwise it also adds several tests:

alter-table.slt which exercises a number of different scenarios
A new Check for the platform-checks test framework
A new Action for the parallel-workload test framework.
- This new action is currently disabled because off a race condition in Persist's schema registry
A legacy upgrade test to make sure we have coverage on the restart of MZ test case.

Motivation

Fixes https://github.com/MaterializeInc/database-issues/issues/8233

Tips for reviewer

I split the PR up into separate commits to ideally make it easier to review, most the changes here are new tests!

Changes to the Catalog APIs and name resolution to support versions.
Changes to sequencing and the storage controller. @bkirwi I would appreciate your eyes on this one!
Tests
Formatting and clippy

Checklist

This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).
If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.

shepherdlybot · 2024-11-19T18:50:05Z

Mitigations

Completing required mitigations increases Resilience Coverage.

(Required) Code Review 🔍 Detected
(Required) Feature Flag
(Required) Integration Test 🔍 Detected
(Required) Observability 🔍 Detected
(Required) QA Review 🔍 Detected
(Required) Run Nightly Tests
Unit Test

Risk Summary:

The risk score for this pull request is high at 80, driven by predictors such as the sum of bug reports of files and the delta of executable lines. Historically, pull requests with these predictors are 114% more likely to cause a bug compared to the repository baseline. Additionally, the repository's observed and predicted bug trends are both decreasing, which is a positive sign.

Note: The risk score is not based on semantic analysis but on historical predictors of bug occurrence in the repository. The attributes above were deemed the strongest predictors based on that history. Predictors and the score may change as the PR evolves in code, time, and review activity.

Bug Hotspots:
What's This?

File	Percentile
../catalog/apply.rs	98
../src/main.rs	97
../catalog/state.rs	91
../src/coord.rs	100
../src/catalog.rs	98
../src/names.rs	92
../catalog/open.rs	99
../src/lib.rs	95
../catalog/transact.rs	93

def-

I very much appreciate all the tests!

What happens if you call ALTER TABLE ADD COLUMN on a continual task/table from source/mv/...? What if you try to add a column with the name of the table? Can you keep adding columns or do things get slower with number of columns? Could add some tests checking for correct errors in the SLT.

misc/python/materialize/checks/all_checks/alter_table.py

misc/python/materialize/parallel_workload/action.py

misc/python/materialize/checks/all_checks/alter_table.py

jkosh44

Adapter code LGTM

jkosh44 · 2024-11-19T19:53:58Z

src/adapter/src/catalog.rs

+                .map(|id| self.get_entry_by_global_id(id))
+                .filter_map(|entry| entry.index().map(|index| index.on));


I think I might be missing something, what was the change here?

This was just some Rust lifetime shenanigans. .index() returns an Option<&Index> but .get_entry_by_global_id(...) returns an owned type that only lives for the duration of the .map(...) call

jkosh44 · 2024-11-19T19:58:52Z

src/catalog/src/durable/transaction.rs

-            storage_collection_metadata: TableTransaction::new_with_uniqueness_fn(
-                storage_collection_metadata,
-                |a: &StorageCollectionMetadataValue, b| a.shard == b.shard,
-            )?,


Just confirming, now we can have multiple global IDs for the same object that all point to the same shard?

jkosh44 · 2024-11-19T20:03:24Z

src/adapter/src/catalog/state.rs

        let item_id = self
            .entry_by_global_id
            .get(id)
            .unwrap_or_else(|| panic!("catalog out of sync, missing id {id:?}"));
-        self.get_entry(item_id)
+
+        let entry = self.get_entry(item_id).clone();


It feels a little bad to clone the entry in this function. This used to be pretty cheap but now involves cloning potentially large expressions and create sql statements.

I totally agree, it's a bit tricky with Rust lifetimes and the trait CatalogItem, I'll circle back and see if I can improve this though. There might be a Cow<...> like thing we can do

jkosh44 · 2024-11-19T20:21:11Z

src/repr/src/relation.rs

+impl From<RelationVersion> for SchemaId {
+    fn from(value: RelationVersion) -> Self {
+        SchemaId(usize::cast_from(value.0))
+    }
+}


I don't think I understand this, what's the correlation b/w a relation version and a schema version?

Right now RelationVersions are 1:1 with SchemaIds. At some point we can break this relationship and store the mapping somewhere in the Catalog, but it's not necessary at the moment.

jkosh44 · 2024-11-19T20:24:33Z

src/catalog/src/memory/objects.rs

+    fn latest_version(&self) -> Option<RelationVersion> {
+        self.entry.latest_version()


I'm surprised that this returns an Option, when would an entry ever not have a version?

An entry only has a version if it's version-able, i.e. only Tables will return Some here.

src/catalog/src/memory/objects.rs

jkosh44 · 2024-11-19T21:30:57Z

src/sql/src/plan/statement/ddl.rs

+            let is_versioned = c
+                .options
+                .iter()
+                .any(|o| matches!(o.option, ColumnOption::Versioned { .. }));
+            !is_versioned


Why are we filtering here?

Added a comment, but it's because of how the names collection is used, I took a note for myself to refactor this entire block

bkirwi · 2024-12-10T16:49:05Z

src/storage-client/src/storage_collections.rs

@@ -642,14 +649,16 @@ where
        // Construct the handle in a separate block to ensure all error paths
        // are diverging
        let since_handle = {
+            // If the collection we're openning is versioned, be sure to use a
+            // different CriticalId so the SinceHandles don't conflict.
+            let reader_id = match version {


It's still not clear to me that we're managing the lifecycle of this handle properly... for example, I think the finalization task will only force-downgrade the handle of the controller-global handle, not these per-version handles. (And at a first pass it's not clear to me how all N critical handles get updated when the write frontier advances...)

ParkMyCar · 2024-12-17T00:12:03Z

Still iterating a bit, but the most recent commit removes the need for multiple CriticalSinceHandles and implements an approach @petrosagg and I talked about where earlier versions of a table track later versions as dependencies, and through initial testing things seems to workout!

At a high level the implementation is there, but pushed up the commits to run against CI

* changes Table to use a VersionedRelationDesc * adds CatalogCollectionEntry and move desc(...) method to it * renamed CatalogEntry::desc to CatalogEntry::desc_latest

* implement sequencing in the adapter * updates to the storage controller * change CriticalSinceHandles so we can have multiple per-shard

* add a good number of test cases to alter-table.slt * delete duplicate table_alter.slt * add a platform-check for Alter Table * add a (disable) parallel-workload case for Alter Table * add a legacy upgrade test for Alter Table

* refactors how alter_table_desc is implemented so the storage-controller tracks a dependency between different versions of the tables * removes the API for creating a CriticalId from a 'seed'

* update bootstraping storage collections to properly order Tables * fix upgrade tests by removing reference to old persist dyncfg

* change AstDisplay for ResolvedItemName to not print the version * remove commented our impl of previous sorting * update legacy upgrade test

* a few of the conditions we were testing fail when --auto-index-selects was enabled

* when creating collections for bootstrap, re-order them like we do in storage-collections

* in the storage-controller report the correct dependencies for tables * in the Coordinator register ReadPolicies * in the storage collections install read holds with the existing collections read frontier, not the implied capability * in storage collections set the write frontier of the new collection to the write frontier of the existing collection

ParkMyCar force-pushed the alter_table2/support-adding-columns-to-tables branch 6 times, most recently from b1ba847 to 6edd439 Compare November 19, 2024 18:13

ParkMyCar marked this pull request as ready for review November 19, 2024 18:47

ParkMyCar requested review from a team as code owners November 19, 2024 18:47

ParkMyCar requested review from jkosh44 and bkirwi November 19, 2024 18:47

def- reviewed Nov 19, 2024

View reviewed changes

misc/python/materialize/checks/all_checks/alter_table.py Outdated Show resolved Hide resolved

misc/python/materialize/parallel_workload/action.py Show resolved Hide resolved

def- reviewed Nov 19, 2024

View reviewed changes

misc/python/materialize/checks/all_checks/alter_table.py Show resolved Hide resolved

jkosh44 approved these changes Nov 19, 2024

View reviewed changes

ParkMyCar force-pushed the alter_table2/support-adding-columns-to-tables branch from 6edd439 to 02b0137 Compare December 5, 2024 15:13

jkosh44 approved these changes Dec 5, 2024

View reviewed changes

ParkMyCar requested a review from petrosagg December 10, 2024 15:10

bkirwi reviewed Dec 10, 2024

View reviewed changes

ParkMyCar force-pushed the alter_table2/support-adding-columns-to-tables branch 8 times, most recently from 1b30ffa to 7c3e244 Compare December 18, 2024 20:35

bobbyiliev mentioned this pull request Dec 20, 2024

Add support for ALTER TABLE ... ADD COLUMN ... MaterializeInc/terraform-provider-materialize#685

Open

ParkMyCar force-pushed the alter_table2/support-adding-columns-to-tables branch 4 times, most recently from f18f1b3 to 8e3af36 Compare December 22, 2024 19:17

ParkMyCar added 13 commits January 2, 2025 09:23

start, support ALTER TABLE ... ADD COLUMN ...

0aa564d

* changes Table to use a VersionedRelationDesc * adds CatalogCollectionEntry and move desc(...) method to it * renamed CatalogEntry::desc to CatalogEntry::desc_latest

implement alter_table

08b7ce9

* implement sequencing in the adapter * updates to the storage controller * change CriticalSinceHandles so we can have multiple per-shard

add new test cases

1bf693c

* add a good number of test cases to alter-table.slt * delete duplicate table_alter.slt * add a platform-check for Alter Table * add a (disable) parallel-workload case for Alter Table * add a legacy upgrade test for Alter Table

fix fmt, clippy, rebase fixes, and small test fixes

b7dc484

respond to GitHub feedback

6a4a738

fix clippy

de59911

remove the need for multiple CriticalSinceHandles

b448dbe

* refactors how alter_table_desc is implemented so the storage-controller tracks a dependency between different versions of the tables * removes the API for creating a CriticalId from a 'seed'

adjust some tests, small fixes after rebase

8235b59

another round of fixes

6a20883

* update bootstraping storage collections to properly order Tables * fix upgrade tests by removing reference to old persist dyncfg

one more round of fixes

feb4752

* change AstDisplay for ResolvedItemName to not print the version * remove commented our impl of previous sorting * update legacy upgrade test

update alter-table.slt

0a0f19b

* a few of the conditions we were testing fail when --auto-index-selects was enabled

update storage-controller create_collections_for_bootstrap

8412f3f

* when creating collections for bootstrap, re-order them like we do in storage-collections

ParkMyCar force-pushed the alter_table2/support-adding-columns-to-tables branch from 8e3af36 to 219f937 Compare January 2, 2025 14:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

alter_table: Support adding columns to tables #30470

alter_table: Support adding columns to tables #30470

ParkMyCar commented Nov 13, 2024 •

edited

Loading

shepherdlybot bot commented Nov 19, 2024 •

edited

Loading

def- left a comment

jkosh44 left a comment

jkosh44 Nov 19, 2024

ParkMyCar Dec 5, 2024

jkosh44 Nov 19, 2024

ParkMyCar Dec 5, 2024

jkosh44 Nov 19, 2024

ParkMyCar Dec 5, 2024

jkosh44 Nov 19, 2024

ParkMyCar Dec 5, 2024

jkosh44 Nov 19, 2024

ParkMyCar Dec 5, 2024

jkosh44 Nov 19, 2024

ParkMyCar Dec 5, 2024

bkirwi Dec 10, 2024

ParkMyCar commented Dec 17, 2024

		.map(\|id\| self.get_entry_by_global_id(id))
		.filter_map(\|entry\| entry.index().map(\|index\| index.on));

		fn latest_version(&self) -> Option<RelationVersion> {
		self.entry.latest_version()

alter_table: Support adding columns to tables #30470

Are you sure you want to change the base?

alter_table: Support adding columns to tables #30470

Conversation

ParkMyCar commented Nov 13, 2024 • edited Loading

Motivation

Tips for reviewer

Checklist

shepherdlybot bot commented Nov 19, 2024 • edited Loading

Mitigations

def- left a comment

Choose a reason for hiding this comment

jkosh44 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ParkMyCar commented Dec 17, 2024

ParkMyCar commented Nov 13, 2024 •

edited

Loading

shepherdlybot bot commented Nov 19, 2024 •

edited

Loading