
sql: primary keys cannot be changed #19141

Closed
benesch opened this issue Oct 9, 2017 · 24 comments
Assignees
Labels
A-partitioning A-schema-changes A-sql-pgcompat Semantic compatibility with PostgreSQL C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception) X-anchored-telemetry The issue number is anchored by telemetry references.

Comments

@benesch
Contributor

benesch commented Oct 9, 2017

It is not currently possible to change a table's primary key, e.g. via

ALTER TABLE foo DROP CONSTRAINT old_pkey;
ALTER TABLE foo ADD PRIMARY KEY (new, pkey, cols)

This becomes more important in 1.2 because the upcoming partitioning feature (#18683) will only allow partitioning by prefixes of the primary key. For example, suppose you want to partition this table by country:

CREATE TABLE users (
  id INT PRIMARY KEY,
  name STRING,
  country STRING,
  ...
)

The table needs to have a composite primary key that lists country first:

CREATE TABLE users (
  id INT,
  name STRING,
  country STRING,
  ...,
  PRIMARY KEY (country, id)
) PARTITION BY LIST (country)...

This essentially requires that the schema be designed with partitioning in mind from the get-go until we can alter primary keys.

@benesch benesch added this to the Later milestone Oct 9, 2017
@benesch
Contributor Author

benesch commented Oct 9, 2017

/cc @awoods187 for triage

@a6802739
Contributor

a6802739 commented Oct 31, 2017

@benesch, Do you mean we should support SQL syntax like:

ALTER TABLE foo ADD PRIMARY KEY (new, pkey, cols)

Or just change the primary key of table users from id to (id, country) when we try to add a partition for this table but the partition column is not in the primary key?

We would have to change the data organization when changing the primary key of the specified table, so this operation's overhead is quite expensive:

  1. re-insert the same data within the replica, but with the key changed from /TableID/IndexID/ColumnID/id to /TableID/IndexID/ColumnID/(id, country) (the value changes in the same way), and clear the original key-value pairs.
  2. change the TableDescriptor.
  3. backfill.
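The rewrite in step 1 can be sketched with a toy in-memory KV map. The slash-separated key layout and all function names below are illustrative only, not CockroachDB's actual key encoding:

```python
# Toy sketch of re-encoding rows from the old primary key (id) to the
# new composite key (id, country). Key layout and names are made up.

def encode_key(table_id, index_id, *pk_values):
    """Encode a primary-key tuple into a slash-separated key string."""
    return "/" + "/".join(str(p) for p in (table_id, index_id, *pk_values))

def rewrite_primary_index(kv, table_id, old_index_id, new_index_id, new_pk_cols):
    """Re-insert each row under the new composite key and clear the
    original key-value pair, as in step 1 above."""
    old_prefix = f"/{table_id}/{old_index_id}/"
    for old_key in [k for k in kv if k.startswith(old_prefix)]:
        row = kv.pop(old_key)                    # clear the original KV
        pk = tuple(row[c] for c in new_pk_cols)  # e.g. (1, "uk")
        kv[encode_key(table_id, new_index_id, *pk)] = row

kv = {encode_key(50, 1, 1): {"id": 1, "name": "ada", "country": "uk"}}
rewrite_primary_index(kv, 50, 1, 2, ("id", "country"))
print(kv)  # {'/50/2/1/uk': {'id': 1, 'name': 'ada', 'country': 'uk'}}
```

Steps 2 and 3 would then publish the new index via the TableDescriptor and backfill any rows written before the switch.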

@benesch
Contributor Author

benesch commented Nov 1, 2017

@a6802739 I mean we should support arbitrary primary key changes like

ALTER TABLE foo ADD PRIMARY KEY (new, pkey, cols)

Since changing a table's primary key is such an expensive operation, we definitely don't want to do it implicitly if the partitioning scheme specifies columns that are not in the primary key. We'd much rather return an error in that case, and let the user alter the pkey only if they so choose.

Note that traditionally changing the pkey on a table with an existing pkey has required two steps:

ALTER TABLE foo DROP CONSTRAINT foo_pkey;
ALTER TABLE foo ADD PRIMARY KEY (new, pkey, cols);

That's going to be needlessly expensive in Cockroach. Dropping the primary key requires creating a hidden rowid column to use as the implicit pkey—this already happens when tables without a primary key are created from scratch—and rewriting all the data in terms of rowid, just to rewrite it all in terms of (new, pkey, cols) when the new primary key is added. We should do one or more of the following:

  1. Introduce new syntax to alter a primary key in one step:

    ALTER TABLE foo ALTER PRIMARY KEY TO (new, pkey, cols)
    
  2. Coalesce the operations when they occur in the same statement:

    ALTER TABLE foo DROP CONSTRAINT foo_pkey, ADD PRIMARY KEY (new, pkey, cols);
    
  3. Coalesce the operations when they appear in ALTER TABLEs within the same txn with no intervening writes:

    BEGIN;
    ALTER TABLE foo DROP CONSTRAINT foo_pkey;
    ALTER TABLE foo ADD PRIMARY KEY (new, pkey, cols);
    COMMIT;
    

My preference would be to eventually support all three, but start by supporting just option one. Options two and three are better for compatibility, but changing a table's primary key probably isn't something you want to lean on your ORM for.

Option three has some pathological edge cases FWIW:

CREATE TABLE foo (a INT PRIMARY KEY, b STRING);
INSERT INTO foo (a, b) VALUES (1, 'a');
BEGIN;
ALTER TABLE foo DROP CONSTRAINT foo_pkey;
INSERT INTO foo (a, b) VALUES (1, 'b');
ALTER TABLE foo ADD PRIMARY KEY (b);
COMMIT;
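A toy model of the pathology: after the DROP, the intervening INSERT legally duplicates the old key a = 1, so the two ALTERs cannot be coalesced into a simple in-place rewrite keyed on a; the new primary index has to be built with its own uniqueness check. This Python sketch uses hypothetical names:

```python
# Toy model of the edge case above: after DROP CONSTRAINT, duplicate
# values in the old key column `a` are legal, so the new primary index
# on `b` must be built from scratch with its own uniqueness check.
# All names here are illustrative.

def build_primary_index(rows, pk_cols):
    """Build an index keyed on pk_cols, rejecting duplicate keys."""
    index = {}
    for row in rows:
        key = tuple(row[c] for c in pk_cols)
        if key in index:
            raise ValueError(f"duplicate primary key {key}")
        index[key] = row
    return index

# State of foo at COMMIT time: a=1 appears twice, which the old PK
# would have forbidden.
rows = [{"a": 1, "b": "a"}, {"a": 1, "b": "b"}]
print(sorted(build_primary_index(rows, ("b",))))  # [('a',), ('b',)]

# Rebuilding on the old key now fails, so naive coalescing is unsound:
try:
    build_primary_index(rows, ("a",))
except ValueError as e:
    print(e)  # duplicate primary key (1,)
```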

@jordanlewis @vivekmenezes @danhhz @knz @cockroachdb/sql-async does this sound reasonable?

@knz
Contributor

knz commented Nov 2, 2017

Yes it's reasonable but @a6802739 please do not underestimate the complexity of the task! As much as possible, please split the work in small steps that can be easily reviewed. It might be beneficial that you study the problem and write down your implementation plan before you start (and perhaps share it with us).

@knz
Contributor

knz commented Nov 2, 2017

I meant to write "reasonable" in my previous comment.

@a6802739
Contributor

a6802739 commented Nov 2, 2017

@knz, I will first try to understand what I should do for this task, and then write down how to implement it. I may run into some confusion in the process, and hope I can get some help then.

Thank you very much.

@cuongdo
Contributor

cuongdo commented Nov 2, 2017

@a6802739 Thank you, as always, for your help. I've discussed this project with the team, and because other high-priority projects with upcoming deadlines rely on this feature, we'll need to develop this feature internally.

@cuongdo cuongdo assigned danhhz and unassigned a6802739 Nov 2, 2017
@cuongdo
Contributor

cuongdo commented Nov 2, 2017

assigned to @danhhz to make sure this finds a good home

@awoods187 awoods187 modified the milestones: 1.2, 1.3 Nov 9, 2017
@petermattis petermattis assigned maddyblue and bobvawter and unassigned danhhz Mar 13, 2018
@petermattis
Collaborator

Has there been any thought on how this would be implemented? Changing the primary key is difficult because it is stored in indexes. One thought is that we make a copy of all of the secondary indexes and backfill both them and the new primary index and afterwards delete the old secondary indexes and primary index. Seems like that would conceptually work, though it would be slow.

@mjibson, @danhhz, @vivekmenezes Thoughts?

@jordanlewis
Member

It depends on whether there's a requirement for this to happen online.

If it needs to be online, which is what @benesch's first suggestion implies (ALTER TABLE ALTER PRIMARY KEY), then I agree with your suggestion. The phases in that case would be the same as adding an index, more or less:

  1. The new indexes and table descriptor would be created
  2. The old table descriptor would be updated so that writes go to both the old and new copies. If the table had n secondary indexes, the write path would then end up writing 2 * (n + 1) rows per update, guaranteeing that all writes from then on are reflected in the new copies.
  3. A backfill process begins for the table and all of its secondary indexes, at some time t before the write path began writing to the new copies.
  4. Once the backfill is finished, it's safe to switch reads to come from the new copies, and begin deleting the old copies.

If it doesn't have to be online, then perhaps that first step of "dropping the primary key" could effectively just put the table into read only mode until a new primary key is added, at which point we could perform the backfills above without having to do the table descriptor double write path dance.
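The four online phases above can be modeled as a double-write-then-backfill sketch (toy Python; the class and field names are hypothetical, and only the primary copy is shown, not the secondary indexes):

```python
# Toy model of the online phases: create the new copy, double-write,
# backfill pre-existing rows, then switch reads and drop the old copy.

class Table:
    def __init__(self):
        self.old_primary = {}      # id -> row
        self.new_primary = None    # (country, id) -> row, created in phase 1
        self.reads_from_new = False

    def begin_pk_change(self):
        # Phases 1-2: create the new copy; writes now go to both.
        self.new_primary = {}

    def write(self, row):
        self.old_primary[row["id"]] = row
        if self.new_primary is not None:  # double-write phase
            self.new_primary[(row["country"], row["id"])] = row

    def backfill(self):
        # Phase 3: copy rows that predate the double-write phase;
        # rows already double-written are left alone.
        for row in self.old_primary.values():
            self.new_primary.setdefault((row["country"], row["id"]), row)

    def finish(self):
        # Phase 4: switch reads to the new copy, delete the old one.
        self.reads_from_new = True
        self.old_primary = {}

t = Table()
t.write({"id": 1, "country": "uk"})   # exists before the schema change
t.begin_pk_change()
t.write({"id": 2, "country": "us"})   # captured by the double-write
t.backfill()                          # picks up the pre-existing row
t.finish()
print(sorted(t.new_primary))          # [('uk', 1), ('us', 2)]
```

The key invariant is that the backfill only needs to cover rows that existed before the double-write phase began; everything written afterwards is captured by the write path itself.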

@benesch
Contributor Author

benesch commented Mar 13, 2018

It depends on whether there's a requirement for this to happen online.

I'd be sad if changing a table's primary key took the table offline. That said, it would still be a big improvement over our current "just dump it to CSV and reimport it!" recommendation.

@petermattis
Collaborator

I've been assuming that any schema change needs to be asynchronous. Let's not lower this bar yet.

@dianasaur323
Contributor

We talked about this yesterday briefly, but just one data point from a customer who wanted to change column types generally (not just primary keys): they were okay with this locking up a table while the change is going on. I don't fully understand the term backfilling, but are we also using this term to describe the faster process we've been talking about re doing something with sstables (vague because I only get 20% of it)? If not, I wonder if someone would want a fast but offline primary key swap versus a slow but online primary key swap.

@petermattis
Collaborator

okay with this locking up a table while the change is going on

I'm not sure I believe this. Well, I believe the customer has stated that locking the table is ok, but we've also heard many times that async schema changes massively reduce operational burden.

@dianasaur323
Contributor

I think you're right - it's definitely 100x better to have async schema changes. That being said, if one takes way way longer than the other, perhaps it changes our decision? Either way, this is obviously an engineering decision, but just wanted to provide that single relevant data point here.

@vivekmenezes
Contributor

vivekmenezes commented Mar 15, 2018

I think this is a low impact high effort issue. This change is valuable to a customer who is

  1. running in production
  2. running globally
  3. running with partitioning
  4. wants to change the primary key

I think number 4 in particular is rare.

I'll also bring up that when we do this we need to first refactor the codebase to accommodate this change (it should be clear that the code can be modifying two tables). It's very easy to make this change and end up with what we had with the sql executor, but this time on the SQL DML code paths.

I'm just saying that we better think about the weight of this change on the stability of the system and the priority of it for our customers for 2.1.

@bdarnell
Contributor

I agree that it's high effort, but I think you're understating the impact. We have at least two features where the primary key is crucial (partitioning and interleaving), and even without that people sometimes want to change their primary keys (existing databases all have this feature). It's unrealistic to expect people to be able to choose the correct PK when first creating the table.

okay with this locking up a table while the change is going on

I'm not sure I believe this. Well, I believe the customer has stated that locking the table is ok, but
we've also heard many times that async schema changes massively reduce operational burden.

If taking the table offline is acceptable, then an export/edit/import process would work. I think if we're going to do this we should do the work to make it async/online.

We talked about this yesterday briefly, but just one data point from a customer who wanted to change column types generally (not just primary keys)

Changing types of non-PK columns is a separate matter. It's much easier, and I think it's higher priority than changing PKs.


@Freeaqingme

This item only shows examples where the old primary key is removed first. As outlined in #40771 there's also a scenario possible where a table was defined without a primary key in the first place, and as such nothing was deleted.

@jordanlewis I'm perfectly fine with closing the issue I mentioned above. Should this issue also be given the label for postgres compatibility, or would that be counterproductive?

@jordanlewis jordanlewis added the A-sql-pgcompat Semantic compatibility with PostgreSQL label Sep 26, 2019
@jordanlewis
Member

In CockroachDB a table always has a primary key behind the scenes, so this issue does cover your case as well. Thanks for noticing the missing compat label - I've added it.

@fire

fire commented Mar 15, 2020

Can this issue be closed once https://www.cockroachlabs.com/docs/v20.1/alter-primary-key.html is stable?

@awoods187
Contributor

Yes! In fact, I'm going to go ahead and close this since it will be addressed in 20.1.

rafiss added a commit to rafiss/cockroach that referenced this issue Mar 26, 2020
Recently, cockroachdb#35879 and cockroachdb#19141 were closed, so new tests began passing

Release justification: non-production change
Release note: None
craig bot pushed a commit that referenced this issue Mar 30, 2020
46625: kv/kvserver: don't bump ReadTimestamp on EndTxn batch with bounded read latches r=nvanbenschoten a=nvanbenschoten

This commit prevents an error-prone situation where requests with an
EndTxn request that acquire read-latches at bounded timestamps were
allowed to move their ReadTimestamp to their WriteTimestamp without
re-acquiring new latches. This could allow the request to read at an
unprotected timestamp and forgo proper synchronization with writes with
respect to its lock-table scan and timestamp cache update.

This relates to 11bffb2, which made a similar fix to server-side
refreshes, but not to this new (as of this release) pre-emptive
ReadTimestamp update mechanism.

This bug was triggering the following assertion when stressing
`TestKVNemesisSingleNode`:

```
F200326 15:56:11.350743 1199 kv/kvserver/concurrency/concurrency_manager.go:261  [n1,s1,r33/1:/{Table/50/"60…-Max}] caller violated contract: discovered non-conflicting lock
```

The batch that hit this issue looked like:
```
Get [/Table/50/"a1036fd0",/Min),
Get [/Table/50/"e54a51a8",/Min),
QueryIntent [/Table/50/"aafccb40",/Min),
QueryIntent [/Table/50/"be7f37c9",/Min),
EndTxn(commit:true tsflex:true)
```

This commit adds two subtests to `TestTxnCoordSenderRetries` that
create this scenario.

Release note (bug fix): A rare bug causing an assertion failure was
fixed. The assertion error message was "caller violated contract:
discovered non-conflicting lock".

Release justification: fixes a serious bug that could crash a server.
Additionally, the bug could have theoretically allowed isolation
violations between transactions without hitting the assertion, though
this was never observed in practice.

cc. @danhhz 

46636: roachtest: update blacklists after new fixes r=rafiss a=rafiss

Recently, #35879 and #19141 were closed, so new tests began passing

Release justification: non-production change
Release note: None

46732: build: disable required release justifications r=jordanlewis a=otan

release-20.1 is cut.

Release note: None

46733: licenses: Update BSL change date for master/20.2 r=bdarnell a=otan

Release note: None

Release justification: N/A

Co-authored-by: Nathan VanBenschoten <[email protected]>
Co-authored-by: Rafi Shamim <[email protected]>
Co-authored-by: Oliver Tan <[email protected]>