
Inconsistent placement rules when retrying truncating tables/partitions #31540

Open · Tracked by #18030
xhebox opened this issue Jan 11, 2022 · 9 comments
Assignees: xhebox
Labels: affects-5.4, feature/developing, severity/moderate, sig/sql-infra, type/bug

Comments

@xhebox (Contributor) commented Jan 11, 2022

Bug Report

tidb/ddl/table.go, lines 603 to 611 at commit 4f30a14:

    var oldPartitionIDs []int64
    if tblInfo.GetPartitionInfo() != nil {
        oldPartitionIDs = getPartitionIDs(tblInfo)
        // We use the new partition ID because all the old data is encoded with the old partition ID, it can not be accessed anymore.
        err = truncateTableByReassignPartitionIDs(t, tblInfo)
        if err != nil {
            return ver, errors.Trace(err)
        }
    }

tidb/ddl/partition.go, lines 1177 to 1192 at commit aa7ad03:

    newPartitions := make([]model.PartitionDefinition, 0, len(oldIDs))
    for _, oldID := range oldIDs {
        for i := 0; i < len(pi.Definitions); i++ {
            def := &pi.Definitions[i]
            if def.ID == oldID {
                pid, err1 := t.GenGlobalID()
                if err1 != nil {
                    return ver, errors.Trace(err1)
                }
                def.ID = pid
                // Shallow copy only use the def.ID in event handle.
                newPartitions = append(newPartitions, *def)
                break
            }
        }
    }

Since truncating a table/partition allocates the new IDs just before the placement rule operations, every retried job produces placement rules for a different set of IDs. This breaks idempotence.
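To make the failure mode concrete, here is a minimal, self-contained sketch (toy stand-ins, not the real TiDB types): because the replacement ID is generated inside the execution step, each retry of the job pushes a placement rule for a different ID, so the orphaned rules from earlier attempts pile up.

```go
package main

import "fmt"

// nextGlobalID stands in for the meta global ID allocator.
var nextGlobalID int64 = 100

func genGlobalID() int64 {
	nextGlobalID++
	return nextGlobalID
}

// runTruncateJob mimics one execution attempt of a truncate job: it allocates
// the replacement ID and then pushes a placement rule for that ID.
func runTruncateJob() int64 {
	newID := genGlobalID()
	fmt.Printf("set placement rule for id %d\n", newID)
	return newID
}

func main() {
	first := runTruncateJob()
	// A write conflict forces a retry; the rule written for `first` is now
	// orphaned because the retry allocates a fresh ID.
	second := runTruncateJob()
	fmt.Println("retry reused the same id:", first == second) // prints false
}
```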

This was reported by @CalvinNeo while testing TiFlash placement rules; the retries are caused by write conflicts between concurrent DDL jobs.

While this is a correctness problem, retries only happen under heavy DDL load, and placement rules are still an experimental feature, so it does not need to be classified as a critical or major bug.

1. Minimal reproduce step (Required)

Start many concurrent sessions that execute TRUNCATE DDL statements.

2. What did you expect to see? (Required)

Write conflicts lead to retries, but the retried jobs use the same IDs.

3. What did you see instead (Required)

Write conflicts lead to retries, and the retried jobs allocate different IDs each time.

4. What is your TiDB version? (Required)

xhebox added the type/bug, severity/moderate, and feature/developing labels on Jan 11, 2022
@xhebox (Contributor Author) commented Jan 11, 2022

There are two possible solutions in my mind:

  1. Move the ID allocation into the initial submit phase of the DDL job.
  2. Add an intermediate state to the truncation job, splitting the ID allocation and the placement rule updates into two phases.

I prefer solution 1: it is simple, and compatibility is achievable with careful handling of the job arguments. Solution 2 is better for compatibility since it needs no job argument changes, but it requires a new intermediate state, which differs from previous DDLs.
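To illustrate solution 1, here is a minimal, self-contained sketch (simplified stand-ins for the job struct and the ID allocator, not the real TiDB API): the replacement IDs are allocated once, when the job is built and submitted, and carried in the job arguments, so a retried execution reuses exactly the same IDs.

```go
package main

import "fmt"

// job is a simplified stand-in for model.Job: only the fields needed here.
type job struct {
	tp   string
	args []interface{}
}

var nextGlobalID int64 = 100

// genGlobalID stands in for the meta global ID allocator.
func genGlobalID() int64 {
	nextGlobalID++
	return nextGlobalID
}

// buildTruncatePartitionJob pre-allocates the new partition IDs at submit
// time, so the worker never has to call the allocator during execution.
func buildTruncatePartitionJob(oldIDs []int64) *job {
	newIDs := make([]int64, 0, len(oldIDs))
	for range oldIDs {
		newIDs = append(newIDs, genGlobalID())
	}
	return &job{tp: "truncate partition", args: []interface{}{oldIDs, newIDs}}
}

// runJob mimics one execution attempt: it only reads IDs from the job args,
// so a retry sends placement rules for exactly the same IDs.
func runJob(j *job) {
	newIDs := j.args[1].([]int64)
	fmt.Println("set placement rules for ids", newIDs)
}

func main() {
	j := buildTruncatePartitionJob([]int64{11, 12})
	runJob(j) // first attempt
	runJob(j) // retry after a write conflict: same IDs, so it is idempotent
}
```

The compatibility concern then mostly reduces to the args layout: an older job whose args carry only the old IDs would still need to fall back to the current allocate-on-execute path.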

xhebox added the affects-5.4 label on Jan 11, 2022
@lcwangchao (Collaborator) commented

> There are two possible solutions in my mind:
>
>   1. Move the ID allocation into the initial submit phase of the DDL job.
>   2. Add an intermediate state to the truncation job, splitting the ID allocation and the placement rule updates into two phases.
>
> I prefer solution 1: it is simple, and compatibility is achievable with careful handling of the job arguments. Solution 2 is better for compatibility since it needs no job argument changes, but it requires a new intermediate state, which differs from previous DDLs.

For solution 1, we still need to handle some corner cases. Because DDL operations run concurrently before they are enqueued, the number of partitions can change after the new partition IDs have been allocated. Although this may not be easy to trigger, I think we should add some checks...
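As a rough illustration of such a check (hypothetical helper name, not the real implementation), the worker could verify at execution time that every partition ID captured at submit time still exists, and cancel the job cleanly if the partition set has changed in between:

```go
package main

import "fmt"

// checkTruncateArgs is a hypothetical validation step: confirm that every
// partition ID recorded at submit time still exists when the job runs,
// since concurrent DDL may have changed the partition set in between.
func checkTruncateArgs(currentIDs, submittedIDs []int64) error {
	current := make(map[int64]struct{}, len(currentIDs))
	for _, id := range currentIDs {
		current[id] = struct{}{}
	}
	for _, id := range submittedIDs {
		if _, ok := current[id]; !ok {
			return fmt.Errorf("partition %d no longer exists; cancel the job instead of truncating blindly", id)
		}
	}
	return nil
}

func main() {
	// Partition 13 was dropped by a concurrent DDL after the job was submitted.
	fmt.Println(checkTruncateArgs([]int64{11, 12}, []int64{11, 12, 13}))
}
```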

@CalvinNeo (Member) commented

If redundant PD rules eventually accumulate for a TiFlash node A, and PD cannot schedule regions away from TiFlash A, it may cause a failure when scaling in.

xhebox self-assigned this on Jan 11, 2022
@xhebox (Contributor Author) commented Jan 11, 2022

> For solution 1, we still need to handle some corner cases. Because DDL operations run concurrently before they are enqueued, the number of partitions can change after the new partition IDs have been allocated. Although this may not be easy to trigger, I think we should add some checks...

From the code, the partitions to be truncated are decided at submit/enqueue time; pids is what carries that decision:

Args: []interface{}{pids},

So we do not actually truncate partitions that are added after the job is submitted. I don't think that is a problem, unless you consider the current behavior itself a bug.

EDIT: TruncateTable does have the problem, though: https://github.com/pingcap/tidb/blob/master/ddl/table.go#L603-L611
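To spell that out, a small self-contained sketch (toy stand-in for the job args decoding, not the real job.DecodeArgs API): whatever partition set existed at submit time is frozen into the args, so a partition added concurrently is simply never seen by this job.

```go
package main

import "fmt"

// job is a toy stand-in for model.Job.
type job struct{ args []interface{} }

// decodeArgs mimics decoding the single []int64 argument of the
// truncate-partition job.
func (j *job) decodeArgs(out *[]int64) {
	*out = j.args[0].([]int64)
}

func main() {
	// pids recorded when the DDL statement was submitted/enqueued.
	j := &job{args: []interface{}{[]int64{11, 12}}}

	// Even if a partition 13 is added concurrently before execution,
	// the worker only ever sees the frozen set below.
	var pids []int64
	j.decodeArgs(&pids)
	fmt.Println("partitions to truncate:", pids) // always [11 12]
}
```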

@xhebox (Contributor Author) commented Jan 11, 2022

> If redundant PD rules eventually accumulate for a TiFlash node A, and PD cannot schedule regions away from TiFlash A, it may cause a failure when scaling in.

It does not affect the behavior, at least for the current code base, as long as the IDs are allocated monotonically increasing. But it does hurt PD's performance if there are hundreds of redundant PD rules.

@CalvinNeo (Member) commented Jan 11, 2022

>> If redundant PD rules eventually accumulate for a TiFlash node A, and PD cannot schedule regions away from TiFlash A, it may cause a failure when scaling in.
>
> It does not affect the behavior, at least for the current code base, as long as the IDs are allocated monotonically increasing. But it does hurt PD's performance if there are hundreds of redundant PD rules.

In the current version of the TiFlash Cluster Manager (the old one that is still in use), all PD rules are set when the TableID is allocated, so no redundant PD rules result.

However, once the Cluster Manager is moved into the TiDB server, PD rules will be set immediately while handling the DDL job, which can result in redundant PD rules.

I think these two behaviors are different.

@xhebox (Contributor Author) commented Jan 11, 2022

> I think these two behaviors are different.

Yeah, but you misinterpreted me. I mean the redundant rules do not affect PD scheduling: since the IDs are allocated monotonically increasing, the redundant rules are just no-op rules.

@CalvinNeo (Member) commented

>> I think these two behaviors are different.
>
> Yeah, but you misinterpreted me. I mean the redundant rules do not affect PD scheduling: since the IDs are allocated monotonically increasing, the redundant rules are just no-op rules.

So a redundant table-%v-r rule set for TiFlash actually targets a non-existent table, and PD will not schedule any regions for it in the first place?

@xhebox (Contributor Author) commented Jan 11, 2022

> So a redundant table-%v-r rule set for TiFlash actually targets a non-existent table, and PD will not schedule any regions for it in the first place?

Yes, rules are applied to regions. If there are no regions (no finished DDL has generated regions for that ID), the rules are, well, redundant, as the name says, and they do not affect scheduling at all.

jebter added the sig/sql-infra label on Aug 1, 2023