(5.7) PS-7806: Column compression breaks async replication on PS #5161
Conversation
Force-pushed from 33380ab to f36687e
@@ -1353,6 +1355,7 @@ ha_innopart::open(
if (m_ins_node_parts == NULL
|| m_upd_node_parts == NULL
|| m_blob_heap_parts == NULL
|| m_compress_heap_parts == NULL
Formatting
Fixed. This file is inconsistent in its use of tabs and spaces.
Force-pushed from f36687e to 601abac
LGTM
https://jira.percona.com/browse/PS-7806
Work based on the original patch for 8.0 by Nitendra Bhosle.
Problem:
When a statement against a partitioned table containing compressed BLOB columns is replicated, the replica stops with an error, e.g. DELETE FROM t1 WHERE d2 = 0.00000;
Cause:
Queries like the DELETE above are executed in the following way:
1. An index scan is performed.
2. Every matching row is deleted.
3. The query is rewritten, using all columns in the WHERE clause.
For partitioned tables, during the ordered scan we read (and keep) the next record from every partition, then order the cached records and return the first one in order (the logic in Partition_helper::handle_ordered_index_scan()). However, a common prebuilt->compression_heap is shared by all partitions and is emptied before every row read. As a result, rows cached for individual partitions are freed and overwritten by the next partition's row during the row-read loop in Partition_helper::handle_ordered_index_scan(). The query is then binlogged, but the BLOB pointer may be invalid, pointing to overwritten memory, so the rewritten query contains a wrong value for the BLOB column.
When the replica receives it, no such row exists and replication stops.
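A minimal stand-alone C++ sketch of this failure pattern (illustrative only, not MySQL code; SharedHeap and store are made-up names): every partition's cached record points into one shared scratch buffer, so the value cached for the first partition is overwritten as soon as the next partition is read.

// Stand-in for the shared compression heap: one reusable buffer that every
// partition's read goes through.
#include <cstdio>
#include <cstring>
#include <vector>

struct SharedHeap {
  char buf[64];
  // "Decompress" a column value into the heap and hand back a pointer into
  // it, the way the real code hands back a BLOB pointer.
  const char* store(const char* value) {
    std::strncpy(buf, value, sizeof(buf) - 1);
    buf[sizeof(buf) - 1] = '\0';
    return buf;
  }
};

int main() {
  SharedHeap shared_heap;           // shared by every partition
  std::vector<const char*> cached;  // "next record" kept per partition

  // Ordered index scan: read and cache one record from each partition.
  cached.push_back(shared_heap.store("p0-blob-value"));
  cached.push_back(shared_heap.store("p1-blob-value"));  // overwrites p0's data

  // Both cached pointers now show partition 1's value.
  std::printf("partition 0 cached value: %s\n", cached[0]);
  std::printf("partition 1 cached value: %s\n", cached[1]);
}

Running this prints partition 1's value for both cached rows, which is the analogue of the wrong BLOB value ending up in the binlogged row.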
Solution:
Implemented a dedicated compression_heap for every partition, similar to the already existing blob_heap.
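For contrast, a sketch of the shape of the fix (hypothetical, simplified types; not the actual ha_innopart code): each partition gets its own heap, mirroring the existing m_blob_heap_parts array and the new m_compress_heap_parts array that is NULL-checked in ha_innopart::open() in the diff above.

#include <cstdio>
#include <cstring>
#include <vector>

struct Heap {            // stand-in for a per-partition memory heap
  char buf[64]{};
  const char* store(const char* value) {
    std::strncpy(buf, value, sizeof(buf) - 1);
    return buf;
  }
};

struct PartitionedHandler {
  std::vector<Heap> compress_heap_parts;  // one heap per partition

  bool open(size_t n_parts) {
    compress_heap_parts.resize(n_parts);  // allocate per-partition heaps
    return !compress_heap_parts.empty();  // fail open() if allocation failed
  }

  // Decompress a column of partition `part` into that partition's own heap,
  // so records cached for other partitions stay valid during the ordered scan.
  const char* read_compressed(size_t part, const char* value) {
    return compress_heap_parts[part].store(value);
  }
};

int main() {
  PartitionedHandler h;
  h.open(2);

  const char* p0 = h.read_compressed(0, "p0-blob-value");
  const char* p1 = h.read_compressed(1, "p1-blob-value");

  // Each cached pointer still refers to its own partition's data.
  std::printf("partition 0: %s\npartition 1: %s\n", p0, p1);
}

With one heap per partition, a record cached while another partition is being read is never invalidated, so the row image written to the binlog keeps the correct BLOB value.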
Note: no GCA, because we need the PS-8879 fix for this.