Improve performance of TPC-H q19 #572

Closed
andygrove opened this issue Jun 14, 2024 · 2 comments
Labels: enhancement, performance

Comments

@andygrove (Member)

What is the problem the feature request solves?

Comet is currently slower than Spark for query 19.

Some initial observations:

  • The sort merge join cannot run natively due to #398 (Support sort merge join with a join condition).
  • The Parquet scan of lineitem seems to take ~10% longer than in Spark, and 60%+ of that time is spent in native decoding, so perhaps we should add criterion benchmarks covering decoding of all the types in lineitem and look for optimization opportunities there. I tested both before and after the recent changes to this code and saw no difference.
  • Comet avoids a very expensive C2R (columnar-to-row conversion) on 600 million rows from lineitem because it applies a filter before any C2R, so it is surprising that we are still slower.
  • With Comet, there is a really slow C2R on the part table: it takes 18 seconds for 48k rows. Spark performs the C2R on 20 million rows and then filters down to 48k, and that whole process takes only 3.2 seconds.
  • Spark coalesces down to 9 partitions, and its HashAggregate takes 5.7 seconds and produces 9 rows; we disable coalesce partitions with Comet, so our HashAggregate takes 11.2 seconds and produces 200 rows. Fixing #387 (Enable Comet shuffle with AQE coalesce partitions) would help with this.

Describe the potential solution

No response

Additional context

No response

@parthchandra (Contributor)

  • The Parquet scan of lineitem seems to take ~10% longer than in Spark, and 60%+ of that time is spent in native decoding, so perhaps we should add criterion benchmarks covering decoding of all the types in lineitem and look for optimization opportunities there. I tested both before and after the recent changes to this code and saw no difference.

There is a chance that this is not in native decoding but in CometVector.getDecimal, depending on whether useDecimal128 is enabled or not. This part has a lot of data copying going on:

      byte[] bytes = getBinaryDecimal(i);                           // copies the raw bytes out of the vector
      BigInteger bigInteger = new BigInteger(bytes);                // copies the bytes again into a BigInteger
      BigDecimal javaDecimal = new BigDecimal(bigInteger, scale);   // allocates a BigDecimal wrapper
      try {
        return Decimal.apply(javaDecimal, precision, scale);        // allocates yet another object (Spark Decimal)
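
For the common case where the unscaled value fits in 64 bits, most of that allocation could in principle be skipped. Below is a minimal sketch (not Comet's actual code) of what such a fast path might look like. It assumes getBinaryDecimal returns big-endian two's-complement bytes (the format BigInteger expects in the snippet above); the method name getDecimalFast is made up for illustration, and it would rely on java.math.BigDecimal, java.math.BigInteger, and org.apache.spark.sql.types.Decimal being imported.

    // Hypothetical fast path: build the unscaled long directly from the bytes
    // and skip the BigInteger/BigDecimal allocations entirely.
    Decimal getDecimalFast(int i, int precision, int scale) {
      byte[] bytes = getBinaryDecimal(i);
      if (bytes.length <= 8 && precision <= Decimal.MAX_LONG_DIGITS()) {
        long unscaled = (bytes.length > 0 && bytes[0] < 0) ? -1L : 0L; // sign-extend
        for (byte b : bytes) {
          unscaled = (unscaled << 8) | (b & 0xFFL);
        }
        return Decimal.apply(unscaled, precision, scale);
      }
      // Fall back to the existing BigDecimal path for wider values.
      return Decimal.apply(new BigDecimal(new BigInteger(bytes), scale), precision, scale);
    }

Whether something like this would actually move the needle on q19 would need to be confirmed with a profiler first, since the comment above only suggests getDecimal as a possible culprit.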

@andygrove removed this from the 0.2.0 milestone on Aug 16, 2024
@andygrove (Member, Author)

Comet is now faster than Spark for this query, and there is no longer a C2R in the Comet plan, so I'm closing this.
