
[BUG] TPC-DS 14a and 14b failed to run #650

Closed
JustPlay opened this issue Sep 3, 2020 · 6 comments
Labels
bug Something isn't working

Comments

JustPlay commented Sep 3, 2020

TPC-DS queries 14a and 14b fail to run.

We have two machines, each with four V100 GPUs (16 GB each), for a total of 8 executors.
rapids-0.2
cuDF-0.15 with PTDS=off
GPU concurrent tasks = 2
RAPIDS batch size = 256M
For the TPC-DS dataset we use scale factor 10000 (the final Parquet data is around 3.6 TB).

I have tried multiple shuffle partition settings, from 384 to 1024; all of them failed to run those two queries.

I see RMM failures and other failures in the logs, but I want to confirm which failure is the root cause.

BTW: with RAPIDS batch size = 64M, GPU concurrent tasks = 1, and shuffle partitions = 3000, we can run both queries successfully, but the performance is very poor.
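
For reference, a minimal sketch of those working-but-slow settings expressed as actual configuration keys. The spark.rapids.* key names match the ones revans2 quotes later in this thread; the session setup itself is illustrative, not part of the original report:

```scala
import org.apache.spark.sql.SparkSession

// Illustrative only: the slow-but-stable settings described above,
// written out as Spark configuration keys.
val spark = SparkSession.builder()
  .appName("tpcds-q14")
  .config("spark.plugins", "com.nvidia.spark.SQLPlugin")  // RAPIDS Accelerator plugin
  .config("spark.rapids.sql.batchSizeBytes", "64m")       // "RAPIDS batch size = 64M"
  .config("spark.rapids.sql.concurrentGpuTasks", "1")     // "GPU concurrent tasks = 1"
  .config("spark.sql.shuffle.partitions", "3000")         // "shuffle partitions = 3000"
  .getOrCreate()
```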

The attached tar package contains the logs (please remove the .txt suffix):
tpc-ds.q14.tgz.txt

Thanks

JustPlay added the '? - Needs Triage' and 'bug' labels Sep 3, 2020
revans2 (Collaborator) commented Sep 3, 2020

From a quick look through the logs, it appears you ran out of memory on the GPU during a concat operation. I am not certain it is the same type of concat we have seen elsewhere, where all of the data must be in memory for a join or a sort, but I would guess that is the case. It sounds like we need to work on our out-of-core processing, but I am not 100% sure of that.
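
For intuition, a toy CPU-side sketch (illustrative only, not the plugin's actual code) of why a join forces this kind of concat: a hash join must materialize its entire build side before probing can start, and on the GPU that materialization is where batches get concatenated and memory can run out.

```scala
object ToyHashJoin {
  def hashJoin[K, V, W](build: Iterator[(K, V)], probe: Iterator[(K, W)]): Iterator[(K, (V, W))] = {
    // The whole build side must be resident in memory before probing begins.
    val table: Map[K, Seq[(K, V)]] = build.toSeq.groupBy(_._1)
    // The probe side, by contrast, can stream through one row at a time.
    probe.flatMap { case (k, w) =>
      table.getOrElse(k, Seq.empty).map { case (_, v) => (k, (v, w)) }
    }
  }

  def main(args: Array[String]): Unit =
    hashJoin(Iterator(1 -> "a", 2 -> "b"), Iterator(1 -> "x", 1 -> "y"))
      .foreach(println)  // (1,(a,x)) and (1,(a,y))
}
```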

JustPlay (Author) commented Sep 8, 2020

> out of core processing

What is out-of-core processing?

revans2 (Collaborator) commented Sep 8, 2020

Out-of-core processing generally refers to algorithms that can process data larger than what fits in memory.
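
To make that concrete, here is a minimal, self-contained Scala sketch of the classic out-of-core technique, an external merge sort: sort fixed-size chunks in memory, spill each sorted chunk to disk, then stream-merge the spill files. This is illustrative only, not how the plugin implements it.

```scala
import java.io.{File, PrintWriter}
import scala.io.Source
import scala.util.Random

object ExternalSort {
  // Sort the input in fixed-size chunks; only one chunk is in memory at a time.
  def sortChunksToSpillFiles(input: File, chunkSize: Int): Seq[File] =
    Source.fromFile(input).getLines().grouped(chunkSize).map { chunk =>
      val spill = File.createTempFile("spill-", ".txt")
      spill.deleteOnExit()
      val out = new PrintWriter(spill)
      chunk.map(_.toLong).sorted.foreach(out.println)
      out.close()
      spill
    }.toSeq

  // k-way merge: repeatedly emit the smallest head among the spill files.
  def merge(spills: Seq[File], output: File): Unit = {
    val out = new PrintWriter(output)
    var live = spills.map(f => Source.fromFile(f).getLines().buffered).filter(_.hasNext)
    while (live.nonEmpty) {
      out.println(live.minBy(_.head.toLong).next())
      live = live.filter(_.hasNext)
    }
    out.close()
  }

  def main(args: Array[String]): Unit = {
    // Generate a sample input file, then sort it without ever holding it all in memory.
    val input = File.createTempFile("input-", ".txt")
    val writer = new PrintWriter(input)
    (1 to 100000).foreach(_ => writer.println(Random.nextInt(1000000)))
    writer.close()
    merge(sortChunksToSpillFiles(input, chunkSize = 10000), new File("sorted.txt"))
  }
}
```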

sameerz (Collaborator) commented Sep 8, 2020

This may be related to data skew; see issue #20.

sameerz removed the '? - Needs Triage' label Sep 8, 2020
revans2 (Collaborator) commented May 3, 2021

@JustPlay

We just merged #2310, which adds initial support for out-of-core joins. It is not perfect, but it should help a lot with large joins. I have tested it on TPC-DS 14a and 14b at scale factor 200, but with a much smaller number of shuffle partitions.

```
--conf 'spark.sql.shuffle.partitions=2'
--conf 'spark.rapids.sql.concurrentGpuTasks=2'
--conf 'spark.rapids.sql.batchSizeBytes=2047m'
--conf 'spark.rapids.memory.pinnedPool.size=32g'
--conf 'spark.rapids.memory.host.spillStorageSize=16g'
```

2 shuffle partitions at scale factor 200 should be roughly equivalent to 100 shuffle partitions at scale factor 10,000, since that keeps the data per partition about the same, assuming there is not some kind of skew that only shows up at larger scale factors.
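
Spelled out, the arithmetic holds the data per shuffle partition constant, so partitions scale linearly with scale factor (an illustrative REPL-style snippet; the variable names are mine):

```scala
val testedScaleFactor = 200
val testedPartitions  = 2
val targetScaleFactor = 10000
// Same data per partition at the larger scale factor:
val equivalentPartitions =
  testedPartitions * targetScaleFactor / testedScaleFactor  // == 100
```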

If you could retest this, it would be great.

revans2 (Collaborator) commented May 18, 2021

Closing this for now, as I think it is fixed. Please reopen if you see more issues with this on the current SNAPSHOT version.

revans2 closed this as completed May 18, 2021
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this issue Nov 30, 2023