-
Notifications
You must be signed in to change notification settings - Fork 242
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GPU sample exec #3789
GPU sample exec #3789
Conversation
Signed-off-by: Chong Gao <[email protected]>
It's a draft PR.
@revans2 Help to check if it's possible that GpuRangePartitioner.computeBoundsAndClose will throw error when columnar batch is empty? |
build |
sql-plugin/src/main/scala/com/nvidia/spark/rapids/basicPhysicalOperators.scala
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/basicPhysicalOperators.scala
Show resolved
Hide resolved
Signed-off-by: Chong Gao <[email protected]>
build |
@@ -0,0 +1,38 @@ | |||
# Copyright (c) 2020-2021, NVIDIA CORPORATION. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The file is new here? then should be 2021
from pyspark.sql.types import * | ||
from marks import * | ||
|
||
_table_gen = [ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks like this PR has enabled many data types for the GpuSampleExec, but the test seems not to be covering them?
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/GpuPartitionwiseSampledRDD.scala
Show resolved
Hide resolved
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/GpuPoissonSampler.scala
Show resolved
Hide resolved
builder => | ||
(0 until numRows).foreach(_ => { | ||
val x = rng.nextDouble() | ||
val n = if ((x >= lowerBound) && (x < upperBound)) 1 else 0 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You may need "BernoulliCellSampler"
sql-plugin/src/main/scala/com/nvidia/spark/rapids/basicPhysicalOperators.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/basicPhysicalOperators.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/basicPhysicalOperators.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/GpuPartitionwiseSampledRDD.scala
Show resolved
Hide resolved
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/GpuPoissonSampler.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/GpuPoissonSampler.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/GpuPoissonSampler.scala
Outdated
Show resolved
Hide resolved
Signed-off-by: Chong Gao <[email protected]>
build |
Signed-off-by: Chong Gao <[email protected]>
build |
Signed-off-by: Chong Gao <[email protected]>
build |
…test cases; refactor code Signed-off-by: Chong Gao <[email protected]>
Addressed the comments. |
build |
sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuRangePartitioner.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/GpuPoissonSampler.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/basicPhysicalOperators.scala
Show resolved
Hide resolved
Signed-off-by: Chong Gao <[email protected]>
build |
Signed-off-by: Chong Gao <[email protected]>
build |
@revans2 help to review |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mostly just a few more nits. Looking good
sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuRangePartitioner.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/basicPhysicalOperators.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/basicPhysicalOperators.scala
Show resolved
Hide resolved
Signed-off-by: Chong Gao <[email protected]>
build |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems there are many unrelated diffs in file supported_ops.md
. Could you confirm it?
@firestarman About the supported_ops.md diff problem you mentioned. |
Created one issue to spark about the sort order: |
GPU sample exec
This fixes #3419
Signed-off-by: Chong Gao [email protected]