Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] pinned blocking alloc with spill test timeout #9924

Open
jlowe opened this issue Dec 1, 2023 · 2 comments
Open

[BUG] pinned blocking alloc with spill test timeout #9924

jlowe opened this issue Dec 1, 2023 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@jlowe
Copy link
Contributor

jlowe commented Dec 1, 2023

[2023-11-30T23:08:18.310Z] - pinned blocking alloc with spill *** FAILED ***
[2023-11-30T23:08:18.310Z]   java.util.concurrent.TimeoutException:
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.HostAllocSuite$TaskThread$TaskThreadTrackingOp.get(HostAllocSuite.scala:107)
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.HostAllocSuite$AllocOnAnotherThread.waitForAlloc(HostAllocSuite.scala:218)
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.HostAllocSuite.$anonfun$new$41(HostAllocSuite.scala:628)
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.HostAllocSuite.$anonfun$new$41$adapted(HostAllocSuite.scala:626)
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.Arm$.withResource(Arm.scala:29)
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.HostAllocSuite.$anonfun$new$40(HostAllocSuite.scala:626)
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.HostAllocSuite.$anonfun$new$40$adapted(HostAllocSuite.scala:618)
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.Arm$.withResource(Arm.scala:29)
[2023-11-30T23:08:18.310Z]   at com.nvidia.spark.rapids.HostAllocSuite.$anonfun$new$39(HostAllocSuite.scala:618)
[2023-11-30T23:08:18.310Z]   at org.scalatest.enablers.Timed$$anon$1.timeoutAfter(Timed.scala:127)
[2023-11-30T23:08:18.310Z]   ...
[2023-11-30T23:08:19.674Z] *** RUN ABORTED ***
[2023-11-30T23:08:19.674Z]   java.lang.AssertionError: Leaked 1 pinned allocations
[2023-11-30T23:08:19.674Z]   at ai.rapids.cudf.PinnedMemoryPool.close(PinnedMemoryPool.java:317)
[2023-11-30T23:08:19.674Z]   at ai.rapids.cudf.PinnedMemoryPool.shutdown(PinnedMemoryPool.java:217)
[2023-11-30T23:08:19.674Z]   at com.nvidia.spark.rapids.HostAllocSuite.beforeEach(HostAllocSuite.scala:382)
[2023-11-30T23:08:19.674Z]   at org.scalatest.BeforeAndAfterEach.runTest(BeforeAndAfterEach.scala:233)
[2023-11-30T23:08:19.674Z]   at org.scalatest.BeforeAndAfterEach.runTest$(BeforeAndAfterEach.scala:227)
[2023-11-30T23:08:19.674Z]   at com.nvidia.spark.rapids.HostAllocSuite.runTest(HostAllocSuite.scala:34)
[2023-11-30T23:08:19.674Z]   at org.scalatest.funsuite.AnyFunSuiteLike.$anonfun$runTests$1(AnyFunSuiteLike.scala:269)
[2023-11-30T23:08:19.674Z]   at org.scalatest.SuperEngine.$anonfun$runTestsInBranch$1(Engine.scala:413)
[2023-11-30T23:08:19.674Z]   at scala.collection.immutable.List.foreach(List.scala:431)
[2023-11-30T23:08:19.674Z]   at org.scalatest.SuperEngine.traverseSubNodes$1(Engine.scala:401)
[2023-11-30T23:08:19.675Z]   ...
@jlowe jlowe added bug Something isn't working ? - Needs Triage Need team to review and classify labels Dec 1, 2023
@abellina
Copy link
Collaborator

abellina commented Dec 1, 2023

I think you filed this already #9671

@revans2 revans2 self-assigned this Dec 1, 2023
@jlowe
Copy link
Contributor Author

jlowe commented Dec 1, 2023

It's a different test failing, and after asking @revans2 before filing he was not convinced they were the same root cause.

@mattahrens mattahrens removed the ? - Needs Triage Need team to review and classify label Dec 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants