API Doc for Polars GPU Engine #16753

singhmanas1 · 2024-09-05T01:48:48Z

Modified the cudf API docs to add a page on cudf pandas detailing - 1) How to use? 2) How to learn more? 3) How to try on Google Colab?

copy-pr-bot · 2024-09-05T01:48:51Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

wence-

I am not fully convinced that it makes sense to show the basic queries and install instructions, rather than just linking to the polars docs.

Rationale: this leaves us two places we need to update things if anything needs changed.

docs/cudf/source/cudf_polars/index.rst

wence- · 2024-09-05T15:12:22Z

docs/cudf/source/cudf_polars/index.rst

+.. code-block:: bash
+
+   pip install polars[gpu] --extra-index-url=https://pypi.nvidia.com 
+
+GPU-based execution can be triggered by simply running ``.collect(engine="gpu")`` instead of ``.collect()``.
+
+.. code-block:: python
+
+   # Import the necessary library
+   import polars as pl
+
+   # Define the data for the LazyFrame
+   ldf = pl.LazyFrame({
+      "a": [1.242, 1.535],
+   })
+
+   print(ldf.select(pl.col("a").round(1)).collect(engine="gpu"))
+
+
+For finer control, you can pass a GPUEngine object with additional configuration parameters to the ``engine=`` parameter.
+
+.. code-block:: python
+
+   # Import the necessary library
+   import polars as pl
+
+   # Define the data for the LazyFrame
+   ldf = pl.LazyFrame({
+      "a": [1.242, 1.535],
+   })
+
+   # Configure the GPU engine with advanced settings
+   gpu_engine = pl.GPUEngine(
+      device=0,
+      raise_on_Fail=True  # Ensure the engine fails loudly if it cannot execute on the GPU
+   )
+
+   # Execute the collection with the custom GPU engine configuration
+   print(ldf.select(pl.col("a").round(1)).collect(engine=gpu_engine))


This replicates (approximately) the information that we are maintaining on the polars site. I think the better approach is to not have that here, but to just immediately link there. Perhaps we can have some benchmark results on this landing page?

removed the installation and sample code snippet.

singhmanas1 · 2024-09-06T17:47:53Z

I am aligned with the flow. Will add the benchmarks to the page next week.

See - latest flow 093ce0c

bdice · 2024-09-09T17:54:51Z

@singhmanas1 Can you write a proper title for this PR?

Speed ups experience with Polars GPU Engine

Speed up with Polars GPU Engine for an 80 GB dataset

Added the benchmarks- 1. Query processing time versus dataset size. 2. Per query speedup for all 22 PDS-H queries

wence-

Thanks, LGTM

bdice · 2024-09-16T13:58:54Z

Why is there no CI being run here? I want to preview these docs...

raydouglass · 2024-09-16T14:00:26Z

/ok to test

bdice

A few change requests - the only blocker is the "TBD" link. Everything else can be fixed in a follow-up PR if needed.

bdice · 2024-09-16T13:55:26Z

docs/cudf/source/_static/Polars_GPU_speedup_80GB.png

Y axis label should be “Speedup (Polars CPU runtime / Polars GPU runtime)”

docs/cudf/source/cudf_polars/index.rst

bdice · 2024-09-16T14:13:37Z

docs/cudf/source/cudf_polars/index.rst

+   :width: 200px
+   :target: https://colab.research.google.com/github/rapidsai-community/showcase/blob/main/accelerated_data_processing_examples/polars_gpu_engine_demo.ipynb
+
+   Take the cuDF backend for Polars for a test-drive in a free GPU-enabled notebook environment using your Google account by `launching on Colab <TBD>`_  


Reminder to fix this before merging!

I think I fixed this. I assume it's supposed to point to https://colab.research.google.com/github/rapidsai-community/showcase/blob/main/accelerated_data_processing_examples/polars_gpu_engine_demo.ipynb. If that's incorrect please fix it.

bdice · 2024-09-16T14:17:38Z

One other change request -- where do we link to this page? It needs to be linked from the cuDF docs somewhere, it should not be an orphaned page. Maybe in https://github.com/rapidsai/cudf/blob/branch-24.10/docs/cudf/source/index.rst.

bdice · 2024-09-16T14:22:21Z

/ok to test

1. Updated benchmark with a graph of speed ups on. compute heavy queries 2. Updated text description for the graph with compute heavy queries

Minor edits to the language

Minor language edits

Added hardware configuration for the benchmark

Updated the hardware specs

…t/manas_polars_docs

brandon-b-miller · 2024-09-16T19:24:40Z

/ok to test

brandon-b-miller · 2024-09-16T19:57:46Z

/ok to test

docs/cudf/source/cudf_polars/index.rst

Manas Singh added 8 commits September 5, 2024 01:05

added index file for cudf polars

5dd765e

added Announcement and Google Colab sections

4f11aaa

Added Google Colab image

7bab15f

Added Google Colab image

311deb3

Added Google Colab image

8bf9cd2

Added Google Colab image

b7dd2d8

text alignment

6a2ffb5

change headings

2925068

added extra index to the install command

afdfe5b

wence- requested changes Sep 5, 2024

View reviewed changes

Update index.rst

093ce0c

removed the installation and sample code snippet.

wence- added the cudf.polars Issues specific to cudf.polars label Sep 9, 2024

Added polars pds benchmark

a8ac87e

github-actions bot removed the cudf.polars Issues specific to cudf.polars label Sep 11, 2024

singhmanas1 added 6 commits September 11, 2024 15:05

Delete docs/cudf/source/_static/polars_pds_benchmark.png

ac4ecf0

Added polars pds benchmark

8d27437

Add files via upload

1d21c92

Speed ups experience with Polars GPU Engine

Delete docs/cudf/source/_static/polars_GPU_speedups_80GB.png

87e9810

Add files via upload

c95f440

Speed up with Polars GPU Engine for an 80 GB dataset

Update index.rst

9048a91

Added the benchmarks- 1. Query processing time versus dataset size. 2. Per query speedup for all 22 PDS-H queries

singhmanas1 changed the title ~~Feat/manas polars docs~~ API Doc for Polars GPU Engine Sep 12, 2024

Apply suggestions from code review

f22acc7

wence- approved these changes Sep 16, 2024

View reviewed changes

bdice requested changes Sep 16, 2024

View reviewed changes

Fix links and style check.

f47a035

bdice added doc Documentation non-breaking Non-breaking change labels Sep 16, 2024

singhmanas1 and others added 13 commits September 16, 2024 10:45

Added graph with compute heavy queries

2a6a844

Delete docs/cudf/source/_static/polars_compute_heavy_query.png

9b21c66

Adding polars compute heavy queries

79ac363

Delete docs/cudf/source/_static/polars_compute_heavy_queries.png

ca3517c

Adding TPC-H benchmark results

c006acd

Updated index.rst

c395baa

1. Updated benchmark with a graph of speed ups on. compute heavy queries 2. Updated text description for the graph with compute heavy queries

Update index.rst

e39539f

Minor edits to the language

Update index.rst

4606930

Minor language edits

Update index.rst

c4ef847

Minor language edits

Update index.rst

7e71672

Minor language edits

Update index.rst

f076bd4

Added hardware configuration for the benchmark

Update index.rst

36bb894

Updated the hardware specs

qMerge remote-tracking branch 'upstream/feature/cudf-polars' into fea…

71ea75a

…t/manas_polars_docs

cudf_polars index page added to toc

bb66967

bdice approved these changes Sep 16, 2024

View reviewed changes

docs/cudf/source/cudf_polars/index.rst Outdated Show resolved Hide resolved

Update docs/cudf/source/cudf_polars/index.rst

d403419

bdice merged commit b6a110e into rapidsai:feature/cudf-polars Sep 16, 2024
5 of 6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API Doc for Polars GPU Engine #16753

API Doc for Polars GPU Engine #16753

singhmanas1 commented Sep 5, 2024

copy-pr-bot bot commented Sep 5, 2024

wence- left a comment

wence- Sep 5, 2024

singhmanas1 commented Sep 6, 2024

bdice commented Sep 9, 2024

wence- left a comment

bdice commented Sep 16, 2024

raydouglass commented Sep 16, 2024

bdice left a comment

bdice Sep 16, 2024 •

edited

Loading

bdice Sep 16, 2024

bdice Sep 16, 2024

bdice commented Sep 16, 2024

bdice commented Sep 16, 2024

brandon-b-miller commented Sep 16, 2024

brandon-b-miller commented Sep 16, 2024

API Doc for Polars GPU Engine #16753

API Doc for Polars GPU Engine #16753

Conversation

singhmanas1 commented Sep 5, 2024

copy-pr-bot bot commented Sep 5, 2024

wence- left a comment

Choose a reason for hiding this comment

wence- Sep 5, 2024

Choose a reason for hiding this comment

singhmanas1 commented Sep 6, 2024

bdice commented Sep 9, 2024

wence- left a comment

Choose a reason for hiding this comment

bdice commented Sep 16, 2024

raydouglass commented Sep 16, 2024

bdice left a comment

Choose a reason for hiding this comment

bdice Sep 16, 2024 • edited Loading

Choose a reason for hiding this comment

bdice Sep 16, 2024

Choose a reason for hiding this comment

bdice Sep 16, 2024

Choose a reason for hiding this comment

bdice commented Sep 16, 2024

bdice commented Sep 16, 2024

brandon-b-miller commented Sep 16, 2024

brandon-b-miller commented Sep 16, 2024

bdice Sep 16, 2024 •

edited

Loading