Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Enable hugepage for arrow host allocations (#13914)
This PR enables Transparent Huge Pages (THP) for large (>4MB) arrow allocations (host memory only). ### Performance results on a DGX-1 (`dgx14`) | | 8MB | 80MB | 800MB | 8GB | Method | |:--------------:|:-----:|:-----:|:-----:|:-----:|------------------------------------| | cudf-native | 0.006 | 0.049 | 0.485 | 4.787 | `df.to_arrow()` (branch-23.10) | | Dask-serialize | 0.004 | 0.032 | 0.310 | 3.122 | `distributed.protocol.serialize(df)` | | cudf-hugepage | 0.004 | 0.030 | 0.299 | 3.046 | `df.to_arrow()` (this PR) | | speedup | 1.5 | 1.63 | 1.62 | 1.57 | cudf-native vs. cudf-hugepage | Notice, Dask-serialize also use THP, which is why its performance is on par with cudf-hugepage. Authors: - Mads R. B. Kristensen (https://github.com/madsbk) Approvers: - Mark Harris (https://github.com/harrism) - Vukasin Milovanovic (https://github.com/vuule) URL: #13914
- Loading branch information