[FEA] Provide a way to specify the maximum allowable precision for integers/floats #10558

shwina · 2022-03-31T18:28:33Z

Is your feature request related to a problem? Please describe.

GPU memory is a valuable resource, and using int64/float64 columns where int32/float32 would suffice means using 2x as much memory unnecessarily. As opposed to scientific computing, 32-bit data types (or lower) are sufficient for many data science applications.

Even only 32-bit data types as inputs, the resulting output can be a 64-bit type:

>>> cudf.Series([1, 2, 3], dtype="int32") + cudf.Scalar(1, dtype="float32")
0    2.0
1    3.0
2    4.0
dtype: float64

(this is consistent with Pandas and NumPy)

Describe the solution you'd like

It would be nice to be able to specify a maximum bitwidth for integer/floating types. If an operation would result in a value greater than could be accommodated, simply overflowing would be acceptable.

This could be another use case for cudf.config.

Describe alternatives you've considered

The user can carefully cast results back from 64bit to 32bit to reduce memory usage, but this is tedious and does not help with peak memory usage.

github-actions · 2022-04-30T19:02:56Z

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

github-actions · 2022-09-26T00:16:02Z

This issue has been labeled inactive-90d due to no recent activity in the past 90 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

shwina added feature request New feature or request Needs Triage Need team to review and classify labels Mar 31, 2022

shwina added Python Affects Python cuDF API. and removed Needs Triage Need team to review and classify labels Mar 31, 2022

github-actions bot added the inactive-30d label Apr 30, 2022

isVoid self-assigned this Jun 28, 2022

This was referenced Jun 30, 2022

[FEA] Allow Configuring Default Bit Width Used for Json and Csv Readers #11182

Closed

Add cudf.options #11193

Merged

github-actions bot added the inactive-90d label Sep 26, 2022

GregoryKimball removed the inactive-90d label Apr 3, 2023

vyasr removed the inactive-30d label Feb 23, 2024

vyasr added this to cuDF Python Nov 5, 2024

github-project-automation bot moved this to Todo in cuDF Python Nov 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Provide a way to specify the maximum allowable precision for integers/floats #10558

[FEA] Provide a way to specify the maximum allowable precision for integers/floats #10558

shwina commented Mar 31, 2022 •

edited

Loading

github-actions bot commented Apr 30, 2022

github-actions bot commented Sep 26, 2022

[FEA] Provide a way to specify the maximum allowable precision for integers/floats #10558

[FEA] Provide a way to specify the maximum allowable precision for integers/floats #10558

Comments

shwina commented Mar 31, 2022 • edited Loading

github-actions bot commented Apr 30, 2022

github-actions bot commented Sep 26, 2022

shwina commented Mar 31, 2022 •

edited

Loading