[PERF/ENH] Series.map
sorts a larger dataset than it needs to
#14485
Labels
no-oom
Reducing memory footprint of cudf algorithms
Performance
Performance related issue
Python
Affects Python cuDF API.
Series.map
which substitutes values inself
that match some key with its corresponding value does:So the order is the same as the input.
This has two pessimisations:
s
, we can get away withsort_by_key
ofres["s"]
rather than sorting a multi-column dataframeThe text was updated successfully, but these errors were encountered: