[FEA] Attempt to JIT GroupBy.apply
functions by default and fall back to iterative algorithm
#13103
Labels
GroupBy.apply
functions by default and fall back to iterative algorithm
#13103
With #11452 we introduced a framework for JIT compiling groupby UDFs with numba, along with the
GroupBy.apply
engine='jit'
kwarg. This is an o.k. approach since generally we are alright with introducing things that are a superset of the Pandas API.Recently we've discussed changing things so that when a user uses
GroupBy.apply
we try and JIT the UDF first and if it doesn't work, then fall back to the iterative method. This would provide a unified API with less to learn for users and no wondering if the UDF conforms to the restrictions on JIT apply. It also provides an easier internal interface for features that build on top ofGroupBy.apply
, such asfilter
. However it introduces JIT overhead to workflows that ultimately won't even use it. This is not ideal, but iterative groupby apply is pretty slow already.The text was updated successfully, but these errors were encountered: