-
-
Notifications
You must be signed in to change notification settings - Fork 18.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: add to cython groupby functions #4095
Comments
@jreback can I take this? If it's for 0.13 you have some time and I'd be really interested in a chance to dig into more of the Cython internals. |
go for it! you will need to add a function template in src/generate_code.py and add the appropriate places in groupby.py |
@jreback cool - thanks :) I'm looking forward to figuring all that out |
@jreback were you thinking this would cover time series shifting too? |
yes the index type actually doesn't matter though its based on position shiftting (well the time-series stuff happens at a high level and is just translated to positions to move anyhow) |
@jreback - if it's already translate to positions to move, then that makes On Tue, Jul 9, 2013 at 7:34 PM, jreback [email protected] wrote:
|
yep...look at pandas/core/frame/shift.... |
I've let this slide... I will try to circle back to this when I have a chance, but if someone else wants to take it go go go |
I've started looking at this one. |
@jreback I have a good start here on Currently I've done this by creating a list of ops that won't compress (currently just |
these are kind of like (and I think should be implemented like) transform ops another example is fillna can u show a branch that u have so far? |
Alright, that helps. Let me get this cleaned up a little bit and I'll start a PR so that we can look at real code. |
closed by #10901 |
Looks like there might be a problem http://stackoverflow.com/questions/33903624/python-pandas-memoryerror-skiplist-failed-when-using-rolling-median-and-apply |
@jreback sorry to revive this but in my dataframe, a simple
takes just forever. Is this just because the data is big, or this is related to the old problems that are mentioned here? Happy to help if I can! |
anything with a lambda function will by definition be slow its basically a python loop. but what you are doing is NOT a transform, which must return a scalar per group. |
you probably want
|
Thanks Jeff
Wait but my understanding is that
|
that's what I said |
got it thanks. by the way |
I think I will write a book soon: the 100 most common errors every Pandas user has to make bestseller on amazon for sure |
The text was updated successfully, but these errors were encountered: