-
Notifications
You must be signed in to change notification settings - Fork 370
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
group function #815
group function #815
Conversation
Needs tests and docs... |
I have added tests & a doc. I have added a skipna option : should observations with some column equal to NA grouped into a NA group? The option defaults to true. This returns NA for any observation with a original column equals to NA, which is generally what the user wants (especially when using |
Group using Uint32
remove poolall
change group in docs
Change name in test + add refs type test
add == for test
I have updated my commit to solve a bug I spotted in df = DataFrame(v1 = pool(1:1000), v2 = pool(fill(1, 1000)))
groupby(df, [:v1, :v2]) Grouping by |
I think it's good on my end now. |
The group function allows to combine multiple columns of a dataframe into one PooledDataArray column. A user case is to create groups based on multiple variables before fitting a model.
The function
group
basically rewraps code fromgroupby
.I'm not sure whether this functionality already exists or not.