API: specification of functions in .agg #8593

jreback · 2014-10-21T11:57:10Z

pd.Summary
dict-of-dict DOC: Dict of Dicts for renaming Groupby Aggregations #9052
named functions feature request: Support for 'named' lambda functions in DataFrame.agg([]) #10100

From SO

In [100]: df = pd.DataFrame({'A': ['group1', 'group1', 'group2', 'group2', 'group3', 'group3'],
                   'B': [10, 12, 10, 25, 10, 12],
                   'C': [100, 102, 100, 250, 100, 102]})

In [101]: df
Out[101]: 
        A   B    C
0  group1  10  100
1  group1  12  102
2  group2  10  100
3  group2  25  250
4  group3  10  100
5  group3  12  102

In [102]: df.groupby('A').agg(['mean',lambda x: x.iloc[1]])
Out[102]: 
           B               C          
        mean  <lambda>  mean  <lambda>
A                                     
group1  11.0        12   101       102
group2  17.5        25   175       250
group3  11.0        12   101       102

Would be nice to be able to use lambda x: x.nth(1) or maybe 'nth(1)'
In [103]: df.groupby('A').agg(['mean','nth(1)'])


jreback · 2015-05-12T11:26:22Z

@shoyer here's my use case. This is a specification engine (similar in concept to pd.Grouper) that lets one easily specify the ordering of output summary values (e.g. no need for a dict-like) and bind metadata to the function (mostly a name) without too much boilerplate. It should be fully back-compat.

S = pd.Summary

# 8593
df.groupby('A').agg([S('mean',name='nth(1)')])

# 10100
dfAggregated = grouped.agg([np.mean,np.std,S(lambda v: v.mean()/v.max(),name='normalized mean')])

# 9052
df.groupby('B').agg({ 'A': [S('mean',name='mean1'),S('median',name='med1')],
                      'C': [S('mean',name='mean2'),S('median',name='med2')],
                    })

# maybe could also do
df.groupby('B').agg([ S('mean',column='A',name='mean1'),
                      S('median',column='A',name='med1'),
                      S('mean',column='C',name='mean2'),
                      S('median',column='C',name='med2')
              ])
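For comparison, pd.Summary above is a proposal and not an existing pandas API. A rough sketch of how a custom label can be attached today, assuming pandas takes the output column label from the callable's `__name__` when `.agg` receives a list (the helper name `second` is made up for illustration):

```python
import pandas as pd

df = pd.DataFrame({
    'A': ['group1', 'group1', 'group2', 'group2', 'group3', 'group3'],
    'B': [10, 12, 10, 25, 10, 12],
    'C': [100, 102, 100, 250, 100, 102],
})


def second(x):
    """Return the value at position 1 of each group's column."""
    return x.iloc[1]


# Assumption: the label shown in the result comes from __name__,
# so overriding it replaces the default function name in the output.
second.__name__ = 'nth(1)'

named = df.groupby('A').agg(['mean', second])
print(named)
```

This only covers naming; it does not address output ordering across columns or binding other metadata, which the proposal also aims at.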

RylanSchaeffer · 2020-04-16T19:24:49Z

I know this isn't the place, but how does one use nth() with a groupby's .agg() method?
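A minimal sketch of two options that seem to apply here, reusing the frame from the original post: calling `nth` directly on the groupby, or passing a positional lambda to `.agg` as in the first example. The shape of the index returned by `nth` has differed between pandas versions, so only the values are relied on below.

```python
import pandas as pd

df = pd.DataFrame({
    'A': ['group1', 'group1', 'group2', 'group2', 'group3', 'group3'],
    'B': [10, 12, 10, 25, 10, 12],
    'C': [100, 102, 100, 250, 100, 102],
})

# Option 1: nth is a groupby method, so call it on the groupby object
# rather than inside .agg.
picked = df.groupby('A').nth(1)

# Option 2: inside .agg, select the second row by position.
via_agg = df.groupby('A').agg(lambda x: x.iloc[1])

print(picked['B'].tolist())
print(via_agg['B'].tolist())
```

One difference to keep in mind: for a group with fewer than two rows, `nth(1)` simply yields no row for that group, while `x.iloc[1]` raises an `IndexError`.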

jbrockmendel · 2023-02-22T22:15:49Z

Discussed on today's dev call, main comments were 1) does NamedAgg cover some of this? 2) "No" to doing eval on "nth(1)", 3) pd.Grouper is one of the harder-to-reason-about parts of the codebase and we don't want more things like it. Closing as no action.
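To illustrate point 1, here is a sketch of how `pd.NamedAgg` (available since pandas 0.25) can express per-column names and output ordering for the example frame. The output names `b_mean`, `b_second`, and `c_median` are chosen here for illustration only.

```python
import pandas as pd

df = pd.DataFrame({
    'A': ['group1', 'group1', 'group2', 'group2', 'group3', 'group3'],
    'B': [10, 12, 10, 25, 10, 12],
    'C': [100, 102, 100, 250, 100, 102],
})

# Each keyword becomes an output column; keyword order sets column order.
result = df.groupby('A').agg(
    b_mean=pd.NamedAgg(column='B', aggfunc='mean'),
    b_second=pd.NamedAgg(column='B', aggfunc=lambda x: x.iloc[1]),
    c_median=pd.NamedAgg(column='C', aggfunc='median'),
)
print(result)
```

This yields flat column names rather than the two-level columns produced by passing a list to `.agg`.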

jreback added Groupby API Design labels Oct 21, 2014

jreback added this to the 0.15.1 milestone Oct 21, 2014

jreback modified the milestones: 0.16.0, 0.15.2 Nov 29, 2014

jreback mentioned this issue Dec 10, 2014

DOC: Dict of Dicts for renaming Groupby Aggregations #9052

Closed

jreback mentioned this issue Mar 4, 2015

ENH: groupby aggregate with multi-level columns #9585

Open

jreback modified the milestones: 0.16.0, Next Major Release Mar 6, 2015

jreback added the Master Tracker High level tracker for similar issues label May 11, 2015

jreback mentioned this issue May 11, 2015

feature request: Support for 'named' lambda functions in DataFrame.agg([]) #10100

Closed

jorisvandenbossche mentioned this issue Jan 23, 2017

ENH: add Series & DataFrame .agg/.aggregate #14668

Merged

4 tasks

jreback modified the milestones: Next Major Release, High Level Issue Tracking Sep 24, 2017

TomAugspurger removed the Master Tracker High level tracker for similar issues label Jul 6, 2018

TomAugspurger removed this from the High Level Issue Tracking milestone Jul 6, 2018

jbrockmendel added the Apply Apply, Aggregate, Transform, Map label Dec 1, 2019

mroeschke added the Enhancement label Jun 28, 2020

mroeschke removed API Design Groupby labels Apr 11, 2021

jbrockmendel added the Closing Candidate May be closeable, needs more eyeballs label Feb 11, 2023

jbrockmendel closed this as completed Feb 22, 2023
