BUG/TST: verify that groupby apply with a column aggregation does not return the column #7002

jreback · 2014-04-29T20:26:56Z

related #7000

In [1]:  df = DataFrame({'foo1' : ['one', 'two', 'two', 'three', 'one', 'two'],
                                      'foo2' : np.random.randn(6)})

In [2]: df
Out[2]: 
    foo1      foo2
0    one  1.006666
1    two  0.002063
2    two  1.507785
3  three  1.865921
4    one  0.141202
5    two -1.079792

[6 rows x 2 columns]

In [3]: df.groupby('foo1').mean()
Out[3]: 
           foo2
foo1           
one    0.573934
three  1.865921
two    0.143352

[3 rows x 1 columns]

In [4]: df.groupby('foo1').apply(lambda x: x.mean())
Out[4]: 
           foo2
foo1           
one    0.573934
three  1.865921
two    0.143352

[3 rows x 1 columns]

This should return the foo1 column as well

[6]: df.groupby('foo1',as_index=False).apply(lambda x: x.mean())
Out[6]: 
       foo2
0  0.573934
1  1.865921
2  0.143352

[3 rows x 1 columns]

In [7]: df.groupby('foo1',as_index=False).mean()
Out[7]: 
    foo1      foo2
0    one  0.573934
1  three  1.865921
2    two  0.143352

[3 rows x 2 columns]

The text was updated successfully, but these errors were encountered:

jonathanlb · 2016-10-14T22:26:59Z

Extending the example to
df.groupby('foo1').apply(lambda x: x.sum())
produces a DataFrame with column foo1 values oneone three twotwotwo

Should that behavior be preserved?

OmerJog · 2019-01-22T08:26:54Z

Should that behavior be preserved?

Why not?

drburrow · 2020-04-15T18:24:17Z

If nobody is currently working on this, I'd like to look more into it.

fangchenli · 2020-06-08T05:56:41Z

I guess this could be closed.

df = pd.DataFrame({'foo1': ['one', 'two', 'two', 'three', 'one', 'two'],
  ...:                    'foo2': np.random.randn(6)})

df
Out[4]: 
    foo1      foo2
0    one  0.404196
1    two -0.484634
2    two  1.033869
3  three -0.368001
4    one -2.506380
5    two  0.807768

df.groupby('foo1',as_index=False).apply(lambda x: x.mean())
Out[5]: 
    foo1      foo2
0    one -1.051092
1  three -0.368001
2    two  0.452334

df.groupby('foo1',as_index=False).mean()
Out[6]: 
    foo1      foo2
0    one -1.051092
1  three -0.368001
2    two  0.452334

jreback · 2020-06-08T11:04:30Z

this was recently patched and we should have a test for it

if you like to check (otherwise a PR with this test case would be ok)

…olumn (pandas-dev#7002)

fangchenli · 2020-06-08T19:49:01Z

Sorry for this many commits... It was my first PR.

…ns the column (pandas-dev#7002)" This reverts commit 2e77cef.

…olumn #7002 (#34647)

jreback added this to the 0.15.0 milestone Apr 29, 2014

jreback added Bug labels Apr 29, 2014

hayd mentioned this issue Apr 29, 2014

Consistency with groupby as_index #5755

Closed

8 tasks

jreback modified the milestones: 0.16.0, Next Major Release Mar 3, 2015

jreback added Difficulty Novice labels Feb 17, 2016

TomAugspurger added the good first issue label Oct 11, 2017

jreback removed the Difficulty Novice label Dec 15, 2017

jbrockmendel removed the Effort Low label Oct 21, 2019

mroeschke removed the Testing pandas testing functions or related to the test suite label May 22, 2020

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 8, 2020

TST: groupby apply with indexing and colunm aggregation returns the c…

2e77cef

…olumn (pandas-dev#7002)

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 8, 2020

TST: groupby apply with indexing and column aggregation returns the c…

99fee1f

…olumn (pandas-dev#7002)

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 8, 2020

TST: groupby apply with indexing and column aggregation returns the c…

e3a49fe

…olumn (pandas-dev#7002)

fangchenli mentioned this issue Jun 8, 2020

TST: groupby apply with indexing and column aggregation returns the column #7002 #34647

Merged

4 tasks

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 8, 2020

TST/CLN: reformat (pandas-dev#7002)

eef7d59

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 8, 2020

TST: indent (pandas-dev#7002)

cfdbcf0

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 8, 2020

TST: indent (pandas-dev#7002)

e8f181c

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 8, 2020

TST: deterministic test case (pandas-dev#7002)

0bd567c

jreback removed this from the Contributions Welcome milestone Jun 8, 2020

jreback added this to the 1.1 milestone Jun 8, 2020

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 9, 2020

TST: explicitly constructed expected DataFrame (pandas-dev#7002)

787d9ba

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 9, 2020

TST: explicitly constructed expected DataFrame (pandas-dev#7002)

8139d8e

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 9, 2020

TST: explicitly constructed expected DataFrame (pandas-dev#7002)

c8d5fca

fangchenli added a commit to fangchenli/pandas that referenced this issue Jun 14, 2020

Revert "TST: groupby apply with indexing and colunm aggregation retur…

ce4db3c

…ns the column (pandas-dev#7002)" This reverts commit 2e77cef.

jreback closed this as completed in #34647 Jun 14, 2020

jreback pushed a commit that referenced this issue Jun 14, 2020

TST: groupby apply with indexing and column aggregation returns the c…

9dca069

…olumn #7002 (#34647)

rhshadrach mentioned this issue Nov 11, 2022

BUG: groupby.describe with as_index=False incorrect #49643

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG/TST: verify that groupby apply with a column aggregation does not return the column #7002

BUG/TST: verify that groupby apply with a column aggregation does not return the column #7002

jreback commented Apr 29, 2014

jonathanlb commented Oct 14, 2016

OmerJog commented Jan 22, 2019

drburrow commented Apr 15, 2020

fangchenli commented Jun 8, 2020

jreback commented Jun 8, 2020

fangchenli commented Jun 8, 2020

BUG/TST: verify that groupby apply with a column aggregation does not return the column #7002

BUG/TST: verify that groupby apply with a column aggregation does not return the column #7002

Comments

jreback commented Apr 29, 2014

jonathanlb commented Oct 14, 2016

OmerJog commented Jan 22, 2019

drburrow commented Apr 15, 2020

fangchenli commented Jun 8, 2020

jreback commented Jun 8, 2020

fangchenli commented Jun 8, 2020