Difference in groupby behavior between Pandas 0.13.1 and 0.15.2 #9560

dakoner · 2015-02-26T23:30:36Z

Hi, I am seeing a difference in behavior on this groupby between Pandas 0.13.1 and 0.15.2. Specifically, 0.15.2 appears to do a cross join of the index levels, while 0.13.1 does not.

import pandas

print(pandas.DataFrame([
    {'a': 1, 'b': 2, 'c': 3},
    {'a': 4, 'b': 5, 'c': 6},
]).set_index(list('ab')).groupby(level=list('ab')).mean())

0.13.1 produces:

     c
a b   
1 2  3
4 5  6
[2 rows x 1 columns]

while 0.15.2 produces basically the same matrix, but with extra NaN entries for the cross combinations of the index levels.

We're wondering if this behavior is intentional, or a bug. It wasn't entirely clear from the set of release notes that the groupby behavior changed so much.


jreback · 2015-02-27T11:51:04Z

The resulting cartesian product of the indices (i.e. the NaN entries) was a bug in 0.15+ (maybe only in 0.15.2). It is fixed in 0.16.0 (coming soon) by #9177.
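
For anyone who wants to check whether an installed version is affected, here is a minimal sketch based on the example in the report. It compares the index pairs in the grouped result with the pairs actually present in the input. The `dropna(how='all')` step is only an assumed workaround, not something suggested in this thread, and the variable names are illustrative.

```python
import pandas as pd

# Same frame as in the report, indexed by the two key columns.
df = pd.DataFrame([
    {'a': 1, 'b': 2, 'c': 3},
    {'a': 4, 'b': 5, 'c': 6},
]).set_index(['a', 'b'])

result = df.groupby(level=['a', 'b']).mean()

# Index pairs present in the input and in the grouped result.
observed = set(df.index.tolist())
grouped = set(result.index.tolist())

# Pairs such as (1, 5) or (4, 2) would only show up if the levels were cross-joined.
extra = grouped - observed
print("extra index pairs:", sorted(extra))

# Assumed workaround for an affected version: drop rows where every column is NaN.
cleaned = result.dropna(how='all')
print(cleaned)
```

On an unaffected version `extra` should be empty and `cleaned` should match `result`. Note that `dropna(how='all')` would also remove a legitimate group whose aggregated values are all NaN, so upgrading is the safer route.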

jreback closed this as completed Feb 27, 2015

jreback added Bug Groupby labels Feb 27, 2015

jreback mentioned this issue Feb 27, 2015

INT: support DatetimeBlock with timezones #8260

Closed
