KeyError: '__dummy__' for pd.crosstab in pandas #10291

songhuiming · 2015-06-05T23:20:24Z

I get KeyError: '__dummy__' when I run the following:

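import numpy as np
import pandas as pd

np.random.seed(seed=99)
s = np.random.randint(1, 10, 200)
s = pd.Series(np.where(s > 9, np.nan, s))
# The two halves keep their original index labels (0-99 and 100-199),
# so s1 and s2 share no index labels.
s1 = s[:100]
s2 = s[100:]
pd.crosstab(s1, s2)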

KeyError: '__dummy__'

lexual · 2015-06-06T10:04:39Z

Even simpler example. Perhaps something to do with the indices not overlapping at all.

s1 = pd.Series([1, 2, 3], index=[1, 2, 3])
s2 = pd.Series([4, 5, 6], index=[4, 5, 6])
pd.crosstab(s1, s2)
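If the intent is to tabulate the values by position rather than by index label, one possible workaround is to drop the index labels before calling pd.crosstab. This is only a sketch under that assumption; resetting the index is not something proposed in this thread, and it only makes sense when both series have the same length.

```python
import pandas as pd

s1 = pd.Series([1, 2, 3], index=[1, 2, 3])
s2 = pd.Series([4, 5, 6], index=[4, 5, 6])

# Assumption: values should be paired by position, not by index label.
# reset_index(drop=True) gives both series the same 0..n-1 index,
# so alignment no longer introduces NaN.
table = pd.crosstab(s1.reset_index(drop=True), s2.reset_index(drop=True))
print(table)
```

With the indices aligned, each (s1, s2) pair is counted once.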

lexual · 2015-06-06T13:04:00Z

Believe this is the root cause of things:

http://pandas.pydata.org/pandas-docs/stable/groupby.html#na-group-handling

lexual · 2015-06-06T13:07:55Z

Yes, http://pandas.pydata.org/pandas-docs/stable/groupby.html#na-group-handling is the cause.

Because the two series have no index labels in common, aligning them leaves a NaN in one of the grouping keys on every row, and groupby excludes rows with NaN keys from its result.

You then end up with an empty dataframe, and that is the cause of the KeyError: crosstab accesses df['__dummy__'] on an empty dataframe.
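The mechanism can be reproduced outside of crosstab. The following is a rough sketch of the idea, not the actual crosstab internals; the column names "row" and "col" are made up for illustration.

```python
import pandas as pd

s1 = pd.Series([1, 2, 3], index=[1, 2, 3])
s2 = pd.Series([4, 5, 6], index=[4, 5, 6])

# Building a frame from both series aligns them on the union of the
# index labels, so every row has a NaN in one of the two columns.
df = pd.DataFrame({"row": s1, "col": s2})
rows_with_nan = int(df.isna().any(axis=1).sum())

# groupby drops NaN keys by default, so grouping on both columns
# leaves no groups at all.
grouped = df.groupby(["row", "col"]).size()

print(df)
print(rows_with_nan, len(grouped))
```

Every row is affected, so nothing survives the grouping and the resulting table has no columns to select from.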

jreback · 2015-06-07T20:50:45Z

yeh, this should just be an empty frame, as there are no cross-tabulations.

lexual · 2015-06-08T02:55:50Z

So this is not a bug?

Should we:

- raise an exception, or
- return an empty dataframe?

jreback · 2015-06-08T02:57:17Z

return an empty frame

dan7davis · 2016-01-05T09:51:43Z

I'm getting the same KeyError: '__dummy__' for my grouped data.

And I'm not really sure how to fix it or what you mean by 'return an empty frame.' Care to dumb it down and show precisely what you mean?

Thanks!

jreback · 2016-01-05T13:16:28Z

@dan7davis this needs a fix that would return an empty frame when catching the KeyError exception raised by the example above

https://github.com/pydata/pandas/blob/master/pandas/tools/pivot.py#L151, just need something like:

try:
    table = table[values[0]]
except KeyError:
    pass
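To see the effect of that pattern in isolation, here is a small standalone sketch. The helper name select_first_value is hypothetical and is not part of pandas; it only wraps the try/except above so that it can be run on its own.

```python
import pandas as pd


def select_first_value(table, values):
    """Hypothetical helper: return table[values[0]], or the table unchanged
    if that column is missing (for example, when the table is empty)."""
    try:
        return table[values[0]]
    except KeyError:
        return table


# An empty table has no '__dummy__' column, so it is returned as-is.
empty_result = select_first_value(pd.DataFrame(), ["__dummy__"])

# A populated table still yields the selected column.
full_result = select_first_value(pd.DataFrame({"__dummy__": [1, 2]}), ["__dummy__"])

print(empty_result.empty, list(full_result))
```

The empty case falls through to the unchanged (empty) frame instead of raising, which matches the "return an empty frame" behavior discussed above.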

dan7davis · 2016-01-05T14:25:47Z

@jreback problem solved. thank you! really appreciate the alacrity

jreback · 2016-01-05T14:32:28Z

want to do a pull request to fix in master?

dan7davis · 2016-01-06T09:07:18Z

I'm (very) new to coding/python/GitHub, so unfortunately I have no idea
what that means. But it sounds useful for me to know & helpful for others,
so I'd be happy to learn/try..

jreback · 2016-01-06T13:32:27Z

contributing is a great way to learn ...., see our docs: http://pandas.pydata.org/pandas-docs/stable/contributing.html

any questions, pls ask.

jreback added Bug Reshaping Concat, Merge/Join, Stack/Unstack, Explode labels Jun 7, 2015

jreback added this to the Next Major Release milestone Jun 7, 2015

jreback added Difficulty Novice labels Jan 6, 2016

jreback mentioned this issue Jan 25, 2016

crosstab dropna=False breaks columns.names #12133

Closed

tshauck mentioned this issue Feb 11, 2016

BUG: Fixes KeyError when indexes don't overlap. #12287

Closed

jreback modified the milestones: 0.18.0, Next Major Release Feb 11, 2016

jreback closed this as completed in dcc7cca Feb 12, 2016

jreback mentioned this issue May 25, 2016

BUG: pd.crosstab cannot handle series with the same name #13279

Closed
