int64 Index in 0.17 typecasts '0' string to integer #11836

AlbertDeFusco · 2015-12-13T21:11:01Z

Here's what I got. Is this expected behavior? I could not find a reference for this functionality in the release notes.

0.16

conda create -n pd pandas=0.16 python=3.4 ipython

In 0.16 the dtype of the index changes to object when adding a row with '0'.

In [1]: import pandas as pd
In [2]: import numpy as np
In [3]: data = np.random.random(10)
In [4]: m=pd.Series(data)
In [5]: m[0]=0.4444
In [6]: m.index
Out[6]: Int64Index([0, 1, 2, 3, 4, 5, 6, 7, 8, 9], dtype='int64')
In [7]: m['0']=0.5555
In [8]: m.index
Out[8]: Index([0, 1, 2, 3, 4, 5, 6, 7, 8, 9, '0'], dtype='object')

0.17.1

conda create -n pd pandas=0.17 python=3.4 ipython

In 0.17.1 the index dtype does not change; instead, the string '0' is typecast to the integer 0. Each repeated assignment with m['0'] appends another 0 to the index rather than updating the existing row.

In [1]: import pandas as pd
In [2]: import numpy as np
In [3]: data = np.random.random(10)
In [4]: m=pd.Series(data)
In [5]: m[0]=0.4444
In [6]: m.index
Out[6]: Int64Index([0, 1, 2, 3, 4, 5, 6, 7, 8, 9], dtype='int64')
In [7]: m['0']=0.5555
In [8]: m.index
Out[8]: Int64Index([0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 0], dtype='int64')
In [9]: m['0']=0.6666
In [10]: m.index
Out[10]: Int64Index([0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 0, 0], dtype='int64')
In [11]: m[0]
Out[11]: 
0    0.4444
0    0.5555
0    0.6666
dtype: float64


jreback · 2015-12-13T21:43:12Z

I suppose this should work to be consistent, though it might be a bit tricky. This is a failing of Index.insert, where the dtype of the inserted element is inferred using the dtype of the current index. That inference is a tricky thing: you almost always want to do it, because it will raise if the element is not compatible, except when it happens that a string version is DIRECTLY convertible (in this case to a numpy array).
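To make the failure mode above concrete, here is a minimal pure-Python sketch. It is only a model of the behavior described, not pandas code: `insert_inferring` and `insert_strict` are hypothetical names, and a plain list stands in for the index.

```python
def insert_inferring(labels, new_label):
    """Append new_label after coercing it to the type of the existing labels.

    Hypothetical model of inferring the inserted element's type from the
    current index; this is not how pandas implements Index.insert.
    """
    label_type = type(labels[0])
    # int('0') succeeds, so the string silently becomes the integer 0.
    # int('a') raises ValueError, which is the "not compatible" case.
    return labels + [label_type(new_label)]


def insert_strict(labels, new_label):
    """Append new_label unchanged, so the labels end up with mixed types."""
    return labels + [new_label]


labels = [0, 1, 2]
print(insert_inferring(labels, '0'))  # [0, 1, 2, 0]
print(insert_strict(labels, '0'))     # [0, 1, 2, '0']
```

The first result mirrors the 0.17.1 output (a duplicate integer label), the second mirrors the 0.16 output (an object-like index that keeps the string).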

DavidMertz · 2015-12-14T17:16:04Z

It's worse still though, because m[0]=123 will modify an existing row, but m["0"]=123 will add more rows. So even in the crazy world of PHP-style type casting, the behavior is different depending on the type of the thing that gets cast.
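One way to sidestep the asymmetry in user code is to refuse string labels on an integer index before assigning. The sketch below assumes a recent pandas that provides `pandas.api.types.is_integer_dtype` (not available in 0.17); `set_strict` is a hypothetical helper, not part of pandas and not the fix that was eventually merged.

```python
import numpy as np
import pandas as pd
from pandas.api.types import is_integer_dtype


def set_strict(series, key, value):
    """Assign series.loc[key] = value, rejecting str keys on an integer index.

    Hypothetical guard for illustration only.
    """
    if isinstance(key, str) and is_integer_dtype(series.index.dtype):
        raise TypeError(
            "string label %r used on an integer index" % (key,)
        )
    series.loc[key] = value


m = pd.Series(np.random.random(10))
set_strict(m, 0, 123)        # modifies the existing row labelled 0
try:
    set_strict(m, "0", 123)  # rejected instead of being cast or appended
except TypeError as exc:
    print("rejected:", exc)
```

With the guard in place, the series keeps its ten rows and its integer index whichever pandas version performs the assignment.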

…nal setting, and raise a TypeError, xref pandas-dev#4892 BUG: index type coercion when setting with an integer-like closes pandas-dev#11836

jreback added the Bug, Indexing, Dtype Conversions, and Difficulty Intermediate labels Dec 13, 2015

jreback added this to the Next Major Release milestone Dec 13, 2015

jreback modified the milestones: 0.18.0, Next Major Release Feb 6, 2016

jreback mentioned this issue Feb 6, 2016

DEPR: removal of deprecation warnings for float indexers #12246

Closed

jreback closed this as completed in a8be55c Feb 13, 2016
