Float conversion makes fillna lossy #24537

jbrockmendel · 2019-01-01T19:44:15Z

For int64 data near the int64 implementation bounds, astype('float64') or ensure_float64 is lossy. The motivating case is PeriodArray.fillna:

import pandas as pd

dti = pd.date_range(pd.Timestamp.max - pd.Timedelta(nanoseconds=10), periods=5, freq='ns')
pi = dti.to_period('ns')
parr = pi._data
parr[2] = pd.NaT

>>> parr.fillna(method='pad')
<PeriodArray>
['NaT', 'NaT', 'NaT', 'NaT', 'NaT']
Length: 5, dtype: period[N]
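
To see the underlying effect in isolation, here is a minimal pure-Python sketch (not the pandas internals). It assumes only that a Python float is an IEEE 754 double, the same format as float64, and that pd.Timestamp.max.value equals 2**63 - 1; the variable names are illustrative.

```python
# Largest int64; assumed equal to pd.Timestamp.max.value (nanoseconds).
INT64_MAX = 2**63 - 1

# Five consecutive nanosecond ordinals, mirroring the date_range above.
ordinals = [INT64_MAX - 10 + i for i in range(5)]

# Python floats are IEEE 754 doubles, so this mimics astype('float64').
as_floats = [float(v) for v in ordinals]

# Near 2**63 adjacent doubles are 1024 apart, so all five values collapse.
distinct_floats = set(as_floats)
round_tripped = [int(f) for f in as_floats]

print(len(distinct_floats))            # 1
print(round_tripped[0] == 2**63)       # True
print(round_tripped[0] in ordinals)    # False
```

The round-tripped value 2**63 no longer fits in int64. In a wrapping cast it would land on the int64 minimum, which pandas uses as the NaT sentinel, so this may be how every element ended up as NaT, although the thread does not confirm the exact code path.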


jreback · 2019-01-01T20:44:47Z

In [1]: dti = pd.date_range(pd.Timestamp.max - pd.Timedelta(nanoseconds=10), periods=5, freq='ns')
   ...: pi = dti.to_period('ns')
   ...: parr = pi._data
   ...: parr[2] = pd.NaT
   ...: 

In [2]: parr
Out[2]: 
<PeriodArray>
['2262-04-11 23:47:16.854775797', '2262-04-11 23:47:16.854775798',
                           'NaT', '2262-04-11 23:47:16.854775800',
 '2262-04-11 23:47:16.854775801']
Length: 5, dtype: period[N]

In [4]: pd.Series(parr).fillna(parr[0])
Out[4]: 
0    2262-04-11 23:47:16.854775797
1    2262-04-11 23:47:16.854775798
2    2262-04-11 23:47:16.854775797
3    2262-04-11 23:47:16.854775800
4    2262-04-11 23:47:16.854775801
dtype: period[N]

~~Series goes through the correct path; I wonder what it's dispatching to~~

Actually no, you are right, this is lossy.

jbrockmendel · 2019-01-01T20:58:24Z

(Internet is down, typing with thumbs for a while)

I’ve poked at this a bit for interpolate; if the int64 values fall inside int32 bounds then we are OK. Otherwise we need to cast to float128 to be assured the conversion is lossless.
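
The bound can be stated a bit more precisely: float64 has a 53-bit significand, so every integer with magnitude up to 2**53 converts exactly, which covers the int32 range mentioned above. A small sketch of that check, where `survives_float64` is a hypothetical helper name and not anything in pandas:

```python
# float64 represents every integer with |value| <= 2**53 exactly.
FLOAT64_EXACT_LIMIT = 2**53


def survives_float64(value: int) -> bool:
    """Return True if value is unchanged by an int -> float64 -> int round trip."""
    return int(float(value)) == value


print(survives_float64(2**31 - 1))                # True  (int32 max)
print(survives_float64(-(2**31)))                 # True  (int32 min)
print(survives_float64(FLOAT64_EXACT_LIMIT))      # True
print(survives_float64(FLOAT64_EXACT_LIMIT + 1))  # False (rounds to 2**53)
print(survives_float64(2**63 - 1))                # False (rounds to 2**63)
```

Some individual values above 2**53 still round-trip (for example exact powers of two), so the limit is a guarantee for the whole range rather than a per-value test. NumPy's float128/longdouble is also platform dependent: on many x86 builds it is 80-bit extended precision with a 64-bit significand, which is enough for int64, but on some platforms it is no wider than a double, so that route would need a platform check.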

mroeschke · 2021-06-25T04:44:54Z

This looks okay on master now. Could use a test.

In [20]: parr.fillna(method='pad')
Out[20]:
<PeriodArray>
['2262-04-11 23:47:16.854775797', '2262-04-11 23:47:16.854775798',
 '2262-04-11 23:47:16.854775798', '2262-04-11 23:47:16.854775800',
 '2262-04-11 23:47:16.854775801']
Length: 5, dtype: period[N]

KianYang-Lee · 2021-07-05T09:26:34Z

Hi @mroeschke, I would like to contribute. Could you give some guidance on what to test? Thanks.

mroeschke · 2021-07-05T23:23:37Z

@KianYang-Lee we would need a test to validate that the code snippet in the original post returns the result in my previous comment.

KianYang-Lee · 2021-07-06T04:34:16Z

OK, taking this. I will come up with the test and result soon.

jackgoldsmith4 · 2022-06-13T00:02:49Z

@KianYang-Lee are you still working on this?

HoWeiChin · 2022-10-06T11:25:45Z

@jackgoldsmith4 may I take it instead?

KianYang-Lee · 2022-11-15T03:29:29Z

No, I'm not. Got caught up with work. Sorry, please proceed.


jreback added the "Missing-data" label (np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate) on Jan 1, 2019

jbrockmendel mentioned this issue Jan 6, 2019

REF: clear out a bunch of algos, de-duplicate a bunch of core.missing #24652

Merged

4 tasks

mroeschke added the "good first issue" and "Needs Tests" (unit test(s) needed to prevent regressions) labels and removed the "Missing-data" label (np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate) on Jun 25, 2021

phofl mentioned this issue Nov 16, 2022

TST: Fixed issues that need tests noatamir/pyladies-berlin-sprints#3

Open

17 tasks

mstazherova mentioned this issue Jan 10, 2023

TST: Add a test for fillna in PeriodArray #50671

Merged

1 task

phofl added a commit to mstazherova/pandas-dev that referenced this issue Jan 15, 2023

Merge branch 'main' into pandas-devGH-24537/parr-fillna-test

e80ae82

phofl closed this as completed in #50671 Jan 15, 2023
