[BUG] Empty DataFrame object `columns` property doesn't match pandas for `data=None` or `data={}`. #15372

wence- · 2024-03-22T11:45:37Z

Describe the bug

When constructing an empty dataframe where one does not explicitly specify the column names, pandas produces a RangeIndex for the .columns property.

In contrast, cudf produces an Index(dtype=object) if data={} or data=None.

Steps/Code to reproduce bug

import cudf
import pandas as pd

for data in [{}, None]:
    columns = cudf.DataFrame(data=data).columns
    expect = pd.DataFrame(data=data).columns

    assert type(columns) == type(expect)

Expected behavior

cudf should match pandas and produce a RangeIndex here. This already works when data is an empty list-like object (e.g. data=[]), so it is probably just another condition to handle.
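For reference, here is a minimal pandas-only sketch of the behaviour cudf is expected to match. It prints the `.columns` type for each of the three empty inputs mentioned above. The helper name `columns_type` is made up for this illustration, and the printed type may depend on the installed pandas version, so no output is shown.

```python
import pandas as pd


def columns_type(data):
    """Return the type of .columns for a DataFrame built from `data`."""
    return type(pd.DataFrame(data=data).columns)


# The three "empty" inputs discussed in this issue.
empty_inputs = {"{}": {}, "None": None, "[]": []}

types = {label: columns_type(data) for label, data in empty_inputs.items()}
for label, t in types.items():
    print(f"data={label}: {t.__name__}")
```

If pandas reports the same type for all three inputs, the cudf fix amounts to making `data={}` and `data=None` take the same path that `data=[]` already takes.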


wence- added the bug Something isn't working label Mar 22, 2024

wence- self-assigned this Mar 22, 2024

wence- added the pandas label Mar 22, 2024

wence- added this to the Pandas API Alignment and Coverage milestone Mar 22, 2024

wence- added a commit to wence-/cudf that referenced this issue Mar 22, 2024

Match pandas in column index type for empty dataframes

3ad0e2d

- Closes rapidsai#15372

wence- added a commit to wence-/cudf that referenced this issue Mar 22, 2024

Add test of rapidsai#15372

bb3f865

wence- mentioned this issue Mar 22, 2024

Fix arrow-based round trip of empty dataframes #15373

Merged

3 tasks

vyasr removed the pandas label May 16, 2024

vyasr added this to cuDF Python May 24, 2024

github-project-automation bot moved this to Todo in cuDF Python May 24, 2024
