We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mangle_dupe_cols
CSV reader
Describe the bug Starting pandas 1.4.0, there is a change in behavior of mangle_dupe_cols in cudf.read_csv. The change in behavior is explained well here: https://pandas.pydata.org/docs/whatsnew/v1.4.0.html#mangle-dupe-cols-in-read-csv-no-longer-renames-unique-columns-conflicting-with-target-names
pandas 1.4.0
cudf.read_csv
Steps/Code to reproduce bug
>>> import pandas as pd >>> import cudf >>> from io import StringIO >>> buffer = """A,A,A.1,A,A.2,A,A.4,A,A ... 1,2,3.1,4,a.2,a,a.4,a,a ... 2,4,6.1,8,b.2,b,b.4,b,b""" >>> pdf = pd.read_csv(StringIO(buffer)) >>> gdf = cudf.read_csv(StringIO(buffer)) >>> gdf A A.1 A.1.1 A.2 A.2.1 A.3 A.4 A.4.1 A.5 0 1 2 3.1 4 a.2 a a.4 a a 1 2 4 6.1 8 b.2 b b.4 b b >>> pdf A A.3 A.1 A.5 A.2 A.6 A.4 A.7 A.8 0 1 2 3.1 4 a.2 a a.4 a a 1 2 4 6.1 8 b.2 b b.4 b b
Expected behavior
>>> gdf A A.3 A.1 A.5 A.2 A.6 A.4 A.7 A.8 0 1 2 3.1 4 a.2 a a.4 a a 1 2 4 6.1 8 b.2 b b.4 b b
Environment overview (please complete the following information)
Environment details Please run and paste the output of the cudf/print_env.sh script here, to gather any other relevant environment details
cudf/print_env.sh
**git*** commit bb3a6b995f20ef419705cf67821804441e485b9c (HEAD -> pandas_1.4x) Merge: e6098bf963 fb03c8bc91 Author: galipremsagar <[email protected]> Date: Thu Apr 7 07:39:23 2022 -0700 Merge remote-tracking branch 'upstream/branch-22.06' into pandas_1.4x **git submodules*** ***OS Information*** DISTRIB_ID=Ubuntu DISTRIB_RELEASE=18.04 DISTRIB_CODENAME=bionic DISTRIB_DESCRIPTION="Ubuntu 18.04.4 LTS" NAME="Ubuntu" VERSION="18.04.4 LTS (Bionic Beaver)" ID=ubuntu ID_LIKE=debian PRETTY_NAME="Ubuntu 18.04.4 LTS" VERSION_ID="18.04" HOME_URL="https://www.ubuntu.com/" SUPPORT_URL="https://help.ubuntu.com/" BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" VERSION_CODENAME=bionic UBUNTU_CODENAME=bionic Linux dt07 4.15.0-76-generic #86-Ubuntu SMP Fri Jan 17 17:24:28 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux ***GPU Information*** Thu Apr 7 08:36:31 2022 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 495.29.05 Driver Version: 495.29.05 CUDA Version: 11.5 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 Tesla T4 On | 00000000:3B:00.0 Off | 0 | | N/A 44C P8 10W / 70W | 0MiB / 15109MiB | 0% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ | 1 Tesla T4 On | 00000000:5E:00.0 Off | 0 | | N/A 38C P8 9W / 70W | 0MiB / 15109MiB | 0% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ | 2 Tesla T4 On | 00000000:AF:00.0 Off | 0 | | N/A 33C P8 9W / 70W | 0MiB / 15109MiB | 0% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ | 3 Tesla T4 On | 00000000:D8:00.0 Off | 0 | | N/A 32C P8 9W / 70W | 0MiB / 15109MiB | 0% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=============================================================================| | No running processes found | +-----------------------------------------------------------------------------+ ***CPU*** Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Byte Order: Little Endian CPU(s): 64 On-line CPU(s) list: 0-63 Thread(s) per core: 2 Core(s) per socket: 16 Socket(s): 2 NUMA node(s): 2 Vendor ID: GenuineIntel CPU family: 6 Model: 85 Model name: Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz Stepping: 4 CPU MHz: 1766.199 BogoMIPS: 4200.00 Virtualization: VT-x L1d cache: 32K L1i cache: 32K L2 cache: 1024K L3 cache: 22528K NUMA node0 CPU(s): 0,2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62 NUMA node1 CPU(s): 1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,33,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63 Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd mba ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm mpx rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb intel_pt avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts pku ospke md_clear flush_l1d ***CMake*** /nvme/0/pgali/envs/cudfdev/bin/cmake cmake version 3.22.3 CMake suite maintained and supported by Kitware (kitware.com/cmake). ***g++*** /usr/bin/g++ g++ (Ubuntu 9.4.0-1ubuntu1~18.04) 9.4.0 Copyright (C) 2019 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. ***nvcc*** /usr/local/cuda/bin/nvcc nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2021 NVIDIA Corporation Built on Mon_Sep_13_19:13:29_PDT_2021 Cuda compilation tools, release 11.5, V11.5.50 Build cuda_11.5.r11.5/compiler.30411180_0 ***Python*** /nvme/0/pgali/envs/cudfdev/bin/python Python 3.8.13 ***Environment Variables*** PATH : /nvme/0/pgali/envs/cudfdev/bin:~/anaconda3/bin:/nvme/0/pgali/envs/cudfdev/bin:~/anaconda3/bin:/nvme/0/pgali/envs/cudfdev/bin:/home/nfs/pgali/anaconda3/condabin:/home/nfs/pgali/.vscode-server/bin/e18005f0f1b33c29e81d732535d8c0e47cafb0b5/bin/remote-cli:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/snap/bin:/usr/local/cuda/bin:/usr/local/cuda/bin LD_LIBRARY_PATH : /usr/local/cuda/lib64:/usr/local/cuda/lib64::/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64 NUMBAPRO_NVVM : NUMBAPRO_LIBDEVICE : CONDA_PREFIX : /nvme/0/pgali/envs/cudfdev PYTHON_PATH : ***conda packages*** /home/nfs/pgali/anaconda3/condabin/conda # packages in environment at /nvme/0/pgali/envs/cudfdev: # # Name Version Build Channel _libgcc_mutex 0.1 conda_forge conda-forge _openmp_mutex 4.5 1_llvm conda-forge abseil-cpp 20210324.2 h9c3ff4c_0 conda-forge alabaster 0.7.12 py_0 conda-forge argon2-cffi 21.3.0 pyhd8ed1ab_0 conda-forge argon2-cffi-bindings 21.2.0 py38h497a2fe_1 conda-forge arrow-cpp 7.0.0 py38hf79e478_4_cuda conda-forge arrow-cpp-proc 3.0.0 cuda conda-forge asttokens 2.0.5 pyhd8ed1ab_0 conda-forge attrs 21.4.0 pyhd8ed1ab_0 conda-forge aws-c-cal 0.5.11 h95a6274_0 conda-forge aws-c-common 0.6.2 h7f98852_0 conda-forge aws-c-event-stream 0.2.7 h3541f99_13 conda-forge aws-c-io 0.10.5 hfb6a706_0 conda-forge aws-checksums 0.1.11 ha31a3da_7 conda-forge aws-sdk-cpp 1.8.186 hb4091e7_3 conda-forge babel 2.9.1 pyh44b312d_0 conda-forge backcall 0.2.0 pyh9f0ad1d_0 conda-forge backports 1.0 py_2 conda-forge backports-zoneinfo 0.2.1 pypi_0 pypi backports.functools_lru_cache 1.6.4 pyhd8ed1ab_0 conda-forge beautifulsoup4 4.10.0 pyha770c72_0 conda-forge binutils_impl_linux-64 2.36.1 h193b22a_2 conda-forge black 22.3.0 pyhd8ed1ab_0 conda-forge bleach 4.1.0 pyhd8ed1ab_0 conda-forge bokeh 2.4.2 py38h578d9bd_0 conda-forge brotlipy 0.7.0 py38h0a891b7_1004 conda-forge bzip2 1.0.8 h7f98852_4 conda-forge c-ares 1.18.1 h7f98852_0 conda-forge ca-certificates 2021.10.8 ha878542_0 conda-forge cachetools 5.0.0 pyhd8ed1ab_0 conda-forge certifi 2021.10.8 py38h578d9bd_2 conda-forge cffi 1.15.0 py38h3931269_0 conda-forge cfgv 3.3.1 pyhd8ed1ab_0 conda-forge charset-normalizer 2.0.12 pyhd8ed1ab_0 conda-forge clang 11.1.0 ha770c72_1 conda-forge clang-11 11.1.0 default_ha53f305_1 conda-forge clang-tools 11.1.0 default_ha53f305_1 conda-forge clangxx 11.1.0 default_ha53f305_1 conda-forge click 8.1.2 py38h578d9bd_0 conda-forge cloudpickle 2.0.0 pyhd8ed1ab_0 conda-forge cmake 3.22.3 h5432695_0 conda-forge cmake_setuptools 0.1.3 py_0 rapidsai colorama 0.4.4 pyh9f0ad1d_0 conda-forge commonmark 0.9.1 py_0 conda-forge cryptography 36.0.2 py38h2b5fc30_1 conda-forge cuda-python 11.6.1 py38h3fd9d12_0 nvidia cudatoolkit 11.5.1 hcf5317a_9 nvidia cudf 22.6.0a0+117.gbb3a6b995f.dirty pypi_0 pypi cupy 10.3.0 py38h405e1b6_0 conda-forge cyrus-sasl 2.1.27 h230043b_5 conda-forge cython 0.29.28 py38hfa26641_2 conda-forge cytoolz 0.11.2 py38h0a891b7_2 conda-forge dask 2022.3.0 pyhd8ed1ab_1 conda-forge dask-core 2022.3.0 pyhd8ed1ab_0 conda-forge dask-cudf 22.6.0a0+117.gbb3a6b995f pypi_0 pypi dataclasses 0.8 pyhc8e2a94_3 conda-forge debugpy 1.5.1 py38h709712a_0 conda-forge decorator 5.1.1 pyhd8ed1ab_0 conda-forge defusedxml 0.7.1 pyhd8ed1ab_0 conda-forge distlib 0.3.4 pyhd8ed1ab_0 conda-forge distributed 2022.3.0 pyhd8ed1ab_0 conda-forge dlpack 0.5 h9c3ff4c_0 conda-forge docutils 0.17.1 py38h578d9bd_1 conda-forge double-conversion 3.2.0 h9c3ff4c_0 conda-forge entrypoints 0.4 pyhd8ed1ab_0 conda-forge execnet 1.9.0 pyhd8ed1ab_0 conda-forge executing 0.8.3 pyhd8ed1ab_0 conda-forge expat 2.4.8 h27087fc_0 conda-forge fastavro 1.4.10 py38h0a891b7_0 conda-forge fastrlock 0.8 py38hfa26641_1 conda-forge filelock 3.6.0 pyhd8ed1ab_0 conda-forge flake8 3.8.3 py_1 conda-forge flit-core 3.7.1 pyhd8ed1ab_0 conda-forge freetype 2.10.4 h0708190_1 conda-forge fsspec 2022.3.0 pyhd8ed1ab_0 conda-forge future 0.18.2 py38h578d9bd_5 conda-forge gcc_impl_linux-64 11.2.0 h82a94d6_14 conda-forge gettext 0.19.8.1 h73d1719_1008 conda-forge gflags 2.2.2 he1b5a44_1004 conda-forge giflib 5.2.1 h36c2ea0_2 conda-forge glog 0.6.0 h6f12383_0 conda-forge gmp 6.2.1 h58526e2_0 conda-forge grpc-cpp 1.43.2 h9e046d8_1 conda-forge heapdict 1.0.1 py_0 conda-forge huggingface_hub 0.5.1 pyhd8ed1ab_0 conda-forge hypothesis 6.41.0 pyhd8ed1ab_0 conda-forge identify 2.4.12 pyhd8ed1ab_0 conda-forge idna 3.3 pyhd8ed1ab_0 conda-forge imagesize 1.3.0 pyhd8ed1ab_0 conda-forge importlib-metadata 4.11.3 py38h578d9bd_1 conda-forge importlib_metadata 4.11.3 hd8ed1ab_1 conda-forge importlib_resources 5.6.0 pyhd8ed1ab_0 conda-forge iniconfig 1.1.1 pyh9f0ad1d_0 conda-forge ipykernel 6.12.1 py38h7f3c49e_0 conda-forge ipython 8.2.0 py38h578d9bd_0 conda-forge ipython_genutils 0.2.0 py_1 conda-forge isort 5.6.4 py_0 conda-forge jbig 2.1 h7f98852_2003 conda-forge jedi 0.18.1 py38h578d9bd_1 conda-forge jinja2 3.0.3 pyhd8ed1ab_0 conda-forge joblib 1.1.0 pyhd8ed1ab_0 conda-forge jpeg 9e h7f98852_0 conda-forge jsonschema 4.4.0 pyhd8ed1ab_0 conda-forge jupyter_client 7.2.2 pyhd8ed1ab_0 conda-forge jupyter_core 4.9.2 py38h578d9bd_0 conda-forge jupyterlab_pygments 0.1.2 pyh9f0ad1d_0 conda-forge kernel-headers_linux-64 2.6.32 he073ed8_15 conda-forge keyutils 1.6.1 h166bdaf_0 conda-forge krb5 1.19.3 h3790be6_0 conda-forge lcms2 2.12 hddcbb42_0 conda-forge ld_impl_linux-64 2.36.1 hea4e1c9_2 conda-forge lerc 3.0 h9c3ff4c_0 conda-forge libblas 3.9.0 14_linux64_mkl conda-forge libbrotlicommon 1.0.9 h166bdaf_7 conda-forge libbrotlidec 1.0.9 h166bdaf_7 conda-forge libbrotlienc 1.0.9 h166bdaf_7 conda-forge libcblas 3.9.0 14_linux64_mkl conda-forge libclang-cpp11.1 11.1.0 default_ha53f305_1 conda-forge libcrc32c 1.1.2 h9c3ff4c_0 conda-forge libcurl 7.82.0 h7bff187_0 conda-forge libdeflate 1.10 h7f98852_0 conda-forge libedit 3.1.20191231 he28a2e2_2 conda-forge libev 4.33 h516909a_1 conda-forge libevent 2.1.10 h9b69904_4 conda-forge libffi 3.4.2 h7f98852_5 conda-forge libgcc-devel_linux-64 11.2.0 h0952999_14 conda-forge libgcc-ng 11.2.0 h1d223b6_14 conda-forge libgcrypt 1.10.0 h7f98852_0 conda-forge libgomp 11.2.0 h1d223b6_14 conda-forge libgoogle-cloud 1.35.0 h6945097_2 conda-forge libgpg-error 1.44 h9eb791d_0 conda-forge libgsasl 1.10.0 h5b4c23d_0 conda-forge liblapack 3.9.0 14_linux64_mkl conda-forge libllvm11 11.1.0 hf817b99_3 conda-forge libnghttp2 1.47.0 h727a467_0 conda-forge libnsl 2.0.0 h7f98852_0 conda-forge libntlm 1.4 h7f98852_1002 conda-forge libpng 1.6.37 h21135ba_2 conda-forge libprotobuf 3.19.4 h780b84a_0 conda-forge librdkafka 1.7.0 hc49e61c_1 conda-forge librmm 22.06.00a220407 cuda11_g921d286_22 rapidsai-nightly libsanitizer 11.2.0 he4da1e4_14 conda-forge libsodium 1.0.18 h36c2ea0_1 conda-forge libssh2 1.10.0 ha56f1ee_2 conda-forge libstdcxx-ng 11.2.0 he4da1e4_14 conda-forge libthrift 0.16.0 h519c5ea_1 conda-forge libtiff 4.3.0 h542a066_3 conda-forge libutf8proc 2.7.0 h7f98852_0 conda-forge libuuid 2.32.1 h7f98852_1000 conda-forge libuv 1.43.0 h7f98852_0 conda-forge libwebp 1.2.2 h3452ae3_0 conda-forge libwebp-base 1.2.2 h7f98852_1 conda-forge libxcb 1.13 h7f98852_1004 conda-forge libzlib 1.2.11 h166bdaf_1014 conda-forge llvm-openmp 13.0.1 he0ac6c6_1 conda-forge llvmlite 0.38.0 py38h38d86a4_1 conda-forge locket 0.2.0 py_2 conda-forge lz4 4.0.0 py38h1bf946c_1 conda-forge lz4-c 1.9.3 h9c3ff4c_1 conda-forge markdown 3.3.6 pyhd8ed1ab_0 conda-forge markupsafe 2.1.1 py38h0a891b7_1 conda-forge matplotlib-inline 0.1.3 pyhd8ed1ab_0 conda-forge mccabe 0.6.1 py_1 conda-forge mimesis 4.0.0 pyh9f0ad1d_0 conda-forge mistune 0.8.4 py38h497a2fe_1005 conda-forge mkl 2022.0.1 h8d4b97c_803 conda-forge msgpack-python 1.0.3 py38h43d8883_1 conda-forge mypy 0.782 py_0 conda-forge mypy_extensions 0.4.3 py38h578d9bd_5 conda-forge nbclient 0.5.13 pyhd8ed1ab_0 conda-forge nbconvert 6.4.5 pyhd8ed1ab_2 conda-forge nbconvert-core 6.4.5 pyhd8ed1ab_2 conda-forge nbconvert-pandoc 6.4.5 pyhd8ed1ab_2 conda-forge nbformat 5.3.0 pyhd8ed1ab_0 conda-forge nbsphinx 0.8.8 pyhd8ed1ab_0 conda-forge ncurses 6.3 h9c3ff4c_0 conda-forge nest-asyncio 1.5.5 pyhd8ed1ab_0 conda-forge ninja 1.10.2 h4bd325d_1 conda-forge nodeenv 1.6.0 pyhd8ed1ab_0 conda-forge notebook 6.4.10 pyha770c72_0 conda-forge numba 0.55.1 py38h4bf6c61_0 conda-forge numpy 1.21.5 py38h87f13fb_0 conda-forge numpydoc 1.2.1 pyhd8ed1ab_2 conda-forge nvtx 0.2.3 py38h497a2fe_1 conda-forge openjpeg 2.4.0 hb52868f_1 conda-forge openssl 1.1.1n h166bdaf_0 conda-forge orc 1.7.3 h1be678f_0 conda-forge packaging 21.3 pyhd8ed1ab_0 conda-forge pandas 1.4.2 py38h47df419_0 conda-forge pandoc 1.19.2 0 conda-forge pandocfilters 1.5.0 pyhd8ed1ab_0 conda-forge parquet-cpp 1.5.1 2 conda-forge parso 0.8.3 pyhd8ed1ab_0 conda-forge partd 1.2.0 pyhd8ed1ab_0 conda-forge pathspec 0.9.0 pyhd8ed1ab_0 conda-forge pexpect 4.8.0 pyh9f0ad1d_2 conda-forge pickleshare 0.7.5 py_1003 conda-forge pillow 9.1.0 py38h0ee0e06_0 conda-forge pip 22.0.4 pyhd8ed1ab_0 conda-forge platformdirs 2.5.1 pyhd8ed1ab_0 conda-forge pluggy 1.0.0 py38h578d9bd_3 conda-forge pre-commit 2.18.1 py38h578d9bd_0 conda-forge prometheus_client 0.14.0 pyhd8ed1ab_0 conda-forge prompt-toolkit 3.0.29 pyha770c72_0 conda-forge protobuf 3.19.4 py38h709712a_0 conda-forge psutil 5.9.0 py38h0a891b7_1 conda-forge pthread-stubs 0.4 h36c2ea0_1001 conda-forge ptxcompiler 0.2.0 py38h98f4b32_0 rapidsai ptyprocess 0.7.0 pyhd3deb0d_0 conda-forge pure_eval 0.2.2 pyhd8ed1ab_0 conda-forge py 1.11.0 pyh6c4a22f_0 conda-forge py-cpuinfo 8.0.0 pyhd8ed1ab_0 conda-forge pyarrow 7.0.0 py38h17143e8_4_cuda conda-forge pycodestyle 2.6.0 pyh9f0ad1d_0 conda-forge pycparser 2.21 pyhd8ed1ab_0 conda-forge pydata-sphinx-theme 0.8.1 pyhd8ed1ab_0 conda-forge pydocstyle 6.1.1 pyhd8ed1ab_0 conda-forge pyflakes 2.2.0 pyh9f0ad1d_0 conda-forge pygments 2.11.2 pyhd8ed1ab_0 conda-forge pyopenssl 22.0.0 pyhd8ed1ab_0 conda-forge pyorc 0.6.0 pypi_0 pypi pyparsing 3.0.7 pyhd8ed1ab_0 conda-forge pyrsistent 0.18.1 py38h0a891b7_1 conda-forge pysocks 1.7.1 py38h578d9bd_5 conda-forge pytest 7.1.1 py38h578d9bd_1 conda-forge pytest-benchmark 3.4.1 pyhd8ed1ab_0 conda-forge pytest-forked 1.4.0 pyhd8ed1ab_0 conda-forge pytest-xdist 2.5.0 pyhd8ed1ab_0 conda-forge python 3.8.13 h582c2e5_0_cpython conda-forge python-confluent-kafka 1.7.0 py38h497a2fe_2 conda-forge python-dateutil 2.8.2 pyhd8ed1ab_0 conda-forge python-fastjsonschema 2.15.3 pyhd8ed1ab_0 conda-forge python-snappy 0.6.0 py38h49bdff1_1 conda-forge python_abi 3.8 2_cp38 conda-forge pytorch 1.11.0 cpu_py38hde1b6bc_0 conda-forge pytz 2022.1 pyhd8ed1ab_0 conda-forge pyyaml 6.0 py38h0a891b7_4 conda-forge pyzmq 22.3.0 py38hfc09fa9_2 conda-forge rapidjson 1.1.0 he1b5a44_1002 conda-forge re2 2022.02.01 h9c3ff4c_0 conda-forge readline 8.1 h46c0cb4_0 conda-forge recommonmark 0.7.1 pyhd8ed1ab_0 conda-forge regex 2022.3.15 py38h0a891b7_1 conda-forge requests 2.27.1 pyhd8ed1ab_0 conda-forge rhash 1.4.1 h7f98852_0 conda-forge rmm 22.06.00a220407 cuda11_py38_g921d286_22 rapidsai-nightly s2n 1.0.10 h9b69904_0 conda-forge sacremoses 0.0.49 pyhd8ed1ab_0 conda-forge send2trash 1.8.0 pyhd8ed1ab_0 conda-forge setuptools 62.0.0 py38h578d9bd_0 conda-forge six 1.16.0 pyh6c4a22f_0 conda-forge sleef 3.5.1 h9b69904_2 conda-forge snappy 1.1.8 he1b5a44_3 conda-forge snowballstemmer 2.2.0 pyhd8ed1ab_0 conda-forge sortedcontainers 2.4.0 pyhd8ed1ab_0 conda-forge soupsieve 2.3.1 pyhd8ed1ab_0 conda-forge spdlog 1.8.5 h4bd325d_1 conda-forge sphinx 4.5.0 pyh6c4a22f_0 conda-forge sphinx-copybutton 0.5.0 pyhd8ed1ab_0 conda-forge sphinx-markdown-tables 0.0.15 pyhd3deb0d_0 conda-forge sphinxcontrib-applehelp 1.0.2 py_0 conda-forge sphinxcontrib-devhelp 1.0.2 py_0 conda-forge sphinxcontrib-htmlhelp 2.0.0 pyhd8ed1ab_0 conda-forge sphinxcontrib-jsmath 1.0.1 py_0 conda-forge sphinxcontrib-qthelp 1.0.3 py_0 conda-forge sphinxcontrib-serializinghtml 1.1.5 pyhd8ed1ab_1 conda-forge sphinxcontrib-websupport 1.2.4 pyhd8ed1ab_1 conda-forge sqlite 3.37.1 h4ff8645_0 conda-forge stack_data 0.2.0 pyhd8ed1ab_0 conda-forge streamz 0.6.3 pyh6c4a22f_0 conda-forge sysroot_linux-64 2.12 he073ed8_15 conda-forge tbb 2021.5.0 h924138e_1 conda-forge tblib 1.7.0 pyhd8ed1ab_0 conda-forge terminado 0.13.3 py38h578d9bd_1 conda-forge testpath 0.6.0 pyhd8ed1ab_0 conda-forge tk 8.6.12 h27826a3_0 conda-forge tokenizers 0.10.3 py38hb63a372_1 conda-forge toml 0.10.2 pyhd8ed1ab_0 conda-forge tomli 2.0.1 pyhd8ed1ab_0 conda-forge toolz 0.11.2 pyhd8ed1ab_0 conda-forge tornado 6.1 py38h0a891b7_3 conda-forge tqdm 4.64.0 pyhd8ed1ab_0 conda-forge traitlets 5.1.1 pyhd8ed1ab_0 conda-forge transformers 4.10.3 pyhd8ed1ab_0 conda-forge typed-ast 1.4.3 py38h497a2fe_1 conda-forge typing-extensions 4.1.1 hd8ed1ab_0 conda-forge typing_extensions 4.1.1 pyha770c72_0 conda-forge ukkonen 1.0.1 py38h1fd1430_1 conda-forge urllib3 1.26.9 pyhd8ed1ab_0 conda-forge virtualenv 20.14.0 py38h578d9bd_1 conda-forge wcwidth 0.2.5 pyh9f0ad1d_2 conda-forge webencodings 0.5.1 py_1 conda-forge wheel 0.37.1 pyhd8ed1ab_0 conda-forge xorg-libxau 1.0.9 h7f98852_0 conda-forge xorg-libxdmcp 1.1.3 h7f98852_0 conda-forge xz 5.2.5 h516909a_1 conda-forge yaml 0.2.5 h7f98852_2 conda-forge zeromq 4.3.4 h9c3ff4c_1 conda-forge zict 2.1.0 pyhd8ed1ab_0 conda-forge zipp 3.8.0 pyhd8ed1ab_0 conda-forge zlib 1.2.11 h166bdaf_1014 conda-forge zstd 1.5.2 ha95c52a_0 conda-forge
Additional context
We have a test coverage for this, it is disable currently:
cudf/python/cudf/cudf/tests/test_csv.py
Lines 1278 to 1286 in fb03c8b
The text was updated successfully, but these errors were encountered:
Our conda environment mentions - pandas>=1.0,<1.4.0dev0 are we moving to/beyond pandas 1.4.0 in cudf 22.06?
- pandas>=1.0,<1.4.0dev0
Sorry, something went wrong.
Relevant pandas commit pandas-dev/pandas@7f06a8a
yes: #10584
update mangle_dupe_cols behavior in csv reader to match pandas 1.4.0 …
e58d049
…behavior (#10749) Fixes #10618 Depends on #10584 Authors: - Karthikeyan (https://github.com/karthikeyann) Approvers: - GALI PREM SAGAR (https://github.com/galipremsagar) - Yunsong Wang (https://github.com/PointKernel) URL: #10749
karthikeyann
Successfully merging a pull request may close this issue.
Describe the bug
Starting
pandas 1.4.0
, there is a change in behavior ofmangle_dupe_cols
incudf.read_csv
. The change in behavior is explained well here: https://pandas.pydata.org/docs/whatsnew/v1.4.0.html#mangle-dupe-cols-in-read-csv-no-longer-renames-unique-columns-conflicting-with-target-namesSteps/Code to reproduce bug
Expected behavior
Environment overview (please complete the following information)
Environment details
Please run and paste the output of the
cudf/print_env.sh
script here, to gather any other relevant environment detailsClick here to see environment details
Additional context
We have a test coverage for this, it is disable currently:
cudf/python/cudf/cudf/tests/test_csv.py
Lines 1278 to 1286 in fb03c8b
The text was updated successfully, but these errors were encountered: