Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Python/Cython bindings for multibyte_split #8998

Merged
merged 88 commits into from
Sep 17, 2021

Conversation

jdye64
Copy link
Contributor

@jdye64 jdye64 commented Aug 9, 2021

Provides the Python/Cython bindings for #8702 multibyte_split. This PR depends on #8702 being merged first.

Closes #8557

cwharris added 30 commits June 26, 2021 21:59
@codecov
Copy link

codecov bot commented Sep 3, 2021

Codecov Report

❗ No coverage uploaded for pull request base (branch-21.10@eb09d14). Click here to learn what that means.
The diff coverage is n/a.

❗ Current head 799b7fb differs from pull request most recent head 6a06a5d. Consider uploading reports for the commit 6a06a5d to get more accurate results
Impacted file tree graph

@@               Coverage Diff               @@
##             branch-21.10    #8998   +/-   ##
===============================================
  Coverage                ?   10.85%           
===============================================
  Files                   ?      116           
  Lines                   ?    19168           
  Branches                ?        0           
===============================================
  Hits                    ?     2080           
  Misses                  ?    17088           
  Partials                ?        0           

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update eb09d14...6a06a5d. Read the comment docs.

Copy link
Contributor

@vyasr vyasr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add a couple of tests? Some of my changes are necessary for the code to work, and the API question I raised would be easier to discuss in the context of how this API would be called.

python/cudf/cudf/_lib/cpp/io/text.pxd Outdated Show resolved Hide resolved
python/cudf/cudf/_lib/cpp/io/text.pxd Outdated Show resolved Hide resolved
python/cudf/cudf/_lib/cpp/io/text.pxd Outdated Show resolved Hide resolved
python/cudf/cudf/_lib/cpp/io/text.pxd Outdated Show resolved Hide resolved
python/cudf/cudf/_lib/cpp/io/text.pxd Outdated Show resolved Hide resolved
python/cudf/cudf/_lib/text.pyx Outdated Show resolved Hide resolved
python/cudf/cudf/_lib/text.pyx Outdated Show resolved Hide resolved
python/cudf/cudf/io/text.py Outdated Show resolved Hide resolved
python/cudf/cudf/utils/ioutils.py Show resolved Hide resolved
Copy link
Contributor

@vyasr vyasr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, just some minor nits to improve Python style.

python/cudf/cudf/_lib/text.pyx Outdated Show resolved Hide resolved
python/cudf/cudf/io/text.py Outdated Show resolved Hide resolved
python/cudf/cudf/io/text.py Outdated Show resolved Hide resolved
python/cudf/cudf/tests/test_text.py Outdated Show resolved Hide resolved
python/cudf/cudf/tests/test_text.py Outdated Show resolved Hide resolved
python/cudf/cudf/utils/ioutils.py Show resolved Hide resolved
python/cudf/cudf/tests/test_text.py Outdated Show resolved Hide resolved
@vuule vuule requested a review from vyasr September 15, 2021 23:40
@jdye64
Copy link
Contributor Author

jdye64 commented Sep 16, 2021

rerun tests

@jdye64
Copy link
Contributor Author

jdye64 commented Sep 16, 2021

Keep getting git timeouts .... trying again. rerun tests

@jdye64
Copy link
Contributor Author

jdye64 commented Sep 16, 2021

rerun tests

@jdye64
Copy link
Contributor Author

jdye64 commented Sep 17, 2021

Github checkout of dask returned 429 code. Trying again

@jdye64
Copy link
Contributor Author

jdye64 commented Sep 17, 2021

rerun tests

Copy link
Contributor

@vyasr vyasr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@galipremsagar galipremsagar added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 4 - Needs cuDF (Python) Reviewer labels Sep 17, 2021
@vuule
Copy link
Contributor

vuule commented Sep 17, 2021

@gpucibot merge

@rapids-bot rapids-bot bot merged commit 11781e8 into rapidsai:branch-21.10 Sep 17, 2021
@jdye64 jdye64 deleted the read_text_pyx_only branch September 18, 2021 13:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
5 - Ready to Merge Testing and reviews complete, ready to merge feature request New feature or request non-breaking Non-breaking change Python Affects Python cuDF API.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[FEA] Support multi-character newline delimiters with a read_text API
8 participants