-
Notifications
You must be signed in to change notification settings - Fork 915
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add an Environment Variable for debugging the fast path in cudf.pandas #15837
Add an Environment Variable for debugging the fast path in cudf.pandas #15837
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I had a few questions on how to move forward with this PR.
2dee2bf
to
8a40567
Compare
/ok to test |
/ok to test |
9ca34ff
to
af0d9ce
Compare
/ok to test |
/ok to test |
Just an idea that might avoid the need to monkeypatch. What if if we use a UDF that should always be different between cudf and pandas like: import time
def udf(*args, *kwargs):
return time.time() Then have |
@mroeschke I considered that kind of thing but I wasn't able to come up with a function (or UDF) that we support in cudf that would intentionally deviate from pandas behavior and be deterministic. The only documented difference is the row ordering of joins, and that's nondeterministic: https://docs.rapids.ai/api/cudf/nightly/cudf_pandas/faq/#are-there-any-known-limitations |
I think the results returned are the same. When I tried the udf you suggested before here, I got the same thing. |
Ah thanks both for checking 👍 |
This PR provides documentation for cudf.pandas in the Developer Guide. It will describe the fast-slow proxy wrapping scheme as well as document the `CUDF_PANDAS_DEBUGGING` environment variable created in PR #15837 for issue #14975. Authors: - Matthew Murray (https://github.com/Matt711) Approvers: - Vyas Ramasubramani (https://github.com/vyasr) - Lawrence Mitchell (https://github.com/wence-) URL: #15889
/okay to test |
4637b04
to
4a61868
Compare
/okay to test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the many iterations here. I would like a few clarifying comments about why the explicit monkeypatch undo is needed, otherwise I think this is good to go. Add those comments and you can merge once other reviewers are happy.
/okay to test |
/merge |
Description
Part of #14975 This PR adds a pandas debugging option to
_fast_slow_function_call
that runs the slow path after the fast and returns a warning if the results differ.Checklist