[WIP]Add support for approx parameter with cutoff as pandas dataframes #1

Merged 2 commits on Jan 12, 2021

VibhuJawa
Owner

@VibhuJawa VibhuJawa commented Jan 11, 2021

Add support for approx parameter with cutoff as pandas dataframes

This PR adds support for the approx parameter when the cutoff times are passed as a pandas DataFrame.

See docs page for handling cutoff times.

This is useful for porting workflows that use this parameter, such as the notebook at https://github.com/Featuretools/predict-taxi-trip-duration/blob/master/NYC%20Taxi%203%20-%20Simple%20Featuretools.ipynb, to cuDF.

We can't support a cuDF DataFrame as input for the cutoff times yet, as that requires rapidsai/cudf#7102 to work.
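
For readers unfamiliar with the parameter, the general idea behind approximation can be sketched in plain pandas: cutoff times are rounded down to the start of a fixed window, so rows that fall in the same window can share one set of intermediate values. This is only an illustrative sketch and not the code in this PR or the Featuretools implementation. The function name `bin_cutoff_times_sketch`, the column names, and the 30-minute window are assumptions made for the example.

```python
import pandas as pd


def bin_cutoff_times_sketch(cutoff_df, window, time_col="time"):
    """Return a copy of cutoff_df with cutoff times floored to `window`.

    Hypothetical helper for illustration only; it is not part of Featuretools.
    """
    binned = cutoff_df.copy()
    # Round each cutoff time down to the start of its window.
    binned[time_col] = binned[time_col].dt.floor(window)
    return binned


# Cutoff times supplied as a pandas DataFrame: one row per instance.
cutoffs = pd.DataFrame(
    {
        "trip_id": [1, 2, 3],
        "time": pd.to_datetime(
            ["2016-01-01 00:10", "2016-01-01 00:25", "2016-01-01 00:50"]
        ),
    }
)

binned = bin_cutoff_times_sketch(cutoffs, "30min")

# Three exact cutoff times collapse into two window starts, so an
# approximated calculation would only need two passes instead of three.
n_exact = cutoffs["time"].nunique()
n_binned = binned["time"].nunique()
print(n_exact, n_binned)
```

The trade-off is accuracy for speed: a larger window means fewer distinct cutoff times to compute, at the cost of values that may be slightly stale relative to the exact cutoff.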

@VibhuJawa VibhuJawa merged commit 5dab7dd into cudf_poc Jan 12, 2021
VibhuJawa added a commit that referenced this pull request Jan 12, 2021
* initial experiments with negate

* initial working with agg

* added working agg stats

* working+verified with agg columns

* fixed fillna mistake

* added back comment

* decreased inconsistencies

* 2 percent incorrect left

* Verified correctness across 2000 columns

* Code cleanup

* more cleanup

* Added more primitives

* 10/12 tests working

* working test_add_last_time_indexes

* working with most tests

* removed test_debug.py

* Major code cleanup

* passes most tests

* Create nvidia_featuretools_copyright.md

Adding Nvidia license.

* Update and rename nvidia_featuretools_copyright.md to nvidia_featuretools_license.md

* Unpin graphviz on windows ci tests (alteryx#1296)

* Unpin graphviz on windows ci tests

* Update release_notes.rst

* fixed test failures

* remove unused cudf import from test_dfs_method.py

* add cudf computational backend

* Update latest_dependencies.txt (alteryx#1300)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

* [WIP]Add support for approx parameter  with cutoff as pandas dataframes  (#1)

* working approx implementation with cutoff as a pandas dataframe.

* style fix for comment at line :463

Co-authored-by: Roy Wedge <[email protected]>
Co-authored-by: machineFL <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>