Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update split_data to call split_multiseries_data #4312

Merged
merged 4 commits into from
Sep 19, 2023

Conversation

eccabay
Copy link
Contributor

@eccabay eccabay commented Sep 18, 2023

Closes #4311

@codecov
Copy link

codecov bot commented Sep 18, 2023

Codecov Report

Patch coverage: 100.0% and project coverage change: +0.1% 🎉

Comparison is base (90b0e5c) 99.7% compared to head (4f80686) 99.7%.

Additional details and impacted files
@@           Coverage Diff           @@
##            main   #4312     +/-   ##
=======================================
+ Coverage   99.7%   99.7%   +0.1%     
=======================================
  Files        357     357             
  Lines      39739   39767     +28     
=======================================
+ Hits       39619   39647     +28     
  Misses       120     120             
Files Changed Coverage Δ
evalml/preprocessing/utils.py 100.0% <100.0%> (ø)
...valml/tests/preprocessing_tests/test_split_data.py 100.0% <100.0%> (ø)

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@eccabay eccabay marked this pull request as ready for review September 18, 2023 21:07
Copy link
Collaborator

@jeremyliweishih jeremyliweishih left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some quick comments but otherwise LGTM

evalml/preprocessing/utils.py Show resolved Hide resolved
@@ -29,6 +30,8 @@ def test_split_data(
X, y = X_y_regression
problem_configuration = None
if is_time_series(problem_type):
if is_multiseries(problem_type):
pytest.skip("Multiseries time series is tested separately")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Instead of skipping could we mock out the multi series split and asset that it's called?

@eccabay eccabay enabled auto-merge (squash) September 19, 2023 17:10
@eccabay eccabay merged commit cf6bc94 into main Sep 19, 2023
@eccabay eccabay deleted the 4311_split_data_calls_ms branch September 19, 2023 17:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Holdout data splitter creates NaN values with multiseries datasets
4 participants