feat: add seq2seq forecasting training job #1196

TheMichaelHu · 2022-05-04T22:13:01Z

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

Fixes b/229909845 🦕

Adds a SequenceToSequencePlusForecastingTrainingJob to training jobs. This job has the exact same signature as AutoMLForecastingTrainingJob, but we are creating a separate job in case the two models diverge in the future.

The logic for AutoMLForecastingTrainingJob has been moved to a new abstract base class _ForecastingTrainingJob. The only things that differ between the seq2seq and automl training jobs that extend it are the model_type and training_task_definition.

google/cloud/aiplatform/training_jobs.py

geraint0923 · 2022-05-04T22:56:00Z

Overall LG, just one minor comment.

geraint0923

LGTM, thanks!

google/cloud/aiplatform/training_jobs.py

ivanmkc · 2022-05-19T20:49:12Z

google/cloud/aiplatform/training_jobs.py

-
-        self._optimization_objective = optimization_objective
-        self._additional_experiments = []
-
    def run(


Can you just remove the run override and defer to super?

If we remove this, Sphinx will not generate documentation for the run() function. It can pull the docstring from the parent class (tested with nox -s docs), but I couldn't find a way to make it document the function signature other than by writing it out manually.

Hm, that's quite annoying. Let me ask the SDK team for more solutions.

google/cloud/aiplatform/training_jobs.py

ivanmkc

It looks like you can simplify both AutoMLForecastingTrainingJob and SequenceToSequencePlusForecastingTrainingJob by removing all methods, since they are the exact same as the super's implementation.

We also need to add system tests for these. See tests/system/aiplatform/test_e2e_tabular.py for an example.

ivanmkc · 2022-05-19T21:30:41Z

Regarding my last comment, you can simplify to:



class AutoMLForecastingTrainingJob(_ForecastingTrainingJob):
    _model_type = "AutoML"
    _training_task_definition = schema.training_job.definition.automl_forecasting
    _supported_training_schemas = (schema.training_job.definition.automl_forecasting,)

class SequenceToSequencePlusForecastingTrainingJob(_ForecastingTrainingJob):
    _model_type = "Seq2Seq"
    _training_task_definition = schema.training_job.definition.seq2seq_plus_forecasting
    _supported_training_schemas = (
        schema.training_job.definition.seq2seq_plus_forecasting,
    )

I ran the unit tests and it works fine. Otherwise, everything else looks pretty good.

TheMichaelHu · 2022-05-19T21:34:19Z

Thanks for the review @ivanmkc! qq: how will simplifying the way you suggested impact the documentation? Is there a way to check/ensure that there will be no impact? Like if the code used to generate the docs has an option to document functions from super()?

tests/unit/aiplatform/test_automl_forecasting_training_jobs.py

ivanmkc

LGTM if system tests pass.

Moves AutoMLForecasting training logic into a base class so we can reuse it for other forecasting models. Adds tests for a future seq2seq training job.

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly: - [x] Make sure to open an issue as a [bug/issue](https://github.com/googleapis/python-aiplatform/issues/new/choose) before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea - [x] Ensure the tests and linter pass - [x] Code coverage does not decrease (if any source code was changed) - [x] Appropriate docs were updated (if necessary) Fixes b/229909845 🦕 --- Adds a `SequenceToSequencePlusForecastingTrainingJob` to training jobs. This job has the exact same signature as `AutoMLForecastingTrainingJob`, but we are creating a separate job in case the two models diverge in the future. The logic for `AutoMLForecastingTrainingJob` has been moved to a new abstract base class `_ForecastingTrainingJob`. The only things that differ between the seq2seq and automl training jobs that extend it are the `model_type` and `training_task_definition`.

product-auto-label bot added the size: xl Pull request size is extra large. label May 4, 2022

TheMichaelHu marked this pull request as draft May 4, 2022 22:13

TheMichaelHu requested a review from geraint0923 May 4, 2022 22:16

TheMichaelHu force-pushed the mh-seq2seq branch from 65e3dc6 to 714feb5 Compare May 4, 2022 22:22

geraint0923 reviewed May 4, 2022

View reviewed changes

google/cloud/aiplatform/training_jobs.py Outdated Show resolved Hide resolved

geraint0923 approved these changes May 5, 2022

View reviewed changes

product-auto-label bot added the api: vertex-ai Issues related to the googleapis/python-aiplatform API. label May 5, 2022

sararob added the do not merge Indicates a pull request not ready for merge, due to either quality or timing. label May 5, 2022

TheMichaelHu marked this pull request as ready for review May 6, 2022 17:07

TheMichaelHu added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label May 6, 2022

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label May 6, 2022

TheMichaelHu force-pushed the mh-seq2seq branch 2 times, most recently from 7f16202 to e16cf4e Compare May 9, 2022 14:21

TheMichaelHu requested a review from ivanmkc May 9, 2022 14:39

TheMichaelHu force-pushed the mh-seq2seq branch from a29dce3 to 8f56579 Compare May 9, 2022 15:42

TheMichaelHu added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label May 9, 2022

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label May 9, 2022

TheMichaelHu added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label May 9, 2022

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label May 9, 2022

sararob removed the do not merge Indicates a pull request not ready for merge, due to either quality or timing. label May 10, 2022

TheMichaelHu force-pushed the mh-seq2seq branch 2 times, most recently from 22098ca to 2cf47c9 Compare May 17, 2022 14:17

ivanmkc reviewed May 19, 2022

View reviewed changes

google/cloud/aiplatform/training_jobs.py Outdated Show resolved Hide resolved

ivanmkc reviewed May 19, 2022

View reviewed changes

google/cloud/aiplatform/training_jobs.py Outdated Show resolved Hide resolved

ivanmkc reviewed May 19, 2022

View reviewed changes

google/cloud/aiplatform/training_jobs.py Outdated Show resolved Hide resolved

ivanmkc suggested changes May 19, 2022

View reviewed changes

ivanmkc reviewed May 26, 2022

View reviewed changes

tests/unit/aiplatform/test_automl_forecasting_training_jobs.py Show resolved Hide resolved

ivanmkc reviewed May 26, 2022

View reviewed changes

tests/unit/aiplatform/test_automl_forecasting_training_jobs.py Show resolved Hide resolved

ivanmkc reviewed May 26, 2022

View reviewed changes

tests/unit/aiplatform/test_automl_forecasting_training_jobs.py Show resolved Hide resolved

TheMichaelHu force-pushed the mh-seq2seq branch 2 times, most recently from 32fb0d4 to 7577c31 Compare May 28, 2022 01:14

TheMichaelHu requested a review from ivanmkc May 28, 2022 01:52

ivanmkc approved these changes Jun 2, 2022

View reviewed changes

TheMichaelHu force-pushed the mh-seq2seq branch from 2485b61 to 09f6799 Compare June 3, 2022 05:00

TheMichaelHu added 13 commits June 3, 2022 12:09

Create abstract forecasting training job class.

1c4ac24

Moves AutoMLForecasting training logic into a base class so we can reuse it for other forecasting models. Adds tests for a future seq2seq training job.

Add a seq2seq training job.

26c70a2

fix context window description in _run

7505165

blacken

a1ba2c0

add supported schemas to forecasting base class

f6d3add

Add seq2seq job to init file.

a4e2a33

only keep super methods that need documentation

4e2cfaf

add seq2seq e2e tests

a084a21

foo

4af79c7

blacken

d68e530

fix merge conflict issues

8302756

make e2e test parameterized

6022bba

fix prediction format issue

a44d055

TheMichaelHu force-pushed the mh-seq2seq branch from 6a06b07 to a44d055 Compare June 3, 2022 16:09

TheMichaelHu added the automerge Merge the pull request once unit tests and other checks pass. label Jun 3, 2022

blacken

e5a9caa

gcf-merge-on-green bot merged commit 643d335 into googleapis:main Jun 3, 2022

gcf-merge-on-green bot removed the automerge Merge the pull request once unit tests and other checks pass. label Jun 3, 2022

release-please bot mentioned this pull request Jun 3, 2022

chore(main): release 1.14.0 #1400

Merged

mikelawrence-google mentioned this pull request Sep 22, 2022

feat: Adds the temporal fusion transformer (TFT) forecasting job #1654

Closed

4 tasks

release-please bot mentioned this pull request Jun 8, 2023

chore(main): release 1.24.1 #2196

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add seq2seq forecasting training job #1196

feat: add seq2seq forecasting training job #1196

TheMichaelHu commented May 4, 2022 •

edited

Loading

geraint0923 commented May 4, 2022

geraint0923 left a comment

ivanmkc May 19, 2022

TheMichaelHu May 23, 2022

ivanmkc May 26, 2022

ivanmkc left a comment

ivanmkc commented May 19, 2022

TheMichaelHu commented May 19, 2022 •

edited

Loading

ivanmkc left a comment

feat: add seq2seq forecasting training job #1196

feat: add seq2seq forecasting training job #1196

Conversation

TheMichaelHu commented May 4, 2022 • edited Loading

geraint0923 commented May 4, 2022

geraint0923 left a comment

Choose a reason for hiding this comment

ivanmkc May 19, 2022

Choose a reason for hiding this comment

TheMichaelHu May 23, 2022

Choose a reason for hiding this comment

ivanmkc May 26, 2022

Choose a reason for hiding this comment

ivanmkc left a comment

Choose a reason for hiding this comment

ivanmkc commented May 19, 2022

TheMichaelHu commented May 19, 2022 • edited Loading

ivanmkc left a comment

Choose a reason for hiding this comment

TheMichaelHu commented May 4, 2022 •

edited

Loading

TheMichaelHu commented May 19, 2022 •

edited

Loading