[DataFrame] batch dataframe transform fails to start after it stopped at runtime #45339

Closed
hendrikmuhs opened this issue Aug 8, 2019 · 1 comment · Fixed by #45340

hendrikmuhs commented Aug 8, 2019

Repro:

  • create a batch data frame transform over enough source data that it does not finish immediately
  • start it
  • stop it before it finishes; progress should be strictly between 0 and 1
  • try to start it again

-> the second start fails with an NPE, see the stack trace below
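
The repro steps above can be sketched as a sequence of REST calls. This is only an illustration under assumptions: it assumes the 7.3-era `_data_frame/transforms` endpoints, reuses the transform id `fq1405` from the log below, and leaves out the body of the create request because the transform config is not part of this report. The `_stats` call is an optional extra for checking that progress is partial before stopping. `ReproSteps` and `requests` are hypothetical names used only for this sketch.

```java
import java.util.List;

public class ReproSteps {

    // Assumption: base path of the 7.3-era data frame transform API.
    static final String BASE = "/_data_frame/transforms/";

    /** Returns the ordered REST calls (method + path) that mirror the repro steps. */
    static List<String> requests(String transformId) {
        String t = BASE + transformId;
        return List.of(
            "PUT " + t,              // create the batch transform (request body omitted here)
            "POST " + t + "/_start", // start it
            "GET " + t + "/_stats",  // optional: confirm progress is partial
            "POST " + t + "/_stop",  // stop it before it finishes
            "POST " + t + "/_start"  // start it again; this is the call reported to fail
        );
    }

    public static void main(String[] args) {
        // Print the plan; sending the requests is left to the HTTP client of your choice.
        requests("fq1405").forEach(System.out::println);
    }
}
```

Running `main` only prints the request plan. It does not contact a cluster.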

This is a regression introduced in #44219.

The issue also applies to continuous data frame transforms, but only during the bootstrap phase, until checkpoint 1 has been reached.

Stack trace from 8.0; the issue also applies to 7.3:

[2019-08-08T14:45:39,802][INFO ][o.e.x.d.t.DataFrameTransformTask] [xyz] Updating state for data frame transform [fq1405] to [{"task_state":"started","indexer_state":"stopped","position":{"indexer_position":{"@timestamp":1549675260000,"airline":"KLM"}},"checkpoint":0,"progress":{"total_docs":86274,"docs_remaining":50410,"percent_complete":41.56988200384821}}]
[2019-08-08T14:45:39,878][INFO ][o.e.x.d.t.DataFrameTransformPersistentTasksExecutor] [xyz] Successfully completed and scheduled task in node operation
[2019-08-08T14:45:39,887][WARN ][o.e.x.d.t.DataFrameTransformTask] [xyzl] Data frame transform [fq1405] encountered an exception:
java.lang.NullPointerException: null
        at org.elasticsearch.xpack.dataframe.transforms.DataFrameIndexer.determineRunStateAtStart(DataFrameIndexer.java:488) ~[data-frame-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT]
        at org.elasticsearch.xpack.dataframe.transforms.DataFrameIndexer.onStart(DataFrameIndexer.java:159) [data-frame-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT]
        at org.elasticsearch.xpack.dataframe.transforms.DataFrameTransformTask$ClientDataFrameIndexer.lambda$onStart$3(DataFrameTransformTask.java:633) [data-frame-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT]
        at org.elasticsearch.action.ActionListener$1.onResponse(ActionListener.java:62) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT]
        at org.elasticsearch.xpack.dataframe.transforms.DataFrameTransformTask$ClientDataFrameIndexer.onStart(DataFrameTransformTask.java:663) [data-frame-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT]

@elasticmachine (Collaborator)

Pinging @elastic/ml-core

hendrikmuhs pushed a commit that referenced this issue Aug 9, 2019
…ime (#45340)

fix loading of next checkpoint after data frame transform has been stopped/started within one run

closes #45339
hendrikmuhs pushed a commit to hendrikmuhs/elasticsearch that referenced this issue Aug 9, 2019
…ime (elastic#45340)

fix loading of next checkpoint after data frame transform has been stopped/started within one run

closes elastic#45339
hendrikmuhs pushed a commit that referenced this issue Aug 9, 2019
…ime (#45340) (#45381)

fix loading of next checkpoint after data frame transform has been stopped/started within one run

closes #45339
hendrikmuhs pushed a commit that referenced this issue Aug 9, 2019
…ime (#45340) (#45380)

fix loading of next checkpoint after data frame transform has been stopped/started within one run

closes #45339