-
Notifications
You must be signed in to change notification settings - Fork 126
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Loading status checks…
Properly designate model state for actively training models when node…
…s crash or leave cluster (#1317) * Initial implementation Signed-off-by: Ryan Bogan <[email protected]> * Fix compile errors for tests Signed-off-by: Ryan Bogan <[email protected]> * Temporary tests Signed-off-by: Ryan Bogan <[email protected]> * Ensure backwards compatibility and add zombie to model state enum Signed-off-by: Ryan Bogan <[email protected]> * Update current tests Signed-off-by: Ryan Bogan <[email protected]> * Fix current integration tests Signed-off-by: Ryan Bogan <[email protected]> * Fix unit tests with new changes Signed-off-by: Ryan Bogan <[email protected]> * Add unit tests Signed-off-by: Ryan Bogan <[email protected]> * Fix spotless Signed-off-by: Ryan Bogan <[email protected]> * Add changelog entry Signed-off-by: Ryan Bogan <[email protected]> * Delete temporary test file Signed-off-by: Ryan Bogan <[email protected]> * Remove temporary changes to build.gradle Signed-off-by: Ryan Bogan <[email protected]> * Add more backwards compatibility Signed-off-by: Ryan Bogan <[email protected]> * Attempt to fix bwc tests Signed-off-by: Ryan Bogan <[email protected]> * Fix spotless Signed-off-by: Ryan Bogan <[email protected]> * Remove star imports Signed-off-by: Ryan Bogan <[email protected]> * Add another unit test Signed-off-by: Ryan Bogan <[email protected]> * Modify unit test to increase coverage Signed-off-by: Ryan Bogan <[email protected]> * Change unit test to increase coverage Signed-off-by: Ryan Bogan <[email protected]> * Add method description for clusterChanged Signed-off-by: Ryan Bogan <[email protected]> * Address PR feedback Signed-off-by: Ryan Bogan <[email protected]> * Refactor into TrainingJobClusterStateListener Signed-off-by: Ryan Bogan <[email protected]> * Make node assignment final and added in the constructor of TrainingJob Signed-off-by: Ryan Bogan <[email protected]> * Remove clusterService from TrainingJobRunner Signed-off-by: Ryan Bogan <[email protected]> * Address PR Feedback Signed-off-by: Ryan Bogan <[email protected]> * Add flag when node rejoins and check when serializing model Signed-off-by: Ryan Bogan <[email protected]> * Address PR feedback Signed-off-by: Ryan Bogan <[email protected]> * Address PR Feedback Signed-off-by: Ryan Bogan <[email protected]> * Fix spotless Signed-off-by: Ryan Bogan <[email protected]> * Test new version check for StreamInput Signed-off-by: Ryan Bogan <[email protected]> * Remove check to test new method Signed-off-by: Ryan Bogan <[email protected]> * Add version check for stream input/output logic Signed-off-by: Ryan Bogan <[email protected]> * Address PR Feedback Signed-off-by: Ryan Bogan <[email protected]> * Address PR Feedback Signed-off-by: Ryan Bogan <[email protected]> * Address PR Feedback Signed-off-by: Ryan Bogan <[email protected]> * Address PR Feedback Signed-off-by: Ryan Bogan <[email protected]> * Address PR Feedback Signed-off-by: Ryan Bogan <[email protected]> --------- Signed-off-by: Ryan Bogan <[email protected]>
Showing
31 changed files
with
706 additions
and
89 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.