Refactoring AbstractLinearSGDModel and Trainer to extract SGD base classes #134

Craigacp · 2021-04-20T20:18:40Z

Description

This PR extracts out AbstractSGDModel and AbstractSGDTrainer from AbstractLinearSGDModel and AbstractLinearSGDTrainer, and introduces a FeedForwardParameters which has predict and gradient methods. They don't land on Parameters because that's used in the CRF as well, and sequences have a differently shaped input.

AbstractSGDTrainer has a lot of generic parameters, but those are all hidden from users and the concrete subclasses are still typed with just the output type like most of Tribuo. The code is tested by the existing tests, and can still deserialize Tribuo 4.0 models.

Motivation

The recent introduction of AbstractLinearSGDModel/Trainer wasn't quite abstract enough. The SGD package could be used for things beyond linear models like factorization machines. This PR will make it straightforward to subclass AbstractSGDTrainer for a different model class so you don't have to reimplement the training loop.

…Model and Trainer which operate on Parameters. This will allow future non-linear additions to Tribuo's SGD models.

pogren

Introduces AbstractSGDModel which basically moves AbstractLinearSGDModel methods 'predictSingle' and the supporting inner class PredAndActive into this class. Also introduces AbstractSGDTrainer which steals most of its code from AbstractLinearSGDTrainer which is now basically delegates to its super class. Two protected constructors were removed from AbstractLinearSGDTrainer so this is not a backwards compatible change and will affect subclasses of AbstractLinearSGDTrainer (if they exist.)

LinearParameters now implements FeedForwardParameters which defines a 'predict' method whose return type is DenseVector which LinearParameters now implements. Previously LinearParameters implemented Parameters directly and defined its own 'predict' method which returned an SGDVector - so this is not a backwards compatible change and will affect subclasses of LinearParameters (if they exist.) The methods 'predict' and 'gradients' are now annotated with '@OverRide' because they are defined in FeedForwardParameters which also introduces the 'copy' method.

Other than the above noted concerns - this is a straightforward refactoring of the abstract super classes of the SGD model and training code to better share code.

Craigacp · 2021-05-11T15:08:44Z

The AbstractLinearSGDTrainer was introduced after the last release, so that doesn't change the compatibility as the concrete LinearSGDTrainer for Label and Regressor still have the same constructors. We'll note the LinearParameters change in the release notes, but I think it's unlikely to break anyone (that class should probably be final, but we didn't do a thorough job hardening Tribuo wrt this before the initial release).

Craigacp · 2021-05-11T15:08:52Z

Thanks Philip!

Refactoring AbstractLinearSGDModel and Trainer to extract AbstractSGD…

079e496

…Model and Trainer which operate on Parameters. This will allow future non-linear additions to Tribuo's SGD models.

Craigacp added the Oracle employee This PR is from an Oracle employee label Apr 20, 2021

Quieting the logging in the LinearSGD tests.

ee05a80

pogren approved these changes May 11, 2021

View reviewed changes

Craigacp merged commit e901b9c into main May 11, 2021

Craigacp deleted the yet-another-sgd-refactor branch May 11, 2021 15:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactoring AbstractLinearSGDModel and Trainer to extract SGD base classes #134

Refactoring AbstractLinearSGDModel and Trainer to extract SGD base classes #134

Craigacp commented Apr 20, 2021

pogren left a comment

Craigacp commented May 11, 2021

Craigacp commented May 11, 2021

Refactoring AbstractLinearSGDModel and Trainer to extract SGD base classes #134

Refactoring AbstractLinearSGDModel and Trainer to extract SGD base classes #134

Conversation

Craigacp commented Apr 20, 2021

Description

Motivation

pogren left a comment

Choose a reason for hiding this comment

Craigacp commented May 11, 2021

Craigacp commented May 11, 2021