Adds a multi-label linear SGD classifier #106
Conversation
…to make the tests pass.
- Adds the gradient and loss to BinaryCrossEntropy (formerly Sigmoid).
- Adds the loss to Hinge, and refactors the gradient computation.
- Adds a reduction method to the SGDVector interface.
(See the loss/gradient sketch after these commits.)
…ns when performing operations with dense arguments.
… in preparation for sharing code between the trainers.
… class. Note this commit changes the serialization format for all the models. Compatibility with the 4.0 serialised models will be restored later.
…earSGDModel. Adding a test for 4.0 models to the linear SGD package in classification and regression.
…bclasses. Suppressing the unchecked array creation warning, since we know it's safe.
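As a rough illustration of the loss and gradient mentioned in the commits above, here is a minimal binary cross-entropy sketch over multi-label outputs. It uses plain double arrays rather than Tribuo's SGDVector, the class and method names are illustrative rather than the PR's actual code, and the real objective may differ in sign convention (e.g. negating the gradient for gradient-ascent-style optimisers).

```java
// Minimal sketch of a binary cross-entropy loss and gradient for one
// multi-label example. Plain double arrays stand in for SGDVector; these
// names are illustrative, not Tribuo's API.
public final class BinaryCrossEntropySketch {

    private static double sigmoid(double x) {
        return 1.0 / (1.0 + Math.exp(-x));
    }

    /** Summed binary cross-entropy loss over all output dimensions (labels are 0/1). */
    public static double loss(double[] labels, double[] scores) {
        double loss = 0.0;
        for (int i = 0; i < scores.length; i++) {
            double p = sigmoid(scores[i]);
            // Clamp to avoid log(0) when predictions saturate.
            p = Math.min(Math.max(p, 1e-15), 1.0 - 1e-15);
            loss -= labels[i] * Math.log(p) + (1.0 - labels[i]) * Math.log(1.0 - p);
        }
        return loss;
    }

    /** Gradient of the loss with respect to each score: sigmoid(score) - label. */
    public static double[] gradient(double[] labels, double[] scores) {
        double[] grad = new double[scores.length];
        for (int i = 0; i < scores.length; i++) {
            grad[i] = sigmoid(scores[i]) - labels[i];
        }
        return grad;
    }

    public static void main(String[] args) {
        double[] labels = {1.0, 0.0, 1.0};
        double[] scores = {2.0, -1.0, 0.5};
        System.out.println("loss = " + loss(labels, scores));
        System.out.println("grad[0] = " + gradient(labels, scores)[0]);
    }
}
```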
Classification/SGD/src/main/java/org/tribuo/classification/sgd/objectives/Hinge.java
Just a few small changes here and there and it should be fine.
See the attached changes.
Common/SGD/src/main/java/org/tribuo/common/sgd/AbstractLinearSGDTrainer.java
/**
 * Copies the supplied matrix.
 * @param other The matrix to copy.
 */
Just curious why we're not using arraycopy in this constructor?
Because it's not final: ShrinkingMatrix and AdaGradRDAMatrix both subclass DenseMatrix and override the get method to apply a transformation as part of the regularisation during training. I could check whether it's exactly a DenseMatrix rather than a subclass and use arraycopy in that case, falling back to this code otherwise, but it doesn't seem worth it at the moment.
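To make the trade-off concrete, here is a simplified sketch (not Tribuo's DenseMatrix) of a copy constructor that reads through get(): a subclass that overrides get, the way ShrinkingMatrix and AdaGradRDAMatrix do, has its transformation baked into the copy, whereas a System.arraycopy of the backing array would silently bypass it.

```java
// Simplified stand-ins for the matrix classes discussed above; not Tribuo's code.
class SimpleMatrix {
    protected final double[][] values;

    SimpleMatrix(int rows, int cols) {
        this.values = new double[rows][cols];
    }

    /**
     * Copy constructor. Reading through get() means subclass overrides are
     * honoured; System.arraycopy(other.values[i], ...) would copy the raw
     * backing array and skip any transformation the subclass applies.
     */
    SimpleMatrix(SimpleMatrix other) {
        this.values = new double[other.values.length][other.values[0].length];
        for (int i = 0; i < values.length; i++) {
            for (int j = 0; j < values[i].length; j++) {
                values[i][j] = other.get(i, j);
            }
        }
    }

    double get(int i, int j) {
        return values[i][j];
    }
}

// A hypothetical subclass in the spirit of ShrinkingMatrix: get() applies a
// scaling factor, so a raw arraycopy would produce the wrong copied values.
class ScaledMatrix extends SimpleMatrix {
    private final double scale;

    ScaledMatrix(int rows, int cols, double scale) {
        super(rows, cols);
        this.scale = scale;
    }

    @Override
    double get(int i, int j) {
        return scale * super.get(i, j);
    }
}
```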
MultiLabel/SGD/src/main/java/org/tribuo/multilabel/sgd/linear/LinearSGDTrainer.java
Common/SGD/src/main/java/org/tribuo/common/sgd/AbstractLinearSGDTrainer.java
@eelstretching any thoughts on using the Pair vs the concrete class to return the gradient and the loss?
The changes look good. I think returning the Pair is the correct decision.
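For reference, here is a rough sketch of the Pair-returning shape, using a hinge loss as the example: a single call hands back both the loss and the gradient so the trainer doesn't recompute the score. The nested Pair class is a minimal stand-in for the OLCUT Pair, and the method name and signature are illustrative rather than the PR's actual code.

```java
// Sketch of returning the loss and gradient together from one call.
public final class LossAndGradientSketch {

    /** Minimal immutable pair, standing in for the OLCUT Pair used in Tribuo. */
    static final class Pair<A, B> {
        final A first;
        final B second;
        Pair(A first, B second) {
            this.first = first;
            this.second = second;
        }
    }

    /** Hinge loss and gradient w.r.t. the weights for one example with label +/-1. */
    static Pair<Double, double[]> hingeLossAndGradient(double label, double[] features, double[] weights) {
        double score = 0.0;
        for (int i = 0; i < features.length; i++) {
            score += features[i] * weights[i];
        }
        double margin = label * score;
        if (margin >= 1.0) {
            // Correctly classified with margin: zero loss and zero gradient.
            return new Pair<>(0.0, new double[features.length]);
        }
        double[] grad = new double[features.length];
        for (int i = 0; i < features.length; i++) {
            grad[i] = -label * features[i];
        }
        return new Pair<>(1.0 - margin, grad);
    }

    public static void main(String[] args) {
        double[] x = {1.0, -0.5, 2.0};
        double[] w = {0.1, 0.2, -0.3};
        Pair<Double, double[]> out = hingeLossAndGradient(1.0, x, w);
        System.out.println("loss = " + out.first + ", grad[0] = " + out.second[0]);
    }
}
```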
I removed the …
LGTM!
Description
This PR adds a native multi-label linear SGD model, refactors all the linear SGD models to share a base class, and adds additional support to the math library for dense vectors (in preparation for further optimizations to the linear SGD models). It also switches the vector normalizers over to an in-place normalization method to reduce allocations in the inner loop of training and inference.
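As a sketch of the in-place normalization change (method names are illustrative, not necessarily Tribuo's exact API): instead of allocating a fresh output array per example, the in-place variant overwrites the caller's buffer, which removes an allocation from the hot loop.

```java
// Sketch contrasting an allocating normalizer with an in-place one.
// Assumes non-negative inputs with a non-zero sum; names are illustrative.
public final class InPlaceNormalizerSketch {

    /** Allocating version: creates a new array on every call. */
    public static double[] normalize(double[] input) {
        double sum = 0.0;
        for (double v : input) {
            sum += v;
        }
        double[] output = new double[input.length];
        for (int i = 0; i < input.length; i++) {
            output[i] = input[i] / sum;
        }
        return output;
    }

    /** In-place version: reuses the caller's buffer, avoiding the per-call allocation. */
    public static void normalizeInPlace(double[] input) {
        double sum = 0.0;
        for (double v : input) {
            sum += v;
        }
        for (int i = 0; i < input.length; i++) {
            input[i] /= sum;
        }
    }

    public static void main(String[] args) {
        double[] scores = {1.0, 3.0, 6.0};
        normalizeInPlace(scores);
        System.out.println(scores[0] + ", " + scores[1] + ", " + scores[2]); // 0.1, 0.3, 0.6
    }
}
```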
Motivation
The multi-label package includes a wrapper that converts any classifier into a multi-label classifier, but this wrapper is too slow for large applications. The multi-label linear SGD classifier in this PR is roughly an order of magnitude faster when run on a large text corpus. It also prepares the linear models package for further work introducing vectorisation, L2 regularisation and other improvements.