GH-474: model interface #681

alanakbik · 2019-04-23T15:46:31Z

This PR is a first step of non-breaking refactoring for improving the flair.nn.Model and ModelTrainer functionality. More refactorings will follow, in particular of the evaluation and data loading parts (see #563). The idea of doing this refactoring piece-by-piece is to hopefully make it more manageable.

The main idea is to make it possible to use the ModelTrainer over arbitrary models that implement the flair.nn.Model interface, which is currently

the SequenceTagger,
the TextClassifier
and the beta TextRegressor.

The addition of the TextRegressor made it necessary to move evaluation into the downstream tasks, since regression uses different evaluation metrics than classification. In the future, all three models will use entirely different evaluation methods: TextClassifier will probably use scikit-learn, SequenceTagger will go back to using the CoNLL-03 script and TextRegressor also scikit-learn but with other metrics. Logging and plotting of results has also been refactored to deal with different evaluation outputs.

At the same time, we moved loading, saving and checkpointing up to the flair.nn.Model base class, since this is always the same and leads to current code redundancies otherwise.

So, the new flair.nn.Model interface has 5 classes that need to be implemented by a new downstream task model:

forward_loss() A method that takes sentences and produces a loss with autograd for backpropagation. Implementing this method will make it possible to train the downstream task model.
predict() A method that takes sentences and a mini-batch size to do prediction.
evaluate() The new localized evaluation method which may be entirely different depending on the downstream task. Returns an object with evaluation results (though this will likely change in future refactoring steps).
_get_state_dict() Returns the state dictionary of the model. Implementing this enables saving the model and model checkpoints.
_init_model_with_state_dict() A method that creates a model from a state dictionary. Implementing this enables restoring models and checkpoints.

…ression

aakbik added 6 commits April 20, 2019 17:10

GH-474: Model interface for sequence labeling, classification and reg…

9d32043

…ression

GH-474: Model interface for sequence labeling, classification and reg…

a505edf

…ression

GH-474: remove regression-soecific trainer

3d60de5

GH-474: remove pymagnitude Path

dc559f3

GH-474: adapt Plotter to different types of tasks

6e36bfd

GH-474: fix tests for new Plotter

4121596

alanakbik merged commit 81cf1b5 into master Apr 26, 2019

alanakbik deleted the GH-474-model-interface branch May 9, 2019 18:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GH-474: model interface #681

GH-474: model interface #681

alanakbik commented Apr 23, 2019

GH-474: model interface #681

GH-474: model interface #681

Conversation

alanakbik commented Apr 23, 2019