Adding a Deep Nearest Class Means Classifier model to Flair #3532

sheldon-roberts · 2024-08-19T00:27:53Z

This PR adds a DeepNCMClassifier to flair.models
My reasons for adding this model are outlined in the issue: #3531

This model requires a TrainerPlugin because it makes the prototype updates using an after_training_batch hook. Please let me know if there is a cleaner way to handle this.

Example Script:

from flair.data import Corpus
from flair.datasets import TREC_50
from flair.embeddings import TransformerDocumentEmbeddings
from flair.models import DeepNCMClassifier
from flair.trainers import ModelTrainer
from flair.trainers.plugins import DeepNCMPlugin

# load the TREC dataset
corpus: Corpus = TREC_50()

# make a transformer document embedding
document_embeddings = TransformerDocumentEmbeddings("roberta-base", fine_tune=True)

# create the classifier
classifier = DeepNCMClassifier(
    document_embeddings,
    label_dictionary=corpus.make_label_dictionary(label_type="class"),
    label_type="class",
    use_encoder=False,
    mean_update_method="condensation",
)

# initialize the trainer
trainer = ModelTrainer(classifier, corpus)

# train the model
trainer.fine_tune(
    "resources/taggers/deepncm_trec",
    plugins=[DeepNCMPlugin()],
)

plonerma · 2024-08-19T14:42:44Z

Hello @sheldon-roberts,

Thanks a lot for your contribution! This is had been buried deep in the backlog of things to implement.

I also don't see a way of how this could be implemented without a TrainerPlugin.

What do you think about implementing this as a decoder (such as the PrototypicalDecoder), such that it can be used with the default classifier? Then it could be used with all model types (i.e. span, text, etc. classification).

Additionally, what do you think about supporting the different distance functions similar to the PrototypicalDecoder?

sheldon-roberts · 2024-08-20T04:31:17Z

Hi @plonerma, Thanks for taking a look!

What do you think about implementing this as a decoder (such as the PrototypicalDecoder), such that it can be used with the default classifier? Then it could be used with all model types (i.e. span, text, etc. classification).
Additionally, what do you think about supporting the different distance functions similar to the PrototypicalDecoder?

I really like both of these ideas! I will look into making these changes soon

MattGPT-ai · 2024-08-28T22:16:40Z

Hello @sheldon-roberts,

Thanks a lot for your contribution! This is had been buried deep in the backlog of things to implement.

I also don't see a way of how this could be implemented without a TrainerPlugin.

What do you think about implementing this as a decoder (such as the PrototypicalDecoder), such that it can be used with the default classifier? Then it could be used with all model types (i.e. span, text, etc. classification).

Additionally, what do you think about supporting the different distance functions similar to the PrototypicalDecoder?

In order to avoid using a trainer plugin, could we just add a function like def after_training_epoch(): pass that gets added to the base Model class, which gets called right before or after self.dispatch("after_training_epoch", epoch=epoch) in the train_custom function?

I think this would work with this being a class, but might not work when it gets changed to a decoder.

MattGPT-ai · 2024-11-20T20:06:05Z

I am currently working on converting this class to a simpler decoder. I have gotten it to work, but it requires some changes to other classes; the label tensors have to be provided to the forward passes so they can go into the decoder call. Specifically, in DefaultClassifier.forward_loss, you need to have scores = self.decoder(data_point_tensor, label_tensor). In predict, this isn't necessary because you don't need to calculate the proto updates.

Would it make sense to always pass in this in, but just have most base cases ignore the parameter? Another alternative would be to have the class set self.label_tensor before the call so it doesn't need to be an input param at all. Not sure if anyone else has a suggestion of how to design this. I will be pushing up the specific code soon, but just looking for opinions.

MattGPT-ai · 2024-11-24T00:15:04Z

This has

Hello @sheldon-roberts,

Thanks a lot for your contribution! This is had been buried deep in the backlog of things to implement.

I also don't see a way of how this could be implemented without a TrainerPlugin.

What do you think about implementing this as a decoder (such as the PrototypicalDecoder), such that it can be used with the default classifier? Then it could be used with all model types (i.e. span, text, etc. classification).

Additionally, what do you think about supporting the different distance functions similar to the PrototypicalDecoder?

This has been updated to be a decoder. It's overall a lot less code and simpler, although it required some small changes to the DefaultClassifier class, and still requires a plugin. Am definitely open to any suggestion of how to better integrate this.

MattGPT-ai · 2024-11-24T02:32:39Z

Looks like tests are passing except for a couple of MyPy checks that aren't directly related to the changes in the PR, I think just files that this PR touches. Do you have any suggestions for fixing these typing problems?

MattGPT-ai · 2024-11-25T21:56:51Z

Would it be better to move this class into flair/nn/decoder.py now that it is a decoder?

MattGPT-ai · 2024-12-17T18:10:18Z

@plonerma Are you able to re-review this before the next release?

Add tests for DeepNCMClassifier Remove old test Add multi label support Add type hints and doc strings

…ifferent model types. make small changes to DefaultClassifier forward_loss to pass label tensor when needed. update tests

MattGPT-ai · 2024-12-18T20:57:37Z

I've moved this to decoder.py to be more consistent with other decoders

The specified return types were overly resetrictive (e.g. did not include sequence labelling models)

plonerma · 2025-01-03T14:47:10Z

@sheldon-roberts and @MattGPT-ai : Thanks a lot for your collaborative effort!

I made a few minor changes and merged the current master branch into the PR. Now, all checks are passed.

MattGPT-ai · 2025-01-03T23:10:03Z

This looks good now, can we merge?

alanakbik · 2025-01-07T12:25:32Z

Thanks a lot for adding this @MattGPT-ai and for reviewing @plonerma!

alanakbik assigned plonerma Aug 19, 2024

jeffpicard mentioned this pull request Aug 23, 2024

Add OneClassClassifier model #3501

Open

MattGPT-ai force-pushed the deepncm-classifier branch 3 times, most recently from c92f501 to b19e700 Compare November 24, 2024 00:12

MattGPT-ai force-pushed the deepncm-classifier branch 2 times, most recently from 088aac0 to f94a56f Compare November 24, 2024 01:59

sheldon-roberts and others added 3 commits December 18, 2024 15:24

Add DeepNCMClassifier model

d34bfd0

Add tests for DeepNCMClassifier Remove old test Add multi label support Add type hints and doc strings

feat: change DeepNCM classifier to a decoder so it can be used with d…

213396c

…ifferent model types. make small changes to DefaultClassifier forward_loss to pass label tensor when needed. update tests

refactor: move DeepNCMDecoder to decoder.py

649e68d

MattGPT-ai force-pushed the deepncm-classifier branch from f94a56f to 649e68d Compare December 18, 2024 20:24

plonerma and others added 5 commits January 3, 2025 14:25

Removed deprecated deepncm classifier file

46379c9

Slightly refactored deepncm trainer plugin

cab5105

Fixed formatting

5ae1508

Merge branch 'master' into deepncm-classifier

ddb7653

Removed predict return types

8398be2

The specified return types were overly resetrictive (e.g. did not include sequence labelling models)

plonerma approved these changes Jan 3, 2025

View reviewed changes

plonerma merged commit 0864b22 into flairNLP:master Jan 7, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding a Deep Nearest Class Means Classifier model to Flair #3532

Adding a Deep Nearest Class Means Classifier model to Flair #3532

sheldon-roberts commented Aug 19, 2024

plonerma commented Aug 19, 2024 •

edited

Loading

sheldon-roberts commented Aug 20, 2024

MattGPT-ai commented Aug 28, 2024

MattGPT-ai commented Nov 20, 2024

MattGPT-ai commented Nov 24, 2024

MattGPT-ai commented Nov 24, 2024

MattGPT-ai commented Nov 25, 2024

MattGPT-ai commented Dec 17, 2024

MattGPT-ai commented Dec 18, 2024

plonerma commented Jan 3, 2025 •

edited

Loading

MattGPT-ai commented Jan 3, 2025

alanakbik commented Jan 7, 2025

Adding a Deep Nearest Class Means Classifier model to Flair #3532

Adding a Deep Nearest Class Means Classifier model to Flair #3532

Conversation

sheldon-roberts commented Aug 19, 2024

plonerma commented Aug 19, 2024 • edited Loading

sheldon-roberts commented Aug 20, 2024

MattGPT-ai commented Aug 28, 2024

MattGPT-ai commented Nov 20, 2024

MattGPT-ai commented Nov 24, 2024

MattGPT-ai commented Nov 24, 2024

MattGPT-ai commented Nov 25, 2024

MattGPT-ai commented Dec 17, 2024

MattGPT-ai commented Dec 18, 2024

plonerma commented Jan 3, 2025 • edited Loading

MattGPT-ai commented Jan 3, 2025

alanakbik commented Jan 7, 2025

plonerma commented Aug 19, 2024 •

edited

Loading

plonerma commented Jan 3, 2025 •

edited

Loading