Support for custom architectures for the TextClassifier class #604

tkon3 · 2019-03-08T18:15:45Z

Hello,
I didn't find a way to use a custom architecture to classify texts so I guess it is not implemented yet.
Currently the TextClassifier class only uses a simple linear layer (nn.Linear) in order to do the task (+sigmoid).

Is allowing custom pytorch layers planned in the futur ?
Something allowing us to specify a custom architecture to the TextClassifier class with torch Sequential or a list :

custom_model = [nn.Linear(embedding_length,50), 
                nn.functional.tanh(nn.Linear(50,25)), 
                nn.Linear(50,len(label_dictionary))
               ]

The text was updated successfully, but these errors were encountered:

NielsRogge · 2019-03-11T13:54:04Z

Sorry I can't answer your question, but where did you find that the TextClassifier only uses a simple linear layer?

alanakbik · 2019-03-12T22:08:05Z

Hello @tkon3 this is a really good question. We are currently looking into refactoring some parts of Flair such as the flair.nn.model interface and ModelTrainer classes to make it easier for people to define their own architectures and tasks (see #563).

However at this point we don't yet know how exactly this will look like. And since many of us are on vacation until beginning of April, real development of these ideas will begin only then. But yes, generally we want to make it possible for users to do this and perhaps your idea with passing a list of layers could work. Hope this answers the question somewhat - as the ideas develop, we'll keep you posted!

alanakbik · 2019-03-12T22:14:41Z

Hi @NielsRogge the TextClassifier itself is indeed simply a linear layer on top of whatever DocumentEmbeddings you use. See the forward method of the TextClassifier, which basically takes the embeddings and puts them through the 'decoder':

https://github.com/zalandoresearch/flair/blob/ccb2ffb4080550d25e1e067e2e82e661c8b81411/flair/models/text_classification_model.py#L59

The self.decoder is simply a linear layer, see:

https://github.com/zalandoresearch/flair/blob/ccb2ffb4080550d25e1e067e2e82e661c8b81411/flair/models/text_classification_model.py#L38

However, this does not mean that the classifier is only a linear layer, since the choice of DocumentEmbeddings impacts how the architecture looks like in the end. For instance, most configurations use DocumentRNNEmbeddings which use an RNN over the words in the text to produce an embedding that is then used in the linear layer to make a prediction. Since this RNN is trained on the task, this means that your final archtecture will be word embeddings -> RNN -> linear layer.

stale · 2020-04-30T02:53:29Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale bot added the wontfix This will not be worked on label Apr 30, 2020

stale bot closed this as completed May 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for custom architectures for the TextClassifier class #604

Support for custom architectures for the TextClassifier class #604

tkon3 commented Mar 8, 2019

NielsRogge commented Mar 11, 2019

alanakbik commented Mar 12, 2019

alanakbik commented Mar 12, 2019

stale bot commented Apr 30, 2020

Support for custom architectures for the TextClassifier class #604

Support for custom architectures for the TextClassifier class #604

Comments

tkon3 commented Mar 8, 2019

NielsRogge commented Mar 11, 2019

alanakbik commented Mar 12, 2019

alanakbik commented Mar 12, 2019

stale bot commented Apr 30, 2020