
Update document scores based on ranker node #2048

Merged · 13 commits merged into master from add_score_to_ranker on Jun 27, 2022

Conversation

@mathislucka (Member) commented Jan 21, 2022

Proposed changes:
This adds a score to Documents coming from a Ranker.

Cross-encoders produce scores, and since a Document already has a score property, that property can be used safely.
The score can be useful to show in a UI or for other purposes.
As nothing in the sorting code is changed, this should be a non-breaking change.
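
As a minimal illustration (a sketch assuming haystack v1's SentenceTransformersRanker and Document APIs; not code from this PR's diff), the score set by the ranker can then be read directly from the returned documents:

```python
from haystack.nodes import SentenceTransformersRanker
from haystack.schema import Document

# Illustrative only: model name and call pattern are assumptions, not part of this PR.
ranker = SentenceTransformersRanker(model_name_or_path="cross-encoder/ms-marco-MiniLM-L-12-v2")

docs = [
    Document(content="Paris is the capital of France."),
    Document(content="Berlin is the capital of Germany."),
]

ranked_docs = ranker.predict(query="What is the capital of France?", documents=docs)
for doc in ranked_docs:
    # With this change, each returned Document carries the cross-encoder score.
    print(doc.score, doc.content)
```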

Status (please check what you already did):

  • First draft (up for discussions & feedback)
  • Final code
  • Added tests
  • Updated documentation

closes #2706

@tholor tholor requested a review from julian-risch February 2, 2022 13:13
@tholor tholor marked this pull request as ready for review February 2, 2022 13:14
@julian-risch (Member) left a comment

Let's add a test case for this new feature, at least one that checks that documents have scores between 0 and 1 in both cases (logits_dim == 1 and logits_dim > 1).
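
A minimal sketch of such a test (covering only the single-logit case; model name and assertions are illustrative, not the tests that were eventually merged; the logits_dim > 1 case is revisited further down the thread):

```python
from haystack.nodes import SentenceTransformersRanker
from haystack.schema import Document


def test_ranker_returns_probability_scores():
    # single-logit cross-encoder; with sigmoid scaling the scores should fall into [0, 1]
    ranker = SentenceTransformersRanker(model_name_or_path="cross-encoder/ms-marco-MiniLM-L-12-v2")
    docs = [
        Document(content="Berlin is the capital of Germany."),
        Document(content="The sun is a star."),
    ]
    results = ranker.predict(query="What is the capital of Germany?", documents=docs)
    for doc in results:
        assert doc.score is not None
        assert 0.0 <= doc.score <= 1.0
```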

@mathislucka (Member, Author)

Hey @julian-risch,

I thought about this a little more and I came up with a few issues:

  1. For logits_dim > 1, the score will not be between 0 and 1 because we are using torch.nn.Identity here. It is done the same way in sentence_transformers here:

https://github.com/UKPLab/sentence-transformers/blob/5f2c41821c722a3171e2c7c55620ed1f97050b16/sentence_transformers/cross_encoder/CrossEncoder.py#L36

And I think that it will actually break things if we apply sigmoid for multiple labels.

Thinking further about this, I'd still like to change this and add an activation_fct parameter like in the sentence_transformers example. For search, it is good when the re-ranker returns a score between 0 and 1, but if we want to use a re-ranker for data augmentation, we actually want the raw scores (at least, this was also used in the GPL paper). So I'd change the behaviour and function signature to match the functionality that we already know from sentence_transformers and add tests for that.
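
A rough sketch of that default, mirroring sentence_transformers' behaviour (the default_activation helper and its placement are hypothetical and only illustrate the idea, not the merged implementation):

```python
import torch
from torch import nn


def default_activation(num_labels: int) -> nn.Module:
    # sentence_transformers' default: sigmoid for a single logit (scores in [0, 1]),
    # identity otherwise (raw scores, e.g. for data augmentation as in GPL)
    return nn.Sigmoid() if num_labels == 1 else nn.Identity()


# hypothetical use on the ranker's logits:
logits = torch.tensor([[2.3], [-0.7]])                      # cross-encoder output, logits_dim == 1
scores = default_activation(num_labels=1)(logits)           # squashed into [0, 1]
raw = default_activation(num_labels=3)(torch.randn(2, 3))   # multi-label logits stay untouched
```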

What do you think?

@julian-risch (Member)

Hi @mathislucka, thanks for bringing this up again.

> And I think that it will actually break things if we apply sigmoid for multiple labels.
In that case, I agree that we can use the raw scores internally. Just keep in mind that, as you wrote, search applications need to be able to show a document relevance score (0 to 1) in the end. 👍

@tstadel (Member) commented Mar 30, 2022

@mathislucka I remember we had a brief discussion on this. Are you still working on that? What's the status?

@julian-risch (Member)

@mathislucka could you please check the status here? Thank you. 🙂

@mathislucka (Member, Author)

Hey @julian-risch and @tstadel ,

I'm not sure about the status. For things like GPL we will need the raw score. For an end user, using sigmoid activation will be the most comprehensible. I'd like to keep the current code as default behaviour but allow the user to pass in a custom callable that is called to transform the score. The problem is that, as @tstadel mentioned, passing in a Callable does not work well with loading nodes from YAML. So I'm not sure what to do here. Any ideas?

@tstadel (Member) commented Apr 28, 2022

@mathislucka @julian-risch how about having just a single activation function self.activation_function and inferring the default from the Transformers model during init by checking self.transformer_model.classifier.out_features? Additionally, we should introduce another init param scale_score_to_probability, analogous to the Retrievers (implemented in #2454), with which we can disable the default sigmoid scaling in the single-dim case.
For custom activation functions, you would be able to set a custom function on ranker.activation_function after calling init. Or, if you want or need to use YAMLs, you could set scale_score_to_probability to False to get the raw scores and pipe the results through a custom node that takes care of the scaling.
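
A sketch of how that could look (the infer_activation helper is hypothetical, the parameter name is taken from this comment rather than from the merged code, and the classifier.out_features access assumes a BERT-style head where classifier is a plain nn.Linear):

```python
from torch import nn
from transformers import AutoModelForSequenceClassification


def infer_activation(transformer_model, scale_score_to_probability: bool = True) -> nn.Module:
    out_features = transformer_model.classifier.out_features  # size of the classification head
    if out_features == 1 and scale_score_to_probability:
        return nn.Sigmoid()   # single-dim case: scale scores into [0, 1]
    return nn.Identity()      # multi-label case, or scaling disabled: raw scores


model = AutoModelForSequenceClassification.from_pretrained("cross-encoder/ms-marco-MiniLM-L-12-v2")
activation_function = infer_activation(model)

# for custom activation functions, overwrite the attribute after init, e.g.:
# ranker.activation_function = nn.Softmax(dim=-1)
```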
What do you think?

@julian-risch (Member)

@mathislucka What do you think of the suggestion by Thomas? Would you have time to work on implementing that or should I do that?

@mathislucka (Member, Author)

I like the idea and I could implement it, but I won't get to it today. So if you want to merge this earlier, I'd be glad if you could take over.

@julian-risch (Member)

@mathislucka it's not urgent, no need to tackle it today. We were just unsure in our sprint planning whether you can continue working on the issue or whether somebody from the core team needs to take over. Looking forward to your implementation of the idea then, and let me know if you need any support. 👍

@mathislucka (Member, Author) commented Jun 8, 2022

Just a quick question for @tstadel: when you say infer from self.transformer_model.classifier.out_features, do you mean something like self.transformer_model.num_labels? I can't find out_features in e.g. BertForSequenceClassification as implemented here: https://github.com/huggingface/transformers/blob/264128cb9dbd83b666666945fd2fea0662135911/src/transformers/models/bert/modeling_bert.py#L1513

@tstadel (Member) commented Jun 8, 2022

> Just a quick question for @tstadel: when you say infer from self.transformer_model.classifier.out_features, do you mean something like self.transformer_model.num_labels? I can't find out_features in e.g. BertForSequenceClassification as implemented here: https://github.com/huggingface/transformers/blob/264128cb9dbd83b666666945fd2fea0662135911/src/transformers/models/bert/modeling_bert.py#L1513

@mathislucka num_labels contains the same information. transformer_model.classifier is a PyTorch nn.Linear module, which in turn has the out_features field. The former seems to be a convenience field within the transformers model; the latter is the raw dimensionality of the classification head. They should both be safe to use.
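
A quick way to see that both fields carry the same value for a BERT-style cross-encoder (illustrative only, not from the PR):

```python
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("cross-encoder/ms-marco-MiniLM-L-12-v2")
print(model.num_labels)                 # convenience field on the transformers model
print(model.classifier.out_features)    # raw output dimensionality of the nn.Linear classification head
assert model.num_labels == model.classifier.out_features
```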

@mathislucka (Member, Author)

I think it might work that way, but I am not sure how to fix this mypy issue.

@julian-risch (Member) left a comment

LGTM! 👍 I fixed the mypy issues and slightly adapted the tests. Let's wait for them to run through, and if all goes well I'll merge afterwards.

@julian-risch julian-risch changed the title ranker should return scores for later usage Update document scores based on ranker node Jun 27, 2022
@julian-risch julian-risch merged commit 8d65bc5 into master Jun 27, 2022
@julian-risch julian-risch deleted the add_score_to_ranker branch June 27, 2022 10:17
Krak91 pushed a commit to Krak91/haystack that referenced this pull request Jul 26, 2022
* ranker should return scores for later usage

* fix wrong tuple order

* adjust ranker scores; add tests

* Update Documentation & Code Style

* fix mypy

* Update Documentation & Code Style

* fix mypy

* Update Documentation & Code Style

* relax ranker test tolerance

* update ranker test score

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Julian Risch <[email protected]>
Labels: topic:metadata, type:feature (New feature or request)
Projects: None yet
Development: successfully merging this pull request may close these issues: Ranker scores shouldn't be discarded
4 participants