[SPARK-14409][ML][WIP] Add RankingEvaluator #16618
Conversation
Can one of the admins verify this patch?
Please consider this PR WIP. Discussion in JIRA https://issues.apache.org/jira/browse/SPARK-14409
Could you add
I rewrote the ranking metrics from the mllib package as UDFs (as suggested here) with minimal changes to the logic.
The basic direction looks right - I won't have time to review immediately. The Spark 2.2 QA code freeze will happen shortly, so this will wait until the 2.3 dev cycle starts.
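As a hedged illustration (not the PR's actual code), the per-row logic behind one of these metrics, precision@k with binary relevance, can be sketched in plain Scala; the function name and signature are assumptions for the example:

```scala
// Sketch of precision@k with binary relevance, mirroring the logic in
// mllib's RankingMetrics; names here are illustrative, not from the PR.
def precisionAtK(predicted: Seq[Int], actual: Seq[Int], k: Int): Double = {
  val actualSet = actual.toSet // relevant items for this query
  val topK = predicted.take(k) // top-k ranked predictions
  if (actualSet.isEmpty) 0.0
  else topK.count(actualSet.contains).toDouble / k
}
```

In the evaluator this logic would run as a UDF over per-query arrays rather than as a plain function.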
I like where this is going, it could be quite useful for taking advantage of CrossValidation when doing pairwise and listwise ranking in xgboost4j-spark.
val predictionAndLabels: DataFrame = dataset
  .join(topAtk, Seq($(queryCol)), "outer")
  .withColumn("topAtk", coalesce(col("topAtk"), mapToEmptyArray_()))
  .select($(labelCol), "topAtk")
Don't we also need to run an aggregation on the label column, roughly the same as the previous aggregation but using labelCol as the sort key instead of predictionCol?
Currently this generates a row per prediction, whereas ranking tasks should have a row per query. I think the aggregation should be run twice, and the two aggregations should then be joined together on queryCol. That would result in a dataset containing (labels of top k predictions, top k actual labels).
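The suggestion above can be sketched with plain collections standing in for DataFrames; the Row fields and helper names are assumptions for illustration, and in the real evaluator these would be DataFrame aggregations joined on queryCol:

```scala
// Plain-collections sketch of "aggregate top-k twice, then join per query".
case class Row(query: Int, doc: Int, prediction: Double, label: Double)

// Top-k docs per query, ranked by the given score (descending).
def topKBy(rows: Seq[Row], k: Int, score: Row => Double): Map[Int, Seq[Int]] =
  rows.groupBy(_.query).map { case (q, rs) =>
    (q, rs.sortBy(r => -score(r)).take(k).map(_.doc))
  }

// Per query: (docs of top-k predictions, docs of top-k actual labels).
def predictionAndLabels(rows: Seq[Row], k: Int): Map[Int, (Seq[Int], Seq[Int])] = {
  val byPred  = topKBy(rows, k, _.prediction)
  val byLabel = topKBy(rows, k, _.label)
  byPred.map { case (q, p) => (q, (p, byLabel.getOrElse(q, Seq.empty[Int]))) }
}
```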
Yes, I agree. This is currently done in the previous step, when the topAtk DataFrame is calculated (line 101).
Unfortunately this is not compatible with RankingMetrics, which expects the predictionAndLabels format as input. I didn't want to change RankingMetrics in this same PR.
So the predictionAndLabels DataFrame is calculated to reuse the same RankingMetrics logic from mllib (it is now UDF-based, but I didn't touch its logic).
var i = 0
while (i < n) {
  val gain = 1.0 / math.log(i + 2)
  if (i < predicted.length && actualSet.contains(predicted(i))) {
This doesn't seem right: there is no overlap between the calculation of dcg and max_dcg. The question asked here should be whether the label at predicted(i) is "good". When treating the labels as binary relevant/not relevant, I suppose that might use a threshold, but it would be better to move away from a binary dcg and use the full equation from the docblock. I understand, though, that you are not looking to make major updates to the code from mllib, so it would probably be reasonable for someone to fix this in a follow-up.
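For reference, a graded (non-binary) DCG along the lines this comment suggests could look like the following sketch; the relevance map and function names are assumptions for illustration, not code from mllib:

```scala
// Sketch of graded DCG/NDCG: gain comes from each item's label
// (relevance grade) instead of binary set membership.
def dcgAtK(ranked: Seq[Int], relevance: Map[Int, Double], k: Int): Double =
  ranked.take(k).zipWithIndex.map { case (doc, i) =>
    // Discount by log2(i + 2); the mllib code uses natural log, which only
    // scales DCG by a constant factor and cancels out in NDCG.
    relevance.getOrElse(doc, 0.0) / (math.log(i + 2.0) / math.log(2.0))
  }.sum

def ndcgAtK(ranked: Seq[Int], relevance: Map[Int, Double], k: Int): Double = {
  val ideal  = relevance.toSeq.sortBy(-_._2).map(_._1) // best possible order
  val maxDcg = dcgAtK(ideal, relevance, k)
  if (maxDcg == 0.0) 0.0 else dcgAtK(ranked, relevance, k) / maxDcg
}
```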
Yes, this should be fixed in another PR to keep changes isolated. FYI, the original JIRA for this is here.
@daniloascione are you able to update this? I'd like to target this for the 2.3 dev cycle. But can we do the following:
Let's focus on porting things over and getting the API right. Then create follow-up tickets for additional metrics (MPR and any others), as well as for looking into correcting the logic and/or naming of the existing metrics. If you are unable to take it up again, I can help. Thanks!
@daniloascione any update?
@MLnick I'm wondering what the status of this issue is: it seems closed; do you have any plans to pick it up again? I might pick it up, but I'm not sure what's left: moving from the mllib package to ml, and maybe a Python API? Or fixes to ndcg as well?
  }
}, DoubleType)

val R_prime = predictionAndObservations.count()
Shouldn't this be a sum instead of a count?
(I know this is old/closed, but other people might be referring to this code.)
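To illustrate the point, a mean-percentile-rank-style metric normalizes by the sum of relevance weights rather than a row count. This is a hedged sketch with assumed names and input shape, not the PR's code:

```scala
// Sketch: the normalizer is the sum of relevance weights r_ui, not count().
// Each pair is (relevance weight r_ui, percentile rank in [0, 1]).
def meanPercentileRank(pairs: Seq[(Double, Double)]): Double = {
  val num   = pairs.map { case (r, pct) => r * pct }.sum
  val denom = pairs.map(_._1).sum // sum of weights, not pairs.length
  if (denom == 0.0) 0.0 else num / denom
}
```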
What changes were proposed in this pull request?
This patch adds the implementation of a DataFrame-API-based RankingEvaluator to ML (ml.evaluation).
How was this patch tested?
An additional test case has been added.