

[FEA] TSNE - Kullback-Leibler Divergence for early stopping #863

Open
danielhanchen opened this issue Jul 19, 2019 · 8 comments
Labels
? - Needs Triage · feature request

Comments

@danielhanchen
Contributor

Currently only the gradient norm is used for early stopping. However, one can compute the actual Kullback-Leibler divergence during the gradient updates, giving a further diagnostic of whether TSNE has reached a stable configuration.
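A minimal CPU sketch of the idea (names are illustrative, not cuML's API; the real version would live inside the CUDA gradient kernels): compute the exact KL each iteration and stop once it plateaus, instead of watching only the gradient norm.

```python
import numpy as np

def kl_divergence(P, Q, eps=1e-12):
    """Exact KL(P || Q) = sum(P * log(P / Q)) over all affinity entries."""
    P = np.maximum(P, eps)
    Q = np.maximum(Q, eps)
    return float(np.sum(P * np.log(P / Q)))

def should_stop(kl_history, tol=1e-5, patience=3):
    """Stop when KL has changed by less than `tol` for `patience`
    consecutive iterations (a hypothetical early-stopping rule)."""
    if len(kl_history) < patience + 1:
        return False
    recent = kl_history[-(patience + 1):]
    deltas = [abs(recent[i + 1] - recent[i]) for i in range(patience)]
    return all(d < tol for d in deltas)
```

The training loop would append the running KL value to `kl_history` after each gradient update and break once `should_stop` returns True.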

danielhanchen added the ? - Needs Triage and feature request labels Jul 19, 2019
@cjnolet
Member

cjnolet commented Jul 19, 2019

Will the new KLDivergence prim help you here?

@danielhanchen
Contributor Author

Oh, so in the naive kernel I already update the KLD loss incrementally; I just haven't put it in the Barnes-Hut version yet. But I'll check the new prim out.

Most likely I'll continue using the naive TSNE version, since no repeated calculations are made and only P*log(P/Q) is needed. I.e. I have P and Q, but they're never stored in a vector; they're discarded during the algorithm. So I need an incremental version.

@teju85
Member

teju85 commented Jul 23, 2019

What do you mean by incremental KLD version?

@danielhanchen
Contributor Author

@teju85 So the KL prim is KL_Prim(P, Q), and it computes KL = sum(P * log(P / Q)) over full vectors.

In TSNE, P and Q are never formed explicitly as vectors; their entries are computed on the fly. Hence the KL divergence I'm using will incrementally add to KL.
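To sketch what "incremental" means here (a Python stand-in for the CUDA kernel; the function name is illustrative): each (p_ij, q_ij) pair is consumed as it is produced and immediately discarded, so the O(n²) P and Q vectors are never materialized.

```python
import math

def incremental_kl(pq_pairs, eps=1e-12):
    """Accumulate KL = sum(p * log(p / q)) one term at a time.

    `pq_pairs` yields (p_ij, q_ij) pairs as the algorithm computes
    them, so no full P or Q vector ever needs to be stored."""
    kl = 0.0
    for p, q in pq_pairs:
        if p > eps:  # terms with p == 0 contribute nothing
            kl += p * math.log(p / max(q, eps))
    return kl
```

A vectorized prim like KL_Prim(P, Q) computes the same sum but requires both inputs as dense arrays; the incremental form trades that memory for a running scalar accumulator, which is what the naive TSNE kernel can update in place.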

@teju85
Member

teju85 commented Jul 23, 2019

Got it, makes sense now. Thanks @danielhanchen for explaining.

Minor point: you can reduce a couple of lines of code in yours if you decide to use the KLDOp defined here.

@danielhanchen
Contributor Author

Yep, sorry for the delay! I'll definitely use that :)

zbjornson added a commit to zbjornson/cuml that referenced this issue Jul 11, 2020
I assume the intent is to implement this after rapidsai#863 is completed, so I didn't remove it completely and just marked it as unused.
@drobison00
Contributor

@danielhanchen Do you know the current state of this work?

@danielhanchen
Contributor Author

@drobison00 I haven't been able to get around to this, sadly.
I did, however, write up some equations in https://github.com/danielhanchen/tsne/blob/master/TSNE%20Extended%20Notebook.ipynb which might be helpful.

The main aim is to find the KL sum for each point, which can be done inside the attractive-forces kernel.
So some manipulation of the formulas will be needed to get the sum of KL_ij.
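As a rough NumPy analogue of that idea (this is a hypothetical CPU sketch, not cuML's CUDA kernel): the attractive-forces pass already touches every (i, j) pair with nonzero p_ij, so it can accumulate a per-point KL partial sum KL_i in the same loop, and the total KL is just the sum over points.

```python
import numpy as np

def attractive_pass_with_kl(P, Y, eps=1e-12):
    """Attractive-forces pass that also returns per-point KL sums.

    P: dense affinity matrix, zero diagonal, normalized over all pairs.
    Y: (n, 2) low-dimensional embedding.
    Illustrative CPU sketch; the real kernel would fuse this on the GPU."""
    n = Y.shape[0]
    # Student-t kernel: q_ij proportional to 1 / (1 + ||y_i - y_j||^2)
    d2 = np.square(Y[:, None, :] - Y[None, :, :]).sum(-1)
    num = 1.0 / (1.0 + d2)
    np.fill_diagonal(num, 0.0)
    Q = num / num.sum()

    forces = np.zeros_like(Y)
    kl_per_point = np.zeros(n)
    for i in range(n):
        # attractive force on point i (standard t-SNE gradient term)
        w = (P[i] * num[i])[:, None]
        forces[i] = (w * (Y[i] - Y)).sum(0)
        # per-point KL partial sum, accumulated in the same pass
        mask = P[i] > eps
        kl_per_point[i] = np.sum(
            P[i][mask] * np.log(P[i][mask] / np.maximum(Q[i][mask], eps)))
    return forces, kl_per_point
```

Summing `kl_per_point` after the pass gives the exact KL for the iteration at essentially no extra memory cost, which is what makes it usable as an early-stopping diagnostic.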
