Gaussian Mixture Model #137

Merged · 14 commits · merged into elixir-nx:main on Jul 18, 2023
Conversation

krstopro (Member):
Added a Gaussian Mixture Model implementation, similar to the one in the scikit-learn Python library. Currently it supports only full covariances and uses k-means for initialisation. Other covariance types (diagonal, tied, spherical) and initialisation methods (random, k-means++) can easily be added.
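For orientation, a hypothetical usage sketch. The module name is inferred from the file added in this PR (lib/scholar/cluster/gmm.ex), and the option names num_gaussians and num_runs come up later in this conversation, but the exact API should be checked against the merged docs:

```elixir
# Hypothetical usage sketch; module and option names are assumptions.
key = Nx.Random.key(42)
{x, _key} = Nx.Random.normal(key, 0.0, 1.0, shape: {100, 2})

# Fit a 3-component GMM; num_runs repeats EM with fresh initial parameters.
model = Scholar.Cluster.GaussianMixture.fit(x, num_gaussians: 3, num_runs: 5)

# Assign each point to its most likely Gaussian.
Scholar.Cluster.GaussianMixture.predict(model, x)
```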

@msluszniak (Contributor) left a comment:
Thank you for your contribution! The PR LGTM :)

@polvalente (Contributor) left a comment:

Lots of really minor comments. Overall this is looking great!

lib/scholar/cluster/gmm.ex: two resolved review threads (outdated).
Doc excerpt under review in lib/scholar/cluster/gmm.ex (the standard EM updates it refers to are sketched after this thread):

> …expectation of the Gaussian assignment for each data point x, and the M-step, which updates the parameters to maximize the expectations found in the E-step. While every iteration of the algorithm is guaranteed to improve the log-likelihood, the final result depends on the initial values of the parameters, so the entire procedure should be repeated several times.
Contributor:

From reading without looking at the code beforehand:

Does this mean that we need to run the algorithm multiple times, or does the algorithm itself repeat this procedure?

krstopro (Member, Author):

The algorithm itself repeats the procedure num_runs times, each time with different initial parameters, similarly to what is already done in the k-means implementation.

krstopro (Member, Author):

I am not sure if I should change anything here. Thoughts?

Contributor:

For me, the description is clear. @polvalente do you have any suggestions?
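For reference, these are the textbook EM updates for a GMM with full covariances that the excerpt summarizes (standard formulas, not copied from the PR). With responsibilities $\gamma_{ik}$, mixture weights $\pi_k$, means $\mu_k$, and covariances $\Sigma_k$:

$$
\text{E-step:}\quad \gamma_{ik} = \frac{\pi_k\,\mathcal{N}(x_i \mid \mu_k, \Sigma_k)}{\sum_{j=1}^{K} \pi_j\,\mathcal{N}(x_i \mid \mu_j, \Sigma_j)}
$$

$$
\text{M-step:}\quad N_k = \sum_i \gamma_{ik}, \qquad
\mu_k = \frac{1}{N_k}\sum_i \gamma_{ik}\, x_i, \qquad
\Sigma_k = \frac{1}{N_k}\sum_i \gamma_{ik}\,(x_i-\mu_k)(x_i-\mu_k)^{\mathsf{T}}, \qquad
\pi_k = \frac{N_k}{n}
$$

The $\Sigma_k$ update is the responsibilities-weighted covariance that comes up again in a review thread near the end of this conversation.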

lib/scholar/cluster/gmm.ex: seven more resolved review threads (three outdated).
krstopro and others added 4 commits July 14, 2023 22:14
krstopro (Member, Author) commented on Jul 15, 2023:

@msluszniak @polvalente Thanks for the support and the feedback! I've addressed most of the suggestions; see the comments above (except for vectorization, which I would leave for now). There are a few more remarks I would like to make; see some of the comments I've written.

msluszniak (Contributor):

Are there any blockers for this PR?

krstopro (Member, Author):

@msluszniak Don't think so. I just need to address a few suggestions that you made. Could you have a look at the two reviews I opened? The logsumexp one might be important.

msluszniak (Contributor):

> @msluszniak Don't think so. I just need to address a few suggestions that you made. Could you have a look at the two reviews I opened? The logsumexp one might be important.

Sure, but tbh I cannot see them 😅

krstopro (Member, Author) commented on Jul 17, 2023:

> Sure, but tbh I cannot see them 😅

@msluszniak They are probably collapsed among many others :)
Tagged you in the comments.

msluszniak (Contributor):

Are you sure that your comments are not in pending mode or in the batch and not pushed or sth? Because I really cannot find anything

krstopro (Member, Author):

> Are you sure that your comments are not in pending mode or in the batch and not pushed or sth? Because I really cannot find anything

Sorry, they were in pending mode and I don't think I am able to submit them. Hence, I'll write them here.
Lines 357-366 include an implementation of logsumexp, which calculates the logarithm of the sum of the exponentials of a tensor in a numerically stable way. This is a well-known trick implemented in numerical libraries; see e.g. SciPy:
https://docs.scipy.org/doc/scipy/reference/generated/scipy.special.logsumexp.html
However, I am not aware that such a function exists in Nx. Is there one, or should I raise an issue about implementing it?
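For illustration, a minimal sketch of the trick in plain Nx.Defn (not the PR's code): subtract the per-axis maximum before exponentiating so nothing overflows, then add it back after the log.

```elixir
defmodule Stable do
  import Nx.Defn

  # Numerically stable log(sum(exp(t))) along the given axes:
  # exp(t - max) cannot overflow, and the max is added back at the end.
  defn logsumexp(t, opts \\ []) do
    opts = keyword!(opts, axes: [-1])
    max = Nx.reduce_max(t, axes: opts[:axes], keep_axes: true)
    Nx.log(Nx.sum(Nx.exp(t - max), axes: opts[:axes], keep_axes: true)) + max
  end
end
```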

Line 292: doing {num_gaussians, num_features, num_features} = Nx.shape(covariances) inside defn won't work, even though it works outside. I suppose this is intended?

msluszniak (Contributor):

I'm almost sure that there is no such function implemented in Nx, so feel free to send a PR with it. I'm not sure whether we eventually want this type of function in Nx or in Scholar, but it will be helpful.

That line won't work in defn because you're doing pattern matching. If you want to assert that the shape is as expected, you need to check for it separately.

polvalente (Contributor):

You can use case to check the validity of the shape inside defn.
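A hedged sketch of that pattern (the module, helper name, and expected shape are made up for illustration). Shapes are known when the defn is compiled, so case can branch on them, and the fallback clause raises at compile time:

```elixir
defmodule ShapeCheck do
  import Nx.Defn

  # `{g, d, d} = Nx.shape(covariances)` would not compile inside defn,
  # but `case` over the compile-time shape works.
  defn validate_covariances(covariances) do
    case Nx.shape(covariances) do
      # the repeated `d` asserts the last two dimensions are equal
      {_num_gaussians, d, d} -> covariances
      _ -> raise ArgumentError, "expected covariances of shape {num_gaussians, d, d}"
    end
  end
end
```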

krstopro (Member, Author):

> You can use case to check the validity of the shape inside defn.

I see, thanks. It's not an issue here; I just wanted to check that this is intended.
Anyway, if everything else is fine, can you approve the PR?

Review thread on a code excerpt from lib/scholar/cluster/gmm.ex (diff context, truncated):

```elixir
  },
  k < num_gaussians do
    diff = x - means[k]
    covariance = Nx.dot(responsibilities[[.., k]] * Nx.transpose(diff), diff) / nk[k]
```
Contributor:

Suggested change:

```diff
-covariance = Nx.dot(responsibilities[[.., k]] * Nx.transpose(diff), diff) / nk[k]
+covariance = Nx.dot(responsibilities[[.., k]] * diff, [-2], diff / nk[k], [-2])
```

krstopro (Member, Author):

This won't work, as Nx.transpose is applied only to diff, not to the entire product. Adding another axis to responsibilities[[.., k]] solves the problem:

```elixir
covariance = Nx.dot(diff * Nx.new_axis(responsibilities[[.., k]], 1), [0], diff / nk[k], [0])
```
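For context, a small standalone check (with made-up shapes) of what the contraction axes do here: Nx.dot(a, [0], b, [0]) contracts both operands over axis 0, matching Nx.dot(Nx.transpose(a), b) without materializing the transpose.

```elixir
a = Nx.iota({4, 3}, type: :f32)
b = Nx.iota({4, 2}, type: :f32)

left = Nx.dot(a, [0], b, [0])      # contract both over axis 0 -> shape {3, 2}
right = Nx.dot(Nx.transpose(a), b) # same product via an explicit transpose

Nx.all(Nx.equal(left, right))      # => #Nx.Tensor<u8 1>
```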

josevalim (Contributor):

@krstopro please verify @polvalente's suggestions above and we can ship it :)

@msluszniak added the labels "enhancement" (New feature or request) and "good first issue" (Good for newcomers) on Jul 17, 2023
@josevalim merged commit 5c51869 into elixir-nx:main on Jul 18, 2023
josevalim (Contributor):

💚 💙 💜 💛 ❤️

@krstopro changed the title from "Adding Gaussian Mixture Model" to "Gaussian Mixture Model" on Apr 21, 2024