Skip to content

What's the best method of getting correlation matrix? #3070

Answered by oleksiyskononenko
vopani asked this question in Q&A
Discussion options

You must be logged in to vote

We could have the correlation matrix capability implemented. However, I guess that for big data performance of the native implementation will not be very different from your code, since the correlation reducer is already parallel internally. Btw, you could increase your performance by the factor of two calculating only a half of the symmetric matrix.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@vopani
Comment options

Answer selected by vopani
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants