-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Limit the number of variables to plot on correlation circle ? #78
Comments
Hi Pierre-François, I am glad you like the package! You are right. There is no current way of limiting the number of variables shown in the correlation circles. For other types of plots, we chose to have a In the case of the Fabien |
Of course I love it ! It is just awesome and I am only just starting playing with it :-) What would be great would be to be able to put thresholds for correlation and / or greatness of projection (cos2) for correlation circle and biology, in the way the author of factoextra implemented it for Factominer objects. I guess I could try to implement something by forking the current version on GitHub. But I am really not proefficient at developing R packages. Do you think this would be not-too-much time consuming / difficult to do ? All the best and thanks again ! Pef |
Ok, I see that they have the I can do it, but feel free if you want to do it! The only file to change is df <- df[seq(min(n_mark, NROW(df))), ] with the new selection procedure. I would use the So please tell me what you prefer and if you agree with my proposition. Best, |
That sounds great ! I need to think a bit about it but I think this is exactly matching what I would love. Another thing I thought about this afternoon is to implement an equivalent of the cim function in mixOmics package - which draw a heatmap with biclustering for selected variables, while projecting the class samples belong to, and blocks variables belong to. That would be super interesting for SGCCA. A circos plot might be useful as well. I would be happy to give a hand on this. What do you think ? Thanks a lot ! Pef |
Hi Fabien, I hope that you're great. I've started digging a bit in the code and I think it's a little bit out of my skills. Do you think you can do it in a reasonable timing ? That would be awesome and so helpful. Thanks a lot and have a great day. ++ Pef |
Hi Pef, Yes, I could do it this week. I looked at the Best, |
Hi @PFRoux I'm sorry I didn't take the time last week. I've worked on it today. I thought of different modifications compared to what we discussed and would like your opinion on that. First, instead of names Now, it leads to issues we did not have to consider before. Previously, we had a single value per variable/sample in the plots where
What do you think? @Tenenhaus, don't hesitate to let us know if you have any ideas. Best, |
Thank you so much for your feedback and sorry for the delay - I was in vacations for a couple of days. Regarding the selection process in 2D plots, what about offering 2 options :
Do you think it would be difficult to implement ? Thanks a lot for your help. Pef |
I am trying to understand what their cos2 represents. It is a normalization of the weight associated with each variable in each component, but I did not see the formula for how it is calculated. Because of the constraints in our models, the weights are already between -1 and 1 (or 0 and 1 if you square them), so we do not need an extra normalization. Do you have a specific interpretation of this cos2 measure? There would not be too many differences between putting a correlation threshold on either of the two components or putting a threshold on the distance to the center. So, proposition 1 makes perfect sense. I do not think it would be difficult to implement, but I do not see the point in implementing both propositions. What would be the benefit of having both instead of just proposition 1? Best, |
Hi Fabien, I am facing the same problem as you finding out what the cos2 really correspond to - I cannot find the formula. Best, Pef |
Hi @PFRoux you can try out this branch https://github.com/rgcca-factory/RGCCA/tree/limit_number_of_var_in_plots to see if it suits your needs! Best, |
Thank you so much @GFabien for your help. Best, Peu |
Hi @PFRoux, |
Thanks a lot for this really awesome package.
Is there a way to put a cutoff in term of correlation or projection to the variables to select only part of them when using plot(fit, type = "cor_circle") ?
Thanks a lot.
Pierre-François
The text was updated successfully, but these errors were encountered: