-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wighted vs unweighted UniFrac strong difference #149
Comments
@ArnaudGaudry that's interesting. Mathematically speaking Bray-Curtis is more similar to weighted UniFrac than it is to unweighted UniFrac. I don't remember seeing this plot, mainly because we knew the abundance-based weighting inherent to the weighted variant of UniFrac would play an important role based on other experiments and tests we ran before. @anupriyatripathi any thoughts on this? |
@ArnaudGaudry <https://github.com/ArnaudGaudry> thanks for the question and
your analysis!
In line with what Yoshiki said, we used weighted UniFrac because abundances
encode important information for metabolomics data analysis. Bray-Curtis
also uses abundances similar to weighted UniFrac and therefore we used it
for our comparisons.
It's interesting that you see unweighted UniFrac capturing batch-to-batch
variation. This could be due to the property of this metric to give
importance to really low abundance signals as well, which might be
different between the batches (due to shifts in retention time.)
I also expect that if you compare unweighted UniFrac to a comparable metric
such as the tree-agnostic binary Jaccard distance (using PERMANOVA test
statistic), you might see that the UniFrac metric improves the batch effect
even if it doesn't reconcile the batches completely. We'd love to take a
look at your plots/analysis if you'd like more input.
Thanks again for the question - very interesting!
Anupriya Tripathi, PhD
…On Fri, 23 Jul 2021 at 10:52, Yoshiki Vázquez Baeza < ***@***.***> wrote:
@ArnaudGaudry <https://github.com/ArnaudGaudry> that's interesting.
Mathematically speaking Bray-Curtis is more similar to weighted UniFrac
than it is to unweighted UniFrac. I don't remember seeing this plot, mainly
because we knew the abundance-based weighting inherent to the weighted
variant of UniFrac would play an important role based on other experiments
and tests we ran before. @anupriyatripathi
<https://github.com/anupriyatripathi> any thoughts on this?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#149 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AD6ULPMXID7H6XVV3XEIFVLTZGT53ANCNFSM5AYAHNKA>
.
|
@anupriyatripathi @ElDeveloper Thank you for your detailed answers! It is indeed maybe due to the weight given to low abundance metabolites. PERMANOVA is a really good idea to measure the groups separations and I'll give it a try. Since the idea is to use chemical relationships to mitigate the batch effect, I also compared Qemistree to CSCS (also weighted and unweighted). |
Hello qemistree developers!
I tried to reproduce analyses from the publication on the evaluation dataset: https://github.com/knightlab-analyses/qemistree-analyses/blob/master/Evaluation-Dataset-Analyses.ipynb
When generating the plot using the metricunweighted_unifrac instead of weighted_normalized_unifrac , it generates a really different plot that is actually quite similar to the one generated using bray-curtis (strong batch effect visible). Is this inherent in the metric and expected? I thought you might have tested it in development!
Thanks and best regards,
Arnaud
The text was updated successfully, but these errors were encountered: