Example of scATAC-seq pseudobulk analysis for differential accessibility testing #11
Can I use ArchR for this example?
@paupaiz Possibly, could you briefly describe more? My intention here is to compare accessibility between experimental conditions (e.g. WT and mutant) where we have multiple replicates of each. The test would produce cluster-by-cluster results: genomic regions, log-fold-changes, and adjusted p-values. The ArchR documentation on this uses an example of one cell type vs. another (i.e. finding marker regions that are specific to one cell type), which IMO is a slightly different problem, but as long as it does this in a replicate-aware fashion it could also work?
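To make the replicate-aware setup above concrete, here is a minimal sketch of the pseudobulking step: per-cell region counts are summed into one profile per (cluster, sample), so that each replicate stays a separate observation for the downstream test. The function name `pseudobulk`, the data layout, and the toy values are all assumptions for illustration, not taken from ArchR, Signac, or muscat.

```python
from collections import defaultdict


def pseudobulk(cells, counts):
    """Sum per-cell region counts into one profile per (cluster, sample).

    cells  : dict mapping cell_id -> (cluster, sample)
    counts : dict mapping cell_id -> {region: count}
    """
    bulk = defaultdict(lambda: defaultdict(int))
    for cell_id, (cluster, sample) in cells.items():
        for region, n in counts.get(cell_id, {}).items():
            bulk[(cluster, sample)][region] += n
    # Convert nested defaultdicts to plain dicts for easier inspection.
    return {key: dict(regions) for key, regions in bulk.items()}


# Hypothetical toy data: two clusters, one WT and one mutant replicate.
cells = {
    "c1": ("cluster0", "WT_rep1"),
    "c2": ("cluster0", "WT_rep1"),
    "c3": ("cluster0", "mut_rep1"),
    "c4": ("cluster1", "WT_rep1"),
}
counts = {
    "c1": {"chr1:100-600": 2, "chr1:900-1400": 1},
    "c2": {"chr1:100-600": 1},
    "c3": {"chr1:100-600": 4},
    "c4": {"chr1:900-1400": 3},
}
pb = pseudobulk(cells, counts)
```

Each cluster's pseudobulk profiles would then go into a count-based differential test with condition as the covariate and replicates as samples, run once per cluster.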
For multilevel pseudobulk analyses, see
@stemangiola @jeremymsimon I developed condition-aware pseudobulking for ArchR, motivated by this paper. We will include this in the next release (1.0.3). Let me know if it would be useful to have an example here!
I have a working example using Signac and DESeq2. Am I eligible to contribute the code? I am sorry for the basic question, but I could not find anything about this in the contribution guidelines.
@jeremymsimon, @stemangiola, @paupaiz
@AmelZulji would love to check out your post so I can answer!
Hi @paupaiz, @jeremymsimon, @stemangiola. Here is the reproducible example code:
Thanks @AmelZulji, this is a helpful place to start and may work for many cases. When I first posted this challenge, I was envisioning including a local background correction as well, like
Thanks @jeremymsimon, in my opinion, the correction in The only concern might be "double dipping", as mentioned at the end of the first paragraph in this chapter of Please correct me if I'm wrong or have misunderstood your points.
Any comments on this, @jeremymsimon? I would be interested in working on this further if you have suggestions. Regards,
Hi @AmelZulji I'm not a statistician but I think these are separate issues. The issue regarding "double-dipping" is related to constructing your peak set in a condition-aware fashion, over which the differential test will then be conducted. In other words, if you call peaks in ConditionA, then call peaks in ConditionB, then merge into a union set and test over those windows, you risk losing error control.
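To illustrate the distinction, the sketch below contrasts the two ways of defining test regions on a single chromosome. `merge_union` builds the condition-aware union set described above (the construction that risks losing error control), while `fixed_windows` tiles the chromosome without looking at condition labels, which as I understand it is closer in spirit to the window-based approach in csaw. Both function names and all coordinates are hypothetical.

```python
def merge_union(peaks_a, peaks_b):
    """Merge two lists of (start, end) peaks into a sorted union set.

    Overlapping or touching intervals are collapsed into one region.
    """
    merged = []
    for start, end in sorted(peaks_a + peaks_b):
        if merged and start <= merged[-1][1]:
            # Extend the previous region to cover this overlapping peak.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def fixed_windows(chrom_length, width):
    """Tile a chromosome into fixed-width windows, ignoring condition labels."""
    return [(s, min(s + width, chrom_length)) for s in range(0, chrom_length, width)]


# Hypothetical peak calls made separately in each condition.
peaks_condition_a = [(100, 300), (1000, 1200)]
peaks_condition_b = [(250, 400), (2000, 2100)]

union = merge_union(peaks_condition_a, peaks_condition_b)
windows = fixed_windows(chrom_length=1000, width=250)
```

In the union set, regions such as `(2000, 2100)` exist only because one condition had signal there, so the choice of test regions already depends on the contrast being tested. The fixed windows carry no such dependence.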
What I'm proposing here is to do the following:
@stemangiola apparently has methods (mentioned above) for doing some of this, but IMO it would be nice to work through a solution that includes the local background/composition bias correction that broad window counts provide. None of this has been published before for scATAC-seq, AFAIK, so this would provide a means for us to evaluate whether there is any benefit to testing in this fashion and context.
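As a rough illustration of how broad window counts can correct composition bias, the sketch below estimates one scaling factor per pseudobulk sample from counts in large bins. It assumes that most broad bins are dominated by background and are not differentially accessible. Note that it uses a median-of-ratios estimate as a simple stand-in; csaw itself, as far as I know, applies TMM normalization to large-bin counts, so this is not that implementation. The function name `size_factors` and the counts are hypothetical.

```python
import math
from statistics import median


def size_factors(bin_counts):
    """Median-of-ratios scaling factors from broad-bin counts.

    bin_counts : dict mapping sample -> list of counts, one per broad bin,
                 with bins in the same order for every sample.
    """
    samples = list(bin_counts)
    n_bins = len(bin_counts[samples[0]])
    # Keep only bins with non-zero counts in every sample so logs are defined.
    usable = [i for i in range(n_bins) if all(bin_counts[s][i] > 0 for s in samples)]
    if not usable:
        raise ValueError("no bin has non-zero counts in every sample")
    # Log of the geometric mean across samples, per usable bin.
    log_ref = {
        i: sum(math.log(bin_counts[s][i]) for s in samples) / len(samples)
        for i in usable
    }
    # The median log-ratio to the reference is robust to a minority of
    # bins that change strongly between samples.
    return {
        s: math.exp(median(math.log(bin_counts[s][i]) - log_ref[i] for i in usable))
        for s in samples
    }


# Hypothetical broad-bin counts: the mutant sample is sequenced about twice
# as deeply, and the last bin additionally shows a strong accessibility change.
bins = {
    "WT_rep1": [100, 200, 300, 50],
    "mut_rep1": [200, 400, 600, 1000],
}
sf = size_factors(bins)
```

Here the ratio of the two factors recovers the twofold depth difference, and the one strongly changed bin does not distort it. Applying factors like these as offsets in the window-level test is the step the proposal above would need to evaluate.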
Could be downstream of #1
Work through a multi-sample, multi-condition differential test using pseudobulked counts, à la a hybrid of muscat and csaw