
Bulk rnaseq multiqc per sample #67

Open · wants to merge 14 commits into base: main
Conversation

AlaaALatif (Member)
Key modifications to the pipeline:

  • MultiQC report per sample
    • In addition to the global MultiQC report, there is now an individual report for each sample ID.
    • Reports now include additional metrics from SNP calling.
  • Global CPU resource management
    • Added an executor.cpus parameter in config/base.config to control the maximum number of CPUs the pipeline may use under the local profile, i.e. when running the pipeline on a local machine or on a development (dev) node.
    • Added an executor.queueSize parameter in config/base.config to control the maximum number of jobs that may run concurrently when running the pipeline on a cluster via SLURM.
    • Added a max_cpus_per_job parameter in config/parameters.config to control the maximum number of CPUs any single job may use when running the pipeline on a cluster via SLURM.
    • Together, the last two additions put a ceiling on the number of CPUs the pipeline can consume via SLURM, preventing the risk of "hogging" C4 resources.
  • Optional SNP calling
    • Added a call_snps parameter in config/parameters.config to control whether or not the SNP-calling steps are carried out. This was requested by the Genomics core and makes sense, since these steps have considerably high computational requirements.
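The configuration changes described above might look roughly like the following. This is an illustrative sketch, not the actual file contents of this PR: the parameter names (executor.cpus, executor.queueSize, max_cpus_per_job, call_snps) come from the description, while the specific values and profile names are assumptions.

```groovy
// config/base.config -- sketch; values are illustrative
profiles {
    local {
        // ceiling on total CPUs when running on a local machine or dev node
        executor.cpus = 8
    }
    slurm {
        process.executor = 'slurm'
        // maximum number of jobs queued/running at once via SLURM
        executor.queueSize = 20
    }
}
```

```groovy
// config/parameters.config -- sketch; defaults are illustrative
params {
    // cap on CPUs requested by any single SLURM job
    max_cpus_per_job = 4
    // skip the computationally heavy SNP-calling steps unless requested
    call_snps = false
}
```

Note that Nextflow's executor.cpus setting only constrains the local executor; on SLURM the ceiling comes from combining executor.queueSize with a per-job CPU cap, which is why both are needed.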

@AlaaALatif AlaaALatif requested review from erflynn and dtm2451 March 7, 2024 19:45
erflynn (Collaborator) commented Mar 13, 2024

This is great!! Very excited to have the separated MQC outs and I think it's good to reduce the number of jobs for slurm. Such a bummer we can't easily set maximum cpus for slurm in nextflow, but I think what you have is a good call.
Only comment is that we may want to consider setting up a profile that takes more time but fewer cpus? Not part of this PR of course, but something for the future.
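The suggested "more time, fewer CPUs" profile could be sketched as below. This is purely hypothetical (explicitly not part of this PR); the profile name and values are assumptions layered on the parameters described earlier.

```groovy
// hypothetical "low_resource" profile -- a sketch of the reviewer's suggestion
profiles {
    low_resource {
        // request fewer CPUs per job...
        params.max_cpus_per_job = 2
        // ...in exchange for a longer allowed wall time
        process.time = '48h'
    }
}
```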
