Skip to content

Commit

Permalink
Merge branch 'workflows/transcriptomics/rnaseq-sr' of github.com:plan…
Browse files Browse the repository at this point in the history
…emo-autoupdate/iwc into workflows/transcriptomics/rnaseq-sr
  • Loading branch information
lldelisle committed Sep 1, 2023
2 parents 1cf96a6 + a88b3ee commit 3062747
Show file tree
Hide file tree
Showing 21 changed files with 878 additions and 320 deletions.
4 changes: 2 additions & 2 deletions .github/workflows/workflow_test.yml
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@ jobs:
with:
python-version-list: "[\"3.7\"]"
galaxy-fork: galaxyproject
galaxy-branch: release_22.05
galaxy-branch: release_23.1
max-chunks: 4

# Planemo lint the changed repositories
Expand Down Expand Up @@ -57,7 +57,7 @@ jobs:
python-version-list: "[\"3.7\"]"
repository-list: ${{ needs.setup.outputs.repository-list }}
galaxy-fork: galaxyproject
galaxy-branch: master
galaxy-branch: release_23.1
check-outputs: false

combine_outputs:
Expand Down
6 changes: 5 additions & 1 deletion workflows/epigenetics/atacseq/.dockstore.yml
Original file line number Diff line number Diff line change
@@ -1,7 +1,11 @@
version: 1.2
workflows:
- name: main
primaryDescriptorPath: /atacseq.ga
subclass: Galaxy
publish: true
primaryDescriptorPath: /atacseq.ga
testParameterFiles:
- /atacseq-tests.yml
authors:
- name: Lucille Delisle
orcid: 0000-0002-1964-4960
10 changes: 10 additions & 0 deletions workflows/epigenetics/atacseq/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,15 @@
# Changelog

## [0.5] 2023-03-17

### Automatic update
- `toolshed.g2.bx.psu.edu/repos/devteam/picard/picard_MarkDuplicates/2.18.2.3` was updated to `toolshed.g2.bx.psu.edu/repos/devteam/picard/picard_MarkDuplicates/2.18.2.4`
- `toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_coveragebed/2.30.0` was updated to `toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_coveragebed/2.30.0+galaxy1`
- `toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.0+galaxy1` was updated to `toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.4+galaxy0`

### Manual update
- add normalization steps for coverage

## [0.4] 2023-01-16

### Automatic update
Expand Down
7 changes: 5 additions & 2 deletions workflows/epigenetics/atacseq/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,7 @@ You can have more information about ATAC-seq analysis in the [slides](https://tr

- reference_genome: this field will be adapted to the genomes available for bowtie2 and the genomes available for bedtools slopbed (dbkeys table)
- effective_genome_size: this is used by macs2 and may be entered manually (indications are provided for heavily used genomes)
- bin_size: this is used when normalization of coverage is performed. Large values will allow to have smaller output files but with less resolution while small values will increase computation time and size of output files to produce more resolutive bigwigs.

## Processing

Expand All @@ -21,13 +22,15 @@ You can have more information about ATAC-seq analysis in the [slides](https://tr
- The BAM is converted to BED to enable macs2 to take both pairs into account.
- The peaks are called with macs2 which at the same time generates a coverage file.
- The coverage file is converted to bigwig
- The amount of reads 500bp from summits and the total number of reads are computed if further normalization is wanted.
- The amount of reads 500bp from summits and the total number of reads are computed.
- Two normalizations are computed:
- By million reads
- By million reads in peaks (500bp from summits)
- Other QC are performed:
- A histogram with fragment length is computed.
- The evaluation of percentage of reads to chrM or MT is computed.
- A multiQC is run to have an overview of the QC.

### Warning

- The coverage output is not normalized.
- The `reference_genome` parameter value is used to select references in bowtie2 and bedtools slopbed. Only references that are present in bowtie2 **and** bedtools slopbed are selectable. If your favorite reference genome is not available ask your administrator to make sure that each bowtie2 reference has a corresponding len file for use in bedtools slopbed.
19 changes: 17 additions & 2 deletions workflows/epigenetics/atacseq/atacseq-tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,8 @@
location: https://zenodo.org/record/3862793/files/SRR891268_chr22_enriched_R2.fastq.gz
filetype: fastqsanger
reference_genome: hg19
effective_genome_size: '2700000000'
effective_genome_size: 2700000000
bin_size: 1000
outputs:
mapping stats:
element_tests:
Expand Down Expand Up @@ -82,6 +83,20 @@
asserts:
has_line:
line: "10167"
bigwig_norm:
element_tests:
SRR891268_chr22_enriched:
asserts:
has_size:
value: 1130014
delta: 100000
bigwig_norm2:
element_tests:
SRR891268_chr22_enriched:
asserts:
has_size:
value: 1133795
delta: 100000
'MultiQC on input dataset(s): Stats':
element_tests:
bowtie2:
Expand All @@ -91,7 +106,7 @@
cutadapt:
asserts:
has_line:
line: "SRR891268_chr22_enriched_2\t4.0\t285247\t41011\t40415\t4283\t280964\t28524700\t480633\t27163516\t4.771948521807416"
line: "SRR891268_chr22_enriched_2\t4.4\t285247\t41011\t40415\t4283\t280964\t28524700\t480633\t27163516\t4.771948521807416"
general_stats:
asserts:
has_text:
Expand Down
Loading

0 comments on commit 3062747

Please sign in to comment.