Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merging from issue 15 (multi-sample possibility) #23

Merged
merged 159 commits into from
Nov 4, 2021
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
159 commits
Select commit Hold shift + click to select a range
b043f00
Update nextflow.config
fmalmeida Jul 4, 2021
46d7e96
Merge branch 'master' into develop
fmalmeida Jul 10, 2021
e7908af
Merge branch 'master' into develop
fmalmeida Jul 10, 2021
d42c788
trying to update docker contents
fmalmeida Jul 15, 2021
d1b1c16
Update Dockerfile
fmalmeida Jul 16, 2021
17c04d9
Merge branch 'master' into develop
fmalmeida Jul 16, 2021
ab861bd
Merge branch 'master' into develop
fmalmeida Jul 16, 2021
aa82cb8
Update nextflow.config
fmalmeida Jul 16, 2021
4537348
genomicconsensus changed to gcpp
fmalmeida Jul 19, 2021
b8f451f
removing default --no-correct option
fmalmeida Jul 19, 2021
93d294c
update line of software
fmalmeida Jul 19, 2021
3ae24ba
Update README.md
fmalmeida Jul 19, 2021
20a1d15
Update README.md
fmalmeida Jul 19, 2021
da7b98d
testing sub-directory names
fmalmeida Jul 26, 2021
6522bbd
try to fix directory organization
fmalmeida Jul 29, 2021
731cacb
fixing dir names
fmalmeida Jul 29, 2021
9a5f106
try to fix paired end ids
fmalmeida Jul 30, 2021
029f4a8
Minor fix
fmalmeida Aug 12, 2021
d30660a
Update short-reads-only.nf
fmalmeida Aug 23, 2021
2f75cab
adding software
fmalmeida Aug 30, 2021
6366b8d
wtdbg2 added
fmalmeida Aug 31, 2021
a087261
changing single quote to double quote
fmalmeida Aug 31, 2021
e65b6e4
single quote to double quote
fmalmeida Aug 31, 2021
c8898ee
changing single to double quotes
fmalmeida Aug 31, 2021
f95ab0e
shasta assembler added
fmalmeida Aug 31, 2021
befaf45
completing config templates
fmalmeida Aug 31, 2021
3820f7b
new assemblers added
fmalmeida Sep 6, 2021
45303be
cleaning files
fmalmeida Sep 6, 2021
6f9f195
adding new long reads assemblers
fmalmeida Sep 6, 2021
163ee61
fixing output sufix
fmalmeida Sep 6, 2021
17d8494
fixing output sufix
fmalmeida Sep 6, 2021
c9f02ee
Delete quast_all_at_once.nf
fmalmeida Sep 6, 2021
f1f1009
fixing output directory
fmalmeida Sep 7, 2021
ba312c6
fixing pilon call with reads values
fmalmeida Sep 7, 2021
90151a9
typo fix
fmalmeida Sep 7, 2021
8be86b6
creating a module for prefix
fmalmeida Sep 8, 2021
a9fc42d
Update multiqc.nf
fmalmeida Sep 11, 2021
f394434
Adding csvtk to image
fmalmeida Sep 11, 2021
8a1022b
creating a module that separately evaluates prefix
fmalmeida Sep 11, 2021
66c0b82
Merge branch 'master' into develop
fmalmeida Sep 12, 2021
81567ee
Merge branch 'master' into develop
fmalmeida Sep 13, 2021
f40a185
Merge branch 'master' into develop
fmalmeida Sep 13, 2021
d6315a9
Merge branch 'master' into develop
fmalmeida Sep 14, 2021
531a818
Merge branch 'master' into develop
fmalmeida Sep 14, 2021
dd7e0d9
updating docs
fmalmeida Sep 16, 2021
6d3c8a8
v2.3 finished?
fmalmeida Sep 16, 2021
1b553a3
Update README.md
fmalmeida Sep 16, 2021
dde957b
Update multiqc.nf
fmalmeida Sep 16, 2021
d2c9b2a
added logo in readme
fmalmeida Sep 17, 2021
d7562ef
update
fmalmeida Sep 17, 2021
3a73fb4
update
fmalmeida Sep 17, 2021
d71df86
v2.3 ready
fmalmeida Sep 17, 2021
13738d6
Update config.rst
fmalmeida Sep 17, 2021
ea415ef
Update nextflow_schema.json
fmalmeida Sep 17, 2021
676e88a
Update CHANGELOG.md
fmalmeida Sep 17, 2021
0f30dda
added prefix option
fmalmeida Sep 18, 2021
7e0dab7
Merge branch 'develop' of https://github.com/fmalmeida/MpGAP into dev…
fmalmeida Sep 18, 2021
84e8314
Update manual.rst
fmalmeida Sep 18, 2021
63c5b3e
Update quickstart.rst
fmalmeida Sep 18, 2021
0afd539
Update manual.rst
fmalmeida Sep 18, 2021
d7a474a
Update manual.rst
fmalmeida Sep 18, 2021
a96892a
Update manual.rst
fmalmeida Sep 18, 2021
aded828
Update manual.rst
fmalmeida Sep 18, 2021
ea04761
Update config.rst
fmalmeida Sep 18, 2021
54d2b3b
fix dir name
fmalmeida Sep 18, 2021
f64ce42
change suffix
fmalmeida Sep 18, 2021
30a8af9
Merge branch 'master' into develop
fmalmeida Sep 20, 2021
a979a83
Merge branch 'master' into develop
fmalmeida Sep 20, 2021
c9f8cd5
Merge branch 'master' into develop
fmalmeida Sep 24, 2021
71da4e2
first step of implementation
fmalmeida Sep 24, 2021
1a7b574
first step towards multi-sample implementation
fmalmeida Sep 27, 2021
f93adff
changing processes names of long reads assemblers
fmalmeida Sep 27, 2021
b8f26d0
fixing shasta decompression
fmalmeida Sep 27, 2021
fc21a74
fixed shasta uncompression
fmalmeida Sep 27, 2021
e0f603d
implementation test of lr-only batch workflow
fmalmeida Oct 1, 2021
54bb51d
merge (#21)
fmalmeida Oct 1, 2021
321a09c
long-reads only multi-sample workflow done
fmalmeida Oct 7, 2021
1a7626a
finally created a way to filter workflows
fmalmeida Oct 7, 2021
279aace
finally working with workflow filter
fmalmeida Oct 7, 2021
979f564
workflow conditional working
fmalmeida Oct 7, 2021
61a24f8
single genome working using concat modules
fmalmeida Oct 21, 2021
0b305e0
adding the 'cpus' directive
fmalmeida Oct 21, 2021
34b1ccf
delete unused file
fmalmeida Oct 22, 2021
9abd666
Merge branch 'issue-15' into develop
fmalmeida Oct 22, 2021
5795acd
aligning brackets
fmalmeida Oct 22, 2021
65192e7
changed gcpp to use a single bam
fmalmeida Oct 23, 2021
3bd253c
update bam information
fmalmeida Oct 23, 2021
9c6a214
Merge branch 'master' into issue-15
fmalmeida Oct 29, 2021
4d227eb
multi-sample-all sucessfully run
fmalmeida Oct 29, 2021
866b68a
Merge branch 'issue-15' of https://github.com/fmalmeida/MpGAP into is…
fmalmeida Oct 29, 2021
1cbffd4
fixing prefixes parsing
fmalmeida Oct 30, 2021
714e30d
updating readme
fmalmeida Oct 30, 2021
12b6e46
fixing detection of corrected long reads
fmalmeida Nov 1, 2021
37f5120
updating corrected long reads check-up
fmalmeida Nov 1, 2021
3b07054
Update nextflow_schema.json
fmalmeida Nov 1, 2021
d7d65fc
help message only in config and online
fmalmeida Nov 1, 2021
bd83268
Create .gitignore
fmalmeida Nov 1, 2021
9b15c56
Update help.nf
fmalmeida Nov 1, 2021
6263afe
input files only passed via samplesheet
fmalmeida Nov 1, 2021
0730127
Update main.nf
fmalmeida Nov 1, 2021
ba82c24
finishing example samplesheet
fmalmeida Nov 1, 2021
8ce0a09
Update nextflow.config
fmalmeida Nov 1, 2021
6d88377
Update README.md
fmalmeida Nov 2, 2021
0de5afe
Update writeCSV.nf
fmalmeida Nov 2, 2021
dd900ec
Update logMessages.nf
fmalmeida Nov 2, 2021
c653172
changing in_yaml to input
fmalmeida Nov 2, 2021
32438b3
changing outdir and output parameter keys
fmalmeida Nov 2, 2021
bd8e6f2
update on params.lr_type
fmalmeida Nov 2, 2021
689a261
write_csv function properly parsing samplesheet
fmalmeida Nov 2, 2021
f89a568
possible to use params check from another file?
fmalmeida Nov 2, 2021
6f41360
Delete paramsCheck.nf
fmalmeida Nov 2, 2021
78dd837
Update writeCSV.nf
fmalmeida Nov 2, 2021
ba33836
changing behaviour of wtdbg2 technology assumption
fmalmeida Nov 2, 2021
4e37c45
added samplesheet template
fmalmeida Nov 2, 2021
3720c2b
Update unicycler_polish.nf
fmalmeida Nov 2, 2021
e56d695
Update CHANGELOG.md
fmalmeida Nov 2, 2021
b56d0fd
added shasta config parameter
fmalmeida Nov 2, 2021
ba9fc77
Update .gitignore
fmalmeida Nov 2, 2021
c57bc0b
Create settings.json
fmalmeida Nov 2, 2021
824acbe
updating parameterization
fmalmeida Nov 2, 2021
0e65fe0
Update conf.py
fmalmeida Nov 2, 2021
f01a95a
Update main.nf
fmalmeida Nov 2, 2021
bef4863
Update example_samplesheet.yml
fmalmeida Nov 2, 2021
8d4f782
documenting samplesheet
fmalmeida Nov 2, 2021
3bf103e
updating parameter information
fmalmeida Nov 2, 2021
d7366b2
Update config.rst
fmalmeida Nov 2, 2021
c111465
fixing config display
fmalmeida Nov 2, 2021
139c863
add information on samplesheet template download
fmalmeida Nov 2, 2021
a011544
adding more information about params
fmalmeida Nov 2, 2021
f2655c4
add shasta_config explanation
fmalmeida Nov 2, 2021
b038f0d
reorganizing changelog and earlier releases md
fmalmeida Nov 2, 2021
d9a083f
update readme
fmalmeida Nov 2, 2021
810337e
update readme
fmalmeida Nov 2, 2021
e4763f7
update readme
fmalmeida Nov 2, 2021
efd563a
update readme
fmalmeida Nov 2, 2021
5480938
update readme
fmalmeida Nov 2, 2021
e4276dc
finishing readme
fmalmeida Nov 2, 2021
af10f09
finishing readme
fmalmeida Nov 2, 2021
dde5e57
complete
fmalmeida Nov 2, 2021
f000d17
finishing samplesheet manual
fmalmeida Nov 2, 2021
eb715c7
stantardising pilon polish directory
fmalmeida Nov 3, 2021
1f78735
removing view statement
fmalmeida Nov 3, 2021
59a2b4e
fixing multiqc
fmalmeida Nov 3, 2021
3358c62
fixing "consensus" nomenclature for polishers
fmalmeida Nov 3, 2021
423e466
bringing defaults to params given in config and cli
fmalmeida Nov 3, 2021
0f0767f
changing params behaviour
fmalmeida Nov 3, 2021
58fd72e
Update wtdbg2.nf
fmalmeida Nov 3, 2021
f199f5b
Update main.nf
fmalmeida Nov 3, 2021
e18592e
update feedback info
fmalmeida Nov 3, 2021
316516b
update changelog information
fmalmeida Nov 3, 2021
270dfc8
Update writeCSV.nf
fmalmeida Nov 3, 2021
7c9093a
Update installation.rst
fmalmeida Nov 3, 2021
bea6570
Update quickstart.rst
fmalmeida Nov 3, 2021
679bde4
documentation finished
fmalmeida Nov 4, 2021
180415f
finishing nextflow config file
fmalmeida Nov 4, 2021
a95fed0
finishing template samplesheet
fmalmeida Nov 4, 2021
7b805dc
finished changelog
fmalmeida Nov 4, 2021
5f5a581
finished main.nf
fmalmeida Nov 4, 2021
93d1add
finishing corrected long reads parsing
fmalmeida Nov 4, 2021
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
38 changes: 38 additions & 0 deletions .github/ISSUE_TEMPLATE/bug_report.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,38 @@
---
name: Bug report
about: Create a report to help us improve
title: ''
labels: ''
assignees: ''

---

**Describe the bug**
A clear and concise description of what the bug is.

**To Reproduce**
Steps to reproduce the behavior:
1. Go to '...'
2. Click on '....'
3. Scroll down to '....'
4. See error

**Expected behavior**
A clear and concise description of what you expected to happen.

**Screenshots**
If applicable, add screenshots to help explain your problem.

**Desktop (please complete the following information):**
- OS: [e.g. iOS]
- Browser [e.g. chrome, safari]
- Version [e.g. 22]

**Smartphone (please complete the following information):**
- Device: [e.g. iPhone6]
- OS: [e.g. iOS8.1]
- Browser [e.g. stock browser, safari]
- Version [e.g. 22]

**Additional context**
Add any other context about the problem here.
20 changes: 20 additions & 0 deletions .github/ISSUE_TEMPLATE/feature_request.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
---
name: Feature request
about: Suggest an idea for this project
title: ''
labels: ''
assignees: ''

---

**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

**Describe the solution you'd like**
A clear and concise description of what you want to happen.

**Describe alternatives you've considered**
A clear and concise description of any alternative solutions or features you've considered.

**Additional context**
Add any other context or screenshots about the feature request here.
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
.nextflow*
docs/_build
3 changes: 3 additions & 0 deletions .vscode/settings.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
{
"restructuredtext.confPath": "${workspaceFolder}/docs"
}
103 changes: 46 additions & 57 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,42 +20,54 @@

## About

MpGAP is an easy to use nextflow docker-based pipeline that adopts well known software for genome assembly of Illumina, Pacbio and Oxford Nanopore sequencing data through illumina only, long reads only or hybrid modes. This pipeline wraps up the following software:
MpGAP is an easy to use nextflow docker-based pipeline that adopts well known software for _de novo_ genome assembly of Illumina, Pacbio and Oxford Nanopore sequencing data through illumina only, long reads only or hybrid modes. This pipeline wraps up the following software:

|| **Source** |
|:- | :- |
| **Assemblers** | [Canu](https://github.com/marbl/canu), [Flye](https://github.com/fenderglass/Flye), [Raven](https://github.com/lbcb-sci/raven), [Shasta](https://github.com/chanzuckerberg/shasta), [wtdbg2](https://github.com/ruanjue/wtdbg2), [Haslr](https://github.com/vpc-ccg/haslr), [Unicycler](https://github.com/rrwick/Unicycler), [Spades](https://github.com/ablab/spades), [Shovill](https://github.com/tseemann/shovill) |
| **Polishers** | [Nanopolish](https://github.com/jts/nanopolish), [Medaka](https://github.com/nanoporetech/medaka), [gcpp](https://github.com/PacificBiosciences/gcpp), [Pilon](https://github.com/broadinstitute/pilon) |
| **Quality check** | [QUAST](https://github.com/ablab/quast), [MultiQC](https://multiqc.info/) |

## Further reading
### Release notes

Are you curious about changes between releases? See the [changelog](markdown/CHANGELOG.md).

* I **strongly**, **vividly**, **mightily** recommend the usage of the latest versions hosted in master branch, which is nextflow's default.
+ The latest will always have support, bug fixes and generally maitain the same processes (I mainly add things instead of removing) that also were in previous versions.
+ But, if you **really** want to execute an earlier release, please [see the instructions for that](markdown/earlier_releases_instructions.md).
* Versions below 2.0 are no longer supported.

### Further reading

This pipeline has two complementary pipelines (also written in nextflow) for [NGS preprocessing](https://github.com/fmalmeida/ngs-preprocess) and [prokaryotic genome annotation](https://github.com/fmalmeida/bacannot) that can give the user a complete workflow for bacterial genomics analyses.

### Feedback

In the pipeline we always try to create a workflow and a execution dynamics that is the most generic possible and is suited for the most possible use cases.

Therefore, feedbacks are very well welcomed. If you believe that your use case is not encompassed in the pipeline, you have enhancement ideas or found a bug, please do not hesitate to [open an issue](https://github.com/fmalmeida/MpGAP/issues/new/choose) to disscuss about it.

## Requirements

This pipeline has only two dependencies: [Docker](https://www.docker.com) and [Nextflow](https://github.com/nextflow-io/nextflow).

* Unix-like operating system (Linux, macOS, etc)
+ Windows users maybe can execute it using the linux subsystem for windows as shown in:
+ https://nextflow.io/blog/2021/setup-nextflow-on-windows.html
+ https://docs.microsoft.com/pt-br/windows/wsl/install-win10
+ https://www.nextflow.io/docs/latest/getstarted.html
+ https://en.wikipedia.org/wiki/Windows_Subsystem_for_Linux
* Java 8 (or higher)
* Nextflow (version 20.01 or higher)
* Docker
* Image: `fmalmeida/mpgap:v2.3`
* Image: `fmalmeida/mpgap:v3.0`

## Installation

1. If you don't have it already install [Docker](https://docs.docker.com/) in your computer.
* After installed, you need to download the required Docker images

docker pull fmalmeida/mpgap:v2.3

> Each release is accompanied by a Dockerfile in the docker folder. When using releases older releases, users can create the correct image using
the Dockerfile that goes alongside with the release (Remember to give the image the correct name, as it is in dockerhub and the nextflow script).
The latest release will always have its docker image in dockerhub.
docker pull fmalmeida/mpgap:v3.0

2. Install Nextflow (version 20.01 or higher):

Expand All @@ -65,65 +77,48 @@ The latest release will always have its docker image in dockerhub.

nextflow run fmalmeida/mpgap --help

> Users can let the pipeline always updated with: `nextflow pull fmalmeida/mpgap`
:fire: Users can let the pipeline always updated with: `nextflow pull fmalmeida/mpgap`

## Documentation

### Explanation of hybrid strategies

Hybrid assemblies can be produced using one of two available strategies:
Hybrid assemblies can be produced with two available strategies. Please read more about the strategies and how to set them up in the [online documentation](https://mpgap.readthedocs.io/en/latest/manual.html#hybrid-assembly-strategies).

:arrow_right: they are chosen with the parameter `--hybrid_strategy`.

#### Strategy 1

By using Unicycler, Haslr and/or SPAdes hybrid assembly modes. For instance, it can use the Unicycler hybrid mode which will first assemble a high quality assembly graph with Illumina data and then it will use long reads to bridge the gaps. More information about Unicycler Hybrid mode can be found [here](https://github.com/rrwick/Unicycler#method-hybrid-assembly).
It uses the hybrid assembly modes from Unicycler, Haslr and/or SPAdes.

#### Strategy 2

By polishing a long reads only assembly with Illumina reads. For that, users will have to set `--strategy_2` to true. This will tell the pipeline to produce a long reads only assembly (with canu, flye, raven or unicycler) and polish it with Pilon (for unpaired reads) or with [Unicycler-polish program](https://github.com/rrwick/Unicycler/blob/master/docs/unicycler-polish.md) (for paired end reads).

> Note that, `--strategy_2` parameter is an alternative workflow, when used, it will execute ONLY strategy 2 and not both strategies. When false, only strategy 1 will be executed.
It produces a long reads only assembly and polishes (correct errors) it with short reads using Pilon (for unpaired reads) or with [Unicycler-polish program](https://github.com/rrwick/Unicycler/blob/main/docs/unicycler-polish.md) (for paired end reads).

#### Example:

nextflow run fmalmeida/mpgap --outdir output --threads 5 --shortreads_paired "path-to/illumina_r{1,2}.fastq" \
--shortreads_single "path-to/illumina_unpaired.fastq" --lr_type 'nanopore' --longreads "path-to/ont_reads.fastq" --strategy_2

### Usage

<a href="https://mpgap.readthedocs.io/en/latest/index.html"><strong>Users are advised to read the complete documentation »</strong></a>

* Complete command line explanation of parameters:
+ `nextflow run fmalmeida/mpgap --help`
* See usage examples in the command line:
+ `nextflow run fmalmeida/mpgap --examples`

### Command line usage examples
```bash
# run the pipeline setting the desired hybrid strategy globally (for all samples)
nextflow run fmalmeida/mpgap \
--output output \
--threads 5 \
--input "samplesheet.yml" \
--hybrid_strategy "both"
```

Command line executions are exemplified [in the manual](https://mpgap.readthedocs.io/en/latest/examples.html).
:fire: This will perform, for **all** samples, both both strategy 1 and strategy 2 hybrid assemblies. Please read more about it in the [manual reference page](https://mpgap.readthedocs.io/en/latest/manual.html) and [samplesheet reference page](https://mpgap.readthedocs.io/en/latest/samplesheet.html).

#### Warnings
### Usage

* Remember to **always** write input paths inside double quotes.
* When using paired end reads it is **required** that input reads are set with the “{1,2}” pattern. For example: “SRR6307304_{1,2}.fastq”. This will properly load reads “SRR6307304_1.fastq” and “SRR6307304_2.fastq”
* When running hybrid assemblies or mixing short read types it is advised to **avoid not required REGEX** and write the full file path, using only the required REGEX for paired end reads when applicable. So that the pipeline does not load any different read that also matches the REGEX and avoid confusions with the inputs.
For understading pipeline usage and configuration, users must read the <a href="https://mpgap.readthedocs.io/en/latest/index.html"><strong>complete online documentation »</strong></a>

### Using the configuration file

All parameters showed above can be, and are advised to be, set through the configuration file. When a configuration file is used the pipeline is executed as `nextflow run fmalmeida/mpgap -c ./configuration-file`. Your configuration file is what will tell the pipeline which type of data you have, and which processes to execute. Therefore, it needs to be correctly configured.

To create a configuration file in your working directory:

* For Hybrid assemblies:

nextflow run fmalmeida/mpgap --get_hybrid_config

* For Long reads only assemblies:

nextflow run fmalmeida/mpgap --get_lreads_config

* For illumina only assemblies:

nextflow run fmalmeida/mpgap --get_sreads_config
* To create a configuration file in your working directory:

nextflow run fmalmeida/mpgap --get_config

### Interactive graphical configuration and execution

Expand Down Expand Up @@ -163,19 +158,13 @@ It will result in the following:

1. Whenever using unicycler with unpaired reads, an odd platform-specific SPAdes-related crash seems do randomly happen as it can be seen in the issue discussed at https://github.com/rrwick/Unicycler/issues/188.
+ As a workaround, Ryan says to use the `--no_correct` parameter which solves the issue and does not have a negative impact on assembly quality.
+ Therefore, if you run into this error when using unpaired data you can activate this workaroud with `--unicycler_additional_parameters "--no_correct"`.
2. Whenever running the pipeline for multiple samples at once using glob patterns such as '*' and '?', users are advised to do not perform hybrid assemblies, nor combining both paired and unpaired short reads in short reads only assemblies. Because the pipeline is not yet trained to properly search for the correct pairs, and since nextflow channels are random, we cannot ensure that the combination of data used in these to assembly types will be right. The pipeline treats each input file as a unique sample, and it will execute it individually.
+ To date, the use of glob patterns only works properly with long reads only assembly, or short reads only assemblies using either paired or unpaired reads, not both at the same time. For example:
+ `nextflow run [...] --longreads 'my_data/*.fastq' --lr_type 'nanopore' --outdir my_results`
+ The pipeline will load and assembly each fastq in the `my_data` folder and assemble it, writing the results for each read in a sub-folder with the reads basename in the `my_results` output folder.
+ `nextflow run [...] --shortreads_single 'my_data/*.fastq' --outdir my_results`
+ The pipeline will load and assembly each fastq in the `my_data` folder and assemble it, writing the results for each read in a sub-folder with the reads basename in the `my_results` output folder.

> However, we are currently working in a proper way to execute the hybrid and combination of short reads in assemblies for multiple samples at once so that users can properly execute it without confusion.

3. Sometimes, shovill assembler can fail and cause the pipeline to fail due to problems in estimating the genome size. This, is actually super simple to solve! Instead of letting the shovill assembler estimate the genome size, you can pass the information to it and prevent its fail:
+ `--shovill_additional_parameters '--gsize 3m'`
+ Therefore, if you run into this error when using unpaired data you can activate this workaroud with:
+ `--unicycler_additional_parameters " --no_correct "`.
2. Sometimes, shovill assembler can fail and cause the pipeline to fail due to problems in estimating the genome size. This, is actually super simple to solve! Instead of letting the shovill assembler estimate the genome size, you can pass the information to it and prevent its fail:
+ `--shovill_additional_parameters " --gsize 3m "`

## Citation

To cite this pipeline users can use our Zenodo tag or directly via the github url. Users are encouraged to cite the programs used in this pipeline whenever they are used.

Please, do not forget to cite the software that were used whenever you use its outputs. See [the list of tools](markdown/list_of_tools.md).
Loading