GUNC workflow

	Developed by the Bork Group in collaboration with nf-core Raise an issue or contact us See our other Software & Services	Contributors: Mahdi Robbani Christian Schudoma Daniel Podlesny	Collaborators: Jose Espinosa James A. Fellows Yates
The development of this workflow was supported by NFDI4Microbiota

Description

The GUNC workflow is a nextflow workflow for the detection of chimerism & contamination in prokaryotic genomes resulting from mis-binning of contigs from unrelated lineages. The workflow is based on the CheckM and GUNC (Genome UNClutterer) tools. GUNC applies an entropy based score on taxonomic assignment and the contig location of all genes in a genome.

Citation

This workflow:

Also cite:

Orakov A, Fullam A, Coelho LP, et al. GUNC: detection of chimerism and contamination in prokaryotic genomes. Genome Biol. 2021;22(1):178. Published 2021 Jun 13. doi:10.1186/s13059-021-02393-0
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25(7):1043-1055. doi:10.1101/gr.186072.114
Ewels PA, Peltzer A, Fillinger S, et al. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020;38(3):276-278. doi:10.1038/s41587-020-0439-x

An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.

Overview

Run CheckM (CheckM)
Run GUNC (GUNC)

Usage

Cloud-based Workflow Manager (CloWM)

This workflow will be available on the CloWM platform (coming soon).

Command-Line Interface (CLI)

You can run the pipeline using:

nextflow run gunc \
   -profile <docker/singularity/.../institute> \
   --input samplesheet.csv \
   --outdir <OUTDIR>

Input files

The input is a csv samplesheet with your input data that looks as follows:

samplesheet.csv:

id,group,assembler,fasta
test_minigut,0,MEGAHIT,https://github.com/nf-core/test-datasets/raw/mag/assemblies/MEGAHIT-test_minigut.contigs.fa.gz

Each row represents a metagenomic bin.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.devcontainer		.devcontainer
.github		.github
assets		assets
bin		bin
conf		conf
docs		docs
lib		lib
modules		modules
subworkflows/local		subworkflows/local
workflows		workflows
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitpod.yml		.gitpod.yml
.nf-core.yml		.nf-core.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.prettierrc.yml		.prettierrc.yml
CHANGELOG.md		CHANGELOG.md
CITATIONS.md		CITATIONS.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
clowm_info.json		clowm_info.json
main.nf		main.nf
modules.json		modules.json
nextflow.config		nextflow.config
nextflow_schema.json		nextflow_schema.json
pyproject.toml		pyproject.toml
tower.yml		tower.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GUNC workflow

Description

Citation

Overview

Usage

Cloud-based Workflow Manager (CloWM)

Command-Line Interface (CLI)

Input files

About

Releases 3

Packages

Contributors 3

Languages

License

grp-bork/gunc_workflow

Folders and files

Latest commit

History

Repository files navigation

GUNC workflow

Description

Citation

Overview

Usage

Cloud-based Workflow Manager (CloWM)

Command-Line Interface (CLI)

Input files

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 3

Packages 0

Contributors 3

Languages

Packages