run_md5.nf

Ernesto Lowy edited this page Apr 8, 2021 · 9 revisions

This workflow calculates the MD5 hash of one or more files. If multiple files are provided, the hashes are calculated concurrently to speed up processing.

Dependencies

md5sum

Docker image

The recommended way to run this pipeline is with the following Docker image:
https://hub.docker.com/repository/docker/elowy01/ubuntu_img

Nextflow configuration file

The configuration file that can be used with this pipeline can be found here

Nextflow workflow

This workflow is implemented in the script named run_md5.nf

Usage

nextflow -C run_md5.config run run_md5.nf --list <path_list.txt> --cpus <int>

Parameters

--list

File listing the paths to analyse, one path per line.
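
A path list can be produced with standard shell tools. The sketch below is an illustration only: the directory and file names are hypothetical, and writing absolute paths is an assumption made so that the list does not depend on the directory the workflow is launched from.

```shell
# Hypothetical example: create two small files and list them, one path per line
workdir=$(mktemp -d)
mkdir -p "$workdir/data"
printf 'a\n' > "$workdir/data/sample1.txt"
printf 'b\n' > "$workdir/data/sample2.txt"

# find prints one absolute path per line because the start directory is absolute;
# sort keeps the order of the list stable between runs
find "$workdir/data" -type f | sort > "$workdir/path_list.txt"
cat "$workdir/path_list.txt"
```

The resulting file can then be passed to the workflow with --list.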

--cpus

Number of CPUs the workflow will use. When several files are listed, more CPUs allow more hashes to be calculated at the same time, so the run finishes sooner.
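
To illustrate why more CPUs help, the sketch below hashes the files named in a list with several md5sum processes running at once. It is not the workflow's implementation, only a plain-shell analogy; the CPUS variable and the file names are hypothetical, and it assumes an xargs that supports -P (GNU and BSD versions do).

```shell
# Illustration only: hash every file named in a list, up to CPUS at a time
workdir=$(mktemp -d)
printf 'hello\n' > "$workdir/a.txt"
printf 'world\n' > "$workdir/b.txt"
printf '%s\n' "$workdir/a.txt" "$workdir/b.txt" > "$workdir/path_list.txt"

CPUS=2
# -n 1 passes one path to each md5sum process; -P runs up to $CPUS of them in parallel.
# Assumes the paths contain no whitespace or quotes, which xargs would split on.
xargs -n 1 -P "$CPUS" md5sum < "$workdir/path_list.txt" > "$workdir/hashes.txt"
cat "$workdir/hashes.txt"
```

Because each file is hashed independently, the work divides cleanly across processes; with a single file, extra CPUs bring no benefit.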

Output

The workflow creates a .txt file containing the MD5 hash for each file analysed. These .txt files are written to a directory named result_md5.
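
A stored hash can later be used to check that a file has not changed. The sketch below assumes the hash file holds standard md5sum output (the hash followed by the file path); this page does not state the exact format or naming of the files in result_md5, so the file name used here is hypothetical and the example generates its own hash file.

```shell
# Illustration only: record a hash in md5sum format, then verify the file against it
workdir=$(mktemp -d)
printf 'hello\n' > "$workdir/a.txt"

# Hypothetical output file name; the workflow's own naming may differ
md5sum "$workdir/a.txt" > "$workdir/a.txt.md5.txt"

# -c re-computes the hash of each listed file and compares it with the stored value
md5sum -c "$workdir/a.txt.md5.txt"
```

md5sum -c exits with a non-zero status if any file fails the comparison, which makes it easy to use in scripts.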