irma-scripts

Stand-alone scripts deployed to Miarka

These scripts are deployed to /vulpes/ngi/production/latest/sw/upps_standalone_scripts/ by the miarka-provision process. The script directory is added to PATH when loading the Miarka environment, meaning that these scripts are available on the command-line.

The scripts should contain instructions for usage unless it's obvious how to use them. Preferably, invoking a script without arguments should be safe to run without any side effects and only display usage instructions on stdout

NEVER put any passwords, usernames, tokens, user data or other sensitive information in the scripts. If such information is required by the script, rely on reading it from an environment variable instead.

When adding a new script to this repository, be sure to add a brief description of its purpose below:

concordance_check.sh - bash script to perform concordance check between a vcf file with genotypes and a vcf file with variant calls
deliver_project_to_user.sh - bash wrapper script around the deliver.py script, which should facilitate the delivery for the SNP platform
find_unorganized_flowcells.sh - bash script that verifies that the organized project folder under the DATA directory contains all runfolders in incoming having data from the project in them
link_project_sisyphus_reports.sh - bash script that links sisyphus runfolder reports from the incoming folder to the corresponding project folder under ANALYSIS
set_charon_genotyping_status.sh - bash script to set the genotyping status field in charon to a specified value for samples present in a supplied vcf file
statdump_to_json.pl - perl script that can parse a statdump zipfile created by sisyphus and output the statistics as json
run_FastQC_and_MultiQC.sh - bash script to run FastQC on a specified project in a runfolder. The script will summarize the output in one or several MultiQC-reports.
run_multiqc_bp_qc.sh - A simple wrapper for the MultiQC command used when performing QC of best-practice WGS projects.
project_runfolders.sh - Mainly used to find all runfolders with samplesheets containing a specific project or sample name. Scans incoming for csv-files at most two folders down and greps for the given string, then echoes folder if found.
cleanup_nf_projects.py - Script for cleaning up old analysis nextflow projects. The script will list folders (with full path) that will be deleted and calculate how much data will be removed. It will wait for input from user before removing anything. See usage at the top of the script.
make_nf_run_script.py - Script for generating an sbatch run script for NextFlow rnaseq and methylseq pipelines. See usage at the top of the script.
merge_fastqs.py - Script for merging fastq-files from different lanes / runs per sample.
start_merge.py - Convenience script for merging fastq files in a project per sample, depends on merge_fastqs.py
1_create_reference_tsv.bash - A helper script to create_reference_tsv.py.
create_reference_tsv.py - A script for writing sample info and paths to a WES project's fastq-files in a .tsv-file used by Sarek.
2_create_twist_exome_analysis.bash - This script will use a template, twist_exome_38_template.sbatch, to create a sbatch script to start Sarek for WES analysis.
twist_exome_38_template.sbatch - Template for running Sarek 2.6.1 on WES-data using reference GRCh38.
charon_project_samples_status_update.sh - Script to get all samples in Charon for a supplied project and set the analysis_status to ANALYZED and the status to STALE.
run_hs_metrics.sh - Run CollectHsMtrics for all recalibrated BAM files in a WES project.
bed2interval_list.sh - Example script on how to run picard BedToIntervalList (format needed for run_hs_metrics.sh).
organize_flowcell.py - Script to organize fastq files for a specific runfolder and project prior to analysis.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

irma-scripts

About

Releases

Packages

Contributors 9

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
config		config
run_script_templates		run_script_templates
1_create_reference_tsv.bash		1_create_reference_tsv.bash
2_create_twist_exome_analysis.bash		2_create_twist_exome_analysis.bash
README.md		README.md
archive_folder.sh		archive_folder.sh
bed2interval_list.sh		bed2interval_list.sh
calculate_autosomal_coverage.py		calculate_autosomal_coverage.py
charon_project_samples_status_update.sh		charon_project_samples_status_update.sh
cleanup_nf_projects.py		cleanup_nf_projects.py
concordance_check.sh		concordance_check.sh
create_nf_samplesheet.sh		create_nf_samplesheet.sh
create_reference_tsv.py		create_reference_tsv.py
create_sarek_samplesheet.py		create_sarek_samplesheet.py
deliver_project_to_user.sh		deliver_project_to_user.sh
find_unorganized_flowcells.sh		find_unorganized_flowcells.sh
irma_to_miarka_file_lists.py		irma_to_miarka_file_lists.py
link_project_reports.sh		link_project_reports.sh
make_nf_run_script.py		make_nf_run_script.py
merge_fastqs.py		merge_fastqs.py
multiqc_pipeline_info.py		multiqc_pipeline_info.py
multiqc_sarek_project.sh		multiqc_sarek_project.sh
organize_flowcell.py		organize_flowcell.py
project_runfolders.sh		project_runfolders.sh
run_FastQC_and_MultiQC.sh		run_FastQC_and_MultiQC.sh
run_hs_metrics.sh		run_hs_metrics.sh
sample_list_for_multiqc.py		sample_list_for_multiqc.py
sample_list_template.yaml.j2		sample_list_template.yaml.j2
set_charon_genotyping_status.sh		set_charon_genotyping_status.sh
stage_methylseq_delivery.sh		stage_methylseq_delivery.sh
stage_rnaseq_delivery.sh		stage_rnaseq_delivery.sh
start_merge.py		start_merge.py
twist_exome_38_template.sbatch		twist_exome_38_template.sbatch

Molmed/irma-scripts

Folders and files

Latest commit

History

Repository files navigation

irma-scripts

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 9

Languages

Packages