Demultiplexer

Given four input fastq files (two with biological reads, two with index reads) and a list of known indexes, this program demultiplexes reads by index-pair. It outputs one R1 fastq file and one R2 fastq file per matching index-pair, two more fastq files for non-matching index-pairs (index-hopping), and two additional fastq files for read-pairs in which one or both index reads are unknown or low quality.
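The three categories above can be sketched as a small classification function. This is only an illustration, not the logic in `demux.py`: the function names, the Phred+33 encoding, and the mean-quality cutoff of 30 are all assumptions, and the sketch further assumes that index2 has already been oriented so that a properly matched pair compares equal to index1.

```python
def mean_phred(quality, offset=33):
    """Mean Phred score of a quality string (Phred+33 encoding is assumed)."""
    if not quality:
        return 0.0
    return sum(ord(char) - offset for char in quality) / len(quality)


def classify_index_pair(index1, index2, qual1, qual2, known_indexes, min_mean_q=30):
    """Return 'matched', 'hopped', or 'unknown' for one pair of index reads.

    min_mean_q is a hypothetical cutoff; the README does not state one.
    """
    # An index containing N, or any other sequence not in the known set, is unknown.
    if index1 not in known_indexes or index2 not in known_indexes:
        return "unknown"
    # Low-quality index reads go to the same bucket as unknown ones.
    if mean_phred(qual1) < min_mean_q or mean_phred(qual2) < min_mean_q:
        return "unknown"
    # Both indexes are known: same index means matched, different means hopped.
    return "matched" if index1 == index2 else "hopped"
```

Checking set membership first means an index with an `N` base never reaches the quality test, since it cannot appear in the list of known indexes.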

The sequence of each index-pair will be added to the header of BOTH reads in all fastq files for all categories (e.g. “AAAAAAAA-CCCCCCCC” will be appended to the headers of every read-pair that had an index1 of AAAAAAAA and an index2 of CCCCCCCC).
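A minimal sketch of the header annotation, assuming the index-pair is appended after a single space. The separator and the function name `annotate_header` are assumptions for illustration; only the `index1-index2` form comes from the description above.

```python
def annotate_header(header, index1, index2):
    """Append 'index1-index2' to a fastq header line.

    The trailing newline is stripped first so the index-pair lands on the
    header line itself; the caller adds the newline back when writing.
    """
    return f"{header.rstrip()} {index1}-{index2}"


# The same annotated suffix is applied to the R1 and R2 headers of a read-pair.
r1_header = annotate_header("@SEQ_ID 1:N:0:1\n", "AAAAAAAA", "CCCCCCCC")
r2_header = annotate_header("@SEQ_ID 4:N:0:1\n", "AAAAAAAA", "CCCCCCCC")
```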

Final output stats files will report the number of read-pairs with properly matched indexes (per index-pair), the number of read-pairs with index-hopping observed, and the number of read-pairs with unknown index(es).

Input

- 4 fastq files (one read pair, one index pair)
- A text file with a list of known index sequences

argparse options:
- -f, --files: required arg, Paths to input fastq files (one read pair, one index pair)
- -i, --indexes: required arg, Path to file containing known index sequences + sample information
- -d, --direct: required arg, Path to output directory
- -s, --stats: required arg, Name for output stats files
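The options above could be declared with `argparse` roughly as follows. This is a sketch rather than the parser in `demux.py`: the flag names and help text come from the list, while `nargs=4` for `--files`, the function name `build_parser`, and the example file names are assumptions.

```python
import argparse


def build_parser():
    """Build a parser for the four required options listed above."""
    parser = argparse.ArgumentParser(
        description="Demultiplex paired-end reads by index-pair."
    )
    # nargs=4 is an assumption: one read pair plus one index pair.
    parser.add_argument("-f", "--files", nargs=4, required=True,
                        help="Paths to input fastq files (one read pair, one index pair)")
    parser.add_argument("-i", "--indexes", required=True,
                        help="Path to file containing known index sequences + sample information")
    parser.add_argument("-d", "--direct", required=True,
                        help="Path to output directory")
    parser.add_argument("-s", "--stats", required=True,
                        help="Name for output stats files")
    return parser


# Parse an explicit argument list (hypothetical file names) instead of sys.argv.
example_args = build_parser().parse_args([
    "-f", "R1.fastq", "R2.fastq", "R3.fastq", "R4.fastq",
    "-i", "indexes.txt",
    "-d", "output",
    "-s", "stats_final",
])
```

With `required=True` on every option, `argparse` exits with a usage message if any of the four is missing.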

Output

- A pair of fastq files per known index-pair, a pair for index-hopped read-pairs, and a pair for reads with unknown or low quality index-pairs
- Summary stats files
  - % and # of read-pairs with matched indexes, read-pairs with index-hopping, and read-pairs with unknown indexes
  - % and # of read-pairs with matched indexes reported per index-pair
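The count and percentage for each row of the stats files can be derived from simple tallies. The sketch below assumes the tallies are kept in `collections.Counter` objects; the function name `summarize` and the example counts are made up for illustration and are not results from this repository.

```python
from collections import Counter


def summarize(counts):
    """Map each key to (count, percent of the total across all keys)."""
    total = sum(counts.values())
    return {
        key: (n, 100 * n / total if total else 0.0)
        for key, n in counts.items()
    }


# Illustrative tallies only, not real output.
overall = Counter(matched=6, hopped=3, unknown=1)
per_pair = Counter({"AAAAAAAA-AAAAAAAA": 4, "CCCCCCCC-CCCCCCCC": 2})

overall_stats = summarize(overall)    # categories as a share of all read-pairs
per_pair_stats = summarize(per_pair)  # each index-pair as a share of matched read-pairs
```

Note that `summarize(per_pair)` reports percentages relative to matched read-pairs only; dividing by the overall total instead would be an equally reasonable choice, and the README does not say which one the stats files use.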

Repository files

- Assignment-the-first
- TEST-input_FASTQ
- TEST-output_FASTQ
- Bioinfo.py
- README.md
- demux.py
- demux.sh
- stats_final_ind.tsv
- stats_final_overall.tsv

czakarian/Demultiplex
