Project name: Fully dynamic de Bruijn Graph Implementation
For mac or linux: Download the code, enter the root directory of the code in the terminal.
Type into the terminal:
make cr
The code will automatically compile and run. The result will be displayed in the terminal.
If you want to clean up, type:
make clean
Defult k value is 51. If you want to change the k value, open the makefile with any text editor. You can change the value accordingly. Recommand k value is from 31 to 51.
# default command line arguments: [input fastq file, k value]
args="data/test.fastq" 51
Current code hasn't implemented the succinct representation of k-mers. So it can not handle the large fastq files.
*.class files.
Test input data: Including fastq file of Illumina sequence reads for part of E.coli.
/ src
/ debruijnGraph # data structures: forest, DBG; hash functions: karp-rabin, minimum perfect hash
/ reference # open source libraries that I use: karp-rabin, minimum perfect hash
/ test # main: entrance of the program
-
Build de Bruijn graph from input string (finished)
-
Build de Bruijn graph from kmers (finished)
-
Rabin Karp Hash (finished)
-
Minimal Perfect Hash (finished)
-
Forest construction (finished)
-
Membership query (finished)
-
Adding/Deleting edges (finished)
-
Adding/Deleting vertices (can be achieved by rehashing every time)