Skip to content

NAF v1.0.0 - the initial release

Compare
Choose a tag to compare
@KirillKryukov KirillKryukov released this 17 Jan 09:12
· 128 commits to master since this release

The initial release

The initial release of NAF tools. Tested on Windows, Linux and Mac. Tested by compressing and decompressing over 200,000 genomes (2.5 TB), and numerous other datasets.

Provides basic functionality:

  • Compresses a FASTA or FASTQ file, or an input stream, autodetecting format.
  • Decompresses into same format by default.
  • Autodetects, stores and recovers line length.
  • Extracts and stores sequence mask, with the option to ignore it, for both compression and decompression
  • Supports alignments (sequences with gap marked as '-').
  • Supports N and other ambiguous IUPAC nucleotide codes (R, Y, S, W, K, M, B, D, H, V).
  • Can pipe all input and output, enabling use in pipelines.
  • Has partial decompression options for saving time: concatenated DNA sequence, accession numbers, sequence names, lengths, mask, 4-bit encoded sequence.
  • Very fast on low compression levels, while still providing useful compression.
  • Provides state of the art compression strength on high compression levels.