Use pbzip2/pigz to decompress corpora if available #947

Merged: 6 commits, Apr 6, 2020

Commits on Apr 2, 2020

  1. Use pbzip2/pigz to decompress corpora if available

    Decompressing large corpora with the standard bzip2/gzip libraries
    can be slow because they use only one CPU core. Use pbzip2/pigz,
    if available, to speed up the process by using all cores.
    dliappis committed Apr 2, 2020
    9730b93
  2. Don't open file prematurely

    dliappis committed Apr 2, 2020
    c51403c
  3. Change warn

    dliappis committed Apr 2, 2020
    fee8c00
  4. 746fb50
  5. c6575b4
  6. Switch to logging only

    dliappis committed Apr 2, 2020
    476e3fd
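
The approach described in the first commit message can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: the `decompress` function, the `DECOMPRESSORS` mapping, and the return values are hypothetical names chosen for the example. It assumes `pbzip2` and `pigz` accept the usual `-d` (decompress), `-k` (keep input), and `-c` (write to stdout) flags, and falls back to Python's standard `bz2`/`gzip` modules when the parallel tool is not on the `PATH`.

```python
import bz2
import gzip
import logging
import shutil
import subprocess

logger = logging.getLogger(__name__)

# Hypothetical mapping: archive extension -> (parallel CLI tool, stdlib fallback opener)
DECOMPRESSORS = {
    ".bz2": ("pbzip2", bz2.open),
    ".gz": ("pigz", gzip.open),
}


def decompress(archive_path, target_path):
    """Decompress archive_path into target_path.

    Returns the name of the external tool that was used, or "stdlib"
    if the single-threaded standard library fallback was used.
    """
    for extension, (tool, fallback_open) in DECOMPRESSORS.items():
        if archive_path.endswith(extension):
            break
    else:
        raise ValueError(f"Unsupported archive type: {archive_path}")

    tool_path = shutil.which(tool)
    if tool_path is not None:
        # Open the target only once we know the external tool will run.
        with open(target_path, "wb") as target:
            # -d: decompress, -k: keep the input file, -c: write to stdout
            result = subprocess.run(
                [tool_path, "-d", "-k", "-c", archive_path], stdout=target
            )
        if result.returncode == 0:
            return tool
        logger.warning(
            "%s exited with code %d; falling back to the standard library.",
            tool, result.returncode,
        )
    else:
        logger.info("%s not found; using the standard library instead.", tool)

    # Single-threaded fallback: stream the archive through bz2/gzip.
    with fallback_open(archive_path, "rb") as source, open(target_path, "wb") as target:
        shutil.copyfileobj(source, target)
    return "stdlib"
```

Calling `decompress("documents.json.bz2", "documents.json")` (file names are placeholders) would use `pbzip2` when it is installed and the `bz2` module otherwise; the missing tool is reported through logging only, in the spirit of the later commits.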