This repository has been archived by the owner on May 14, 2022. It is now read-only.

Faster characterization #776

Closed
escowles opened this issue Oct 19, 2016 · 2 comments

Comments

@escowles

Running FITS takes a very long time (up to 80% of the time to ingest a TIFF is spent doing characterization). Explore alternatives such as:

  • Running JHOVE standalone
  • Running a lightweight characterization service first (e.g., ImageMagick's identify), and then running FITS on a low-priority queue
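
A rough sketch of the second option, not code from this repository: gather basic facts with ImageMagick's `identify` during ingest, then put the full FITS run on a low-priority queue. The names `parse_identify`, `quick_characterize`, `ingest`, the job labels, and the priority values are all hypothetical, and the sketch assumes `identify` is on the PATH.

```python
import queue
import subprocess

# Hypothetical priority levels; lower numbers are served first.
HIGH, LOW = 0, 10


def parse_identify(output):
    # Expects "<format> <width> <height>" per line, as produced by the
    # -format string used below. Multi-page TIFFs yield one line per page,
    # so only the first line is read.
    fmt, width, height = output.strip().splitlines()[0].split()
    return {"format": fmt, "width": int(width), "height": int(height)}


def quick_characterize(path):
    # %m = image format, %w = width, %h = height (ImageMagick format escapes).
    result = subprocess.run(
        ["identify", "-format", "%m %w %h\n", path],
        check=True,
        capture_output=True,
        text=True,
    )
    return parse_identify(result.stdout)


def ingest(path, jobs, characterize=quick_characterize):
    # Fast characterization happens inline so ingest can proceed.
    basic = characterize(path)
    # The slow FITS run is deferred; a separate worker (not shown) would
    # drain the queue and run FITS when nothing more urgent is waiting.
    jobs.put((LOW, "fits", path))
    return basic
```

With this shape, ingest only waits on `identify`, and the FITS cost moves to whenever the queue worker gets to it.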
@jpstroop

Whether JHOVE or full-blown FITS, running the characterization as a service so that we're not spinning up the JVM every time would probably give us a good boost.
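
To illustrate the idea, here is a minimal client sketch that assumes a FITS or JHOVE process is already running behind HTTP, so the JVM starts once rather than per file. The base URL, the `/examine` path, and the `file` query parameter are assumptions for illustration and should be checked against whatever service is actually deployed; `examine_url` and `characterize_via_service` are hypothetical names.

```python
import urllib.parse
import urllib.request

# Assumed location of a long-running characterization service.
DEFAULT_BASE = "http://localhost:8080/fits"


def examine_url(base, path):
    # Build the request URL; the file path is passed as a query parameter
    # and percent-encoded so spaces and slashes survive the round trip.
    query = urllib.parse.urlencode({"file": path})
    return f"{base.rstrip('/')}/examine?{query}"


def characterize_via_service(path, base=DEFAULT_BASE, timeout=60):
    # One HTTP round trip per file; no JVM startup on the client side.
    with urllib.request.urlopen(examine_url(base, path), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

The service has to be able to read the file at the given path, so this only works as written when the client and the service share a filesystem.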

@escowles

I think this issue is mostly resolved with ucsdlib/ansible-role-fits#2 — disabling Tika/Tesseract has improved performance, now averaging about 10 seconds per file.

Running a FITS/JHOVE service might be a small improvement on that, however, and I agree it's worth looking into.
