Skip to content

Documentation

cneud edited this page Oct 25, 2012 · 29 revisions

The technical and research partners in IMPACT have developed more than 20 different tools for various stages in the OCR process. Generally speaking, all of these tools operate on image or text data, either by modifying the data or by extracting information from it. IMPACT has therefore also developed an interoperability framework that allows for a loose coupling of these tools and the exchange of data between them.

The IMPACT Interoperability Framework comprises the following components:

In the first step, command line tools are wrapped as web services with the help of the toolwrapper and according tool specifications.

Using the derived workflow modules it is then possible to form a pipeline of the tools where the output of one tool is used as input for the next tool.

An important incentive for creating such a framework is that the historical material that libraries, archives and other content holders are digitising in large quantities is very different in nature. Because there is no optimal combination of tools (called a workflow) for every purpose, users have to be enabled to try and evaluate certain combinations to find their optimal workflow.

The following related articles might also be of interest:

Clone this wiki locally