Skip to content

Documentation

cneud edited this page Oct 25, 2012 · 29 revisions

The technical and research partners in IMPACT have developed more than 20 different tools for various stages in the OCR process. Generally speaking, all of these tools operate on image or text data, either by modifying the data or by extracting information from it. IMPACT has therefore also developed an interoperability framework that allows for a loose coupling of these tools and the exchange of data between them.

The IMPACT Interoperability Framework comprises the following components:

In the first step, command line tools are wrapped as web services with the help of the toolwrapper and according tool specifications. The derived web service can in turn be wrapped again in a workfow module for the Taverna workflow system, a so called composite workflow.

Using the workflow modules it is then possible via drag-and-drop operation in the user interface of the workflow system to form a pipeline of the tools where the output of one tool is used as input for the next tool.

An important incentive for creating such a framework is that the historical material that libraries, archives and other content holders are digitising in large quantities is very diverse in nature. Because there is no optimal combination of tools for every source material and purpose, users have to be enabled to try and evaluate various combinations to find their optimal workflow.

The following related articles might also be of interest:

Clone this wiki locally