A gentle command line tool for harvesting OAI-PMH XML data provided by ConedaKOR (Frankfurt)
Building the jar file (ffm.jar) with Maven:
$ cd /ffm-import/
$ mvn clean install
The generated jar file is located under ./target/ffm-import-jar-with-dependencies.jar
Copy to parent folder and rename the file from ffm-import-jar-with-dependencies.jar
to ffm.jar
.
Running the programm:
$ java -Xms1g -Xmx2g -jar ffm.jar -c ./conf -d ./data
Options:
-c, --config The configuration directory. ./data by default. -d, --data" The data directory contains temporary and output files. ./conf by default. -ts, --timestamp The timestamp for the import. The current timestamp in format yyyy-MM-dd-HH-mm-ss by default.
Example:
$ java -Xms1g -Xmx2g -jar ffm.jar -c ./conf -d ./data -ts $(date +"%Y-%m-%d-%H-%M-%S")
To generate separate harvesting and transforming applications change mainClass element value of plugin element in pom.xml from de.prometheus.bildarchiv.Application to de.prometheus.bildarchiv.Harvester or de.prometheus.bildarchiv.Transformer respectively
<plugin>
<artifactId>maven-assembly-plugin</artifactId>
<configuration>
<finalName>ffm-import</finalName>
<archive>
<manifest>
<mainClass>de.prometheus.bildarchiv.Application</mainClass>
</manifest>
</archive>
<descriptorRefs>
<descriptorRef>jar-with-dependencies</descriptorRef>
</descriptorRefs>
</configuration>
<executions>
<execution>
<id>make-assembly</id> <!-- this is used for inheritance merges -->
<phase>package</phase> <!-- bind to the packaging phase -->
<goals>
<goal>single</goal>
</goals>
</execution>
</executions>
</plugin>
Rebuild and run with above commands. Make sure data directory specified is the same for Harvester and Transformer.