See https://github.com/dainst/file-index.
- Java 8
usage: neofindertoes [options] FILE_OR_DIRECTORY1 [FILE_OR_DIRECTORY2
[FILE_OR_DIRECTORY3] ...]
Options:
-A,--autocorrect enables auto correction:
- try to fix lines with less
columns than the header
specifies
- if only one date column could
be parsed assign this value to
both date fields
(for catalog parsing only)
-a,--address <ADDRESS> the address of the
elasticsearch index
(omitting this the local
loopback address will be used)
-c,--catalog parse and import
cdfinder/neofinder catalog
files
-e,--esclustername <NAME> the name of the elasticsearch
cluster
(omitting this the default name
'elasticsearch' will be used)
-h,--help print this message
-i,--indexname <NAME> the name of the elasticsearch
index
(omitting this the name
'marbildertivoli' will be used)
-I,--ignore <FIELDLIST> the field or fields to ignore
potentially invalid data for
if multiple fields are
specified they must be comma
separated
(use with care as this may
create records that will miss
the specified fields)
(for catalog parsing only)
--iknowiamonlyimportingpathandvolume import only path and volume
(for catalog parsing only)
-m,--mimetype <STRATEGY> the mime type fetch strategy to
use:
0: no mime type information is
fetched (default)
1: mime type is 'guessed' based
on file extension
2: mime type is detected by
inspecting the file (most
accurate but slow)
(for file system scanning only)
-n,--newindex create a new elasticsearch
index
(if an old one with the same
name exists it will be deleted)
-r,--readerrors ignore read errors
(for file system scanning only)
-v,--verbose show JSON objects that are
added to the index
0 - success
1 - unrecognized command line option
2 - failed to parse command line
3 - failed to create elasticsearch index
4 - unknown field given for -I
5 - dublicate paths in catalog file
6 - elasticsearch host not found
7 - could not connect to elasticsearch cluster
Prerequisites:
- Maven
mvn clean package