span-tag

command
v0.1.290 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 9, 2019 License: GPL-3.0 Imports: 18 Imported by: 0

Documentation

Overview

span-tag takes an intermediate schema file and a configuration forest of filters for various tags and runs all filters on every record of the input to produce a stream of tagged records.

TODO(miku): Allow to skip label attachment by inspecting a SOLR index on the fly. Calculate label attachments for record, query index for doi or similar id, if the preferred source is already in the index, drop the label. If the unpreferred source is indexed, we cannot currently update the index, so just emit a warning and do not change anything.

$ span-tag -c '{"DE-15": {"any": {}}}' < input.ldj > output.ldj

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL