Directories ¶
Path | Synopsis |
---|---|
span-check runs quality checks on input data
|
span-check runs quality checks on input data |
Given as single file with crossref works API message, create a potentially smaller file, which contains only the most recent version of each document.
|
Given as single file with crossref works API message, create a potentially smaller file, which contains only the most recent version of each document. |
span-export creates various destination formats, mostly for SOLR.
|
span-export creates various destination formats, mostly for SOLR. |
Freeze file containing urls along with the content of all urls.
|
Freeze file containing urls along with the content of all urls. |
span-reshape is a dumbed down span-import.
|
span-reshape is a dumbed down span-import. |
span-join-assets combines a directory of json or single column TSV configurations into a single file.
|
span-join-assets combines a directory of json or single column TSV configurations into a single file. |
span-oa-filter will set x.oa to true, if the given KBART file validates a record.
|
span-oa-filter will set x.oa to true, if the given KBART file validates a record. |
redact intermediate schema
|
redact intermediate schema |
span-tag takes an intermediate schema file and a configuration forest of filters for various tags and runs all filters on every record of the input to produce a stream of tagged records.
|
span-tag takes an intermediate schema file and a configuration forest of filters for various tags and runs all filters on every record of the input to produce a stream of tagged records. |
span-update-labels takes a TSV of an IDs and ISILs and updates an intermediate schema record x.labels field accordingly.
|
span-update-labels takes a TSV of an IDs and ISILs and updates an intermediate schema record x.labels field accordingly. |
Click to show internal directories.
Click to hide internal directories.