Documentation ¶
Overview ¶
Package cmd of the Dataflow kit contains the following CLI daemons:
- fetch.d service downloads html content from web pages to feed Dataflow kit scrapers.
- parse.d service parses html content from web pages following the rules described in configuration JSON file.
- fetch.cli CLI tool for fetch.d service.
Directories ¶
Path | Synopsis |
---|---|
Fetcher CLI of the Dataflow kit downloads html content from web pages via Fetcher service endpoint.
|
Fetcher CLI of the Dataflow kit downloads html content from web pages via Fetcher service endpoint. |
Fetcher service of the Dataflow kit downloads html content from web pages to feed Dataflow kit scrapers.
|
Fetcher service of the Dataflow kit downloads html content from web pages to feed Dataflow kit scrapers. |
Parse service of the Dataflow kit parses html content from web pages following the rules described in configuration JSON file.
|
Parse service of the Dataflow kit parses html content from web pages following the rules described in configuration JSON file. |
Click to show internal directories.
Click to hide internal directories.