# Lookup integration test
Created using Ubuntu WSL. Other Linux flavors and macOS may require edits.
## Workflow
The DOT diagram was generated with

```shell
go run toolbelt.go validate_script -script_file=../../../test/data/cfg/lookup/script.json -params_file=../../../test/data/cfg/lookup/script_params_two_runs.json -idx_dag=true
```

and rendered in https://dreampuf.github.io/GraphvizOnline:
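If Graphviz is installed locally, the DOT output can also be rendered with the `dot` CLI instead of the online renderer. A minimal sketch, using a made-up stand-in graph (the node names below are hypothetical, not the actual script nodes):

```shell
# Write a tiny stand-in DAG; in practice, paste the DOT output of
# validate_script here instead (node names below are made up).
cat > /tmp/lookup_dag.dot <<'EOF'
digraph lookup {
  read_orders -> join_orders_with_items;
  read_order_items -> join_orders_with_items;
}
EOF

# With Graphviz installed, render to SVG:
#   dot -Tsvg /tmp/lookup_dag.dot -o /tmp/lookup_dag.svg
cat /tmp/lookup_dag.dot
```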
What's tested:
- table_lookup_table with parallelism (10 batches) and all supported join types (inner and left outer, grouped and ungrouped)
- file_table read from a single file
- table_file with top/limit/order
- single-run (test_one_run.sh) and multi-run (test_two_runs.sh) script execution
The multi-run test simulates a scenario in which an operator validates the loaded order and order item data before proceeding to join orders with order items.
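As a toy illustration of the inner vs left outer join semantics exercised by this test (not Capillaries code — just coreutils `join` on made-up order data):

```shell
# Made-up orders (id, date) and order items (order id, price),
# sorted by key as coreutils `join` requires.
printf 'o1 2021-01-01\no2 2021-01-02\no3 2021-01-03\n' > /tmp/orders.txt
printf 'o1 10.00\no1 5.50\no3 7.25\n' > /tmp/items.txt

echo '--- inner join: only orders that have items ---'
join /tmp/orders.txt /tmp/items.txt

echo '--- left outer join: all orders, matched items where present ---'
join -a 1 /tmp/orders.txt /tmp/items.txt   # -a 1 keeps unpaired lines from file 1
```

Here the left outer join keeps order `o2` even though it has no items, while the inner join drops it.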
## How to test
### Direct node execution
Run `test_exec_nodes.sh` - the Toolbelt executes script nodes one by one, without involving the RabbitMQ workflow.
### Using RabbitMQ workflow (single run)
Make sure the Daemon is running (start it from `pkg/exe/daemon` with `go run daemon.go`).
Run `test_one_run.sh` - the Toolbelt publishes batch messages to RabbitMQ, and the Daemon consumes them and executes all script nodes in parallel as part of a single run.
### Using RabbitMQ workflow (two runs)
Make sure the Daemon is running (start it from `pkg/exe/daemon` with `go run daemon.go`).
Run `test_two_runs.sh` - in the first run, the Toolbelt publishes batch messages to RabbitMQ, and the Daemon consumes them and executes the script nodes that load data from files.
After the first run is complete, the Toolbelt publishes another set of batch messages, and the Daemon consumes them and executes the script nodes that process the loaded data as part of the second run.
This test mimics the "operator validation" scenario.
## Possible edits
Play with the total number of line items (see `-items=...` in `1_create_test_data.sh`).
## References
- Data model design: Brazilian E-Commerce public dataset (https://www.kaggle.com/datasets/olistbr/brazilian-ecommerce)