Documentation
¶
Overview ¶
package pipeline implements a system for running data pipelines on top of the filesystem
Index ¶
- Variables
- func RunPipelines(pipelineDir, inRepo, outRepo, commit, branch, shard string) error
- func WaitPipeline(pipelineDir, pipeline, commit string) error
- type Pipeline
- func (p *Pipeline) Cancel() error
- func (p *Pipeline) Fail() error
- func (p *Pipeline) Finish() error
- func (p *Pipeline) Image(image string) error
- func (p *Pipeline) Input(name string) error
- func (p *Pipeline) Run(cmd []string) error
- func (p *Pipeline) RunPachFile(r io.Reader) error
- func (p *Pipeline) Shuffle(dir string) error
- func (p *Pipeline) Start() error
- type Runner
Constants ¶
This section is empty.
Variables ¶
Functions ¶
func RunPipelines ¶
RunPipelines lets you easily run the Pipelines in one line if you don't care about cancelling them.
func WaitPipeline ¶ added in v0.8.3
WaitPipeline waits for a pipeline to complete. If the pipeline fails ErrFailed is returned.
Types ¶
type Pipeline ¶
type Pipeline struct {
// contains filtered or unexported fields
}
func NewPipeline ¶
func (*Pipeline) Run ¶
Run runs a command in the container, it assumes that `branch` has already been created. Notice that any failure in this function leads to the branch having uncommitted dirty changes. This state needs to be cleaned up before the pipeline is rerun. The reason we don't do it here is that even if we try our best the process crashing at the wrong time could still leave it in an inconsistent state.