Documentation ¶
Overview ¶
package pipeline implements a system for running data pipelines on top of the filesystem
Index ¶
- Variables
- func RunPipelines(pipelineDir, inRepo, outRepo, commit, branch, shard string) error
- type Pipeline
- func (p *Pipeline) Cancel() error
- func (p *Pipeline) Finish() error
- func (p *Pipeline) Image(image string) error
- func (p *Pipeline) Input(name string) error
- func (p *Pipeline) Run(cmd []string) error
- func (p *Pipeline) RunPachFile(r io.Reader) error
- func (p *Pipeline) Shuffle(dir string) error
- func (p *Pipeline) Start() error
- type Runner
Constants ¶
This section is empty.
Variables ¶
View Source
var ArgCount = errors.New("Illegal argument count.")
View Source
var Cancelled = errors.New("cancelled")
Functions ¶
func RunPipelines ¶
RunPipelines lets you easily run the Pipelines in one line if you don't care about cancelling them.
Types ¶
type Pipeline ¶
type Pipeline struct {
// contains filtered or unexported fields
}
func NewPipeline ¶
func (*Pipeline) Run ¶
Run runs a command in the container, it assumes that `branch` has already been created. Notice that any failure in this function leads to the branch having uncommitted dirty changes. This state needs to be cleaned up before the pipeline is rerun. The reason we don't do it here is that even if we try our best the process crashing at the wrong time could still leave it in an inconsistent state.
Click to show internal directories.
Click to hide internal directories.