Documentation ¶
Overview ¶
Package gomrjob is a Go library for Hadoop MapReduce jobs.
It provides a lightweight framework for writing map and reduce steps, plus a Runner that submits jobs and chains the steps together.
Constants ¶
const VERSION = "1.1.2"
Variables ¶
This section is empty.
Functions ¶
func LoadAndValidateFlags ¶
func LoadAndValidateFlags()
LoadAndValidateFlags loads flag values from the environment and checks for missing required arguments.
Types ¶
type Runner ¶
type Runner struct {
	Name  string
	Steps []Step
	// InputFiles can be of the format `/pattern/to/files*.gz` or `hdfs:///pattern/to/files*.gz` or `s3://bucket/pattern`
	InputFiles         []string
	Output             string // fully qualified
	ReducerTasks       int
	PassThroughOptions []string // CLI arguments to $exe when run as map / reduce tasks
	CompressOutput     bool
	CacheFiles         []string          // -files
	Files              []string          // -file
	Properties         map[string]string // -D key=value arguments to mapreduce-streaming.jar
	JobType            JobType
	// contains filtered or unexported fields
}
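The comment on Properties says its entries become `-D key=value` arguments to mapreduce-streaming.jar. A minimal sketch of that mapping follows. It assumes each entry is emitted as a separate `-D` token followed by `key=value`, in sorted key order. `propertyArgs` is a hypothetical helper, not a gomrjob function, and the library's actual ordering and formatting may differ.

```go
package main

import (
	"fmt"
	"sort"
)

// propertyArgs is a hypothetical helper (not part of gomrjob) that turns a
// Properties-style map into "-D", "key=value" argument pairs.
func propertyArgs(props map[string]string) []string {
	keys := make([]string, 0, len(props))
	for k := range props {
		keys = append(keys, k)
	}
	// Map iteration order is unspecified in Go, so sort for stable output.
	sort.Strings(keys)

	args := make([]string, 0, 2*len(keys))
	for _, k := range keys {
		args = append(args, "-D", k+"="+props[k])
	}
	return args
}

func main() {
	props := map[string]string{
		"mapreduce.job.queuename": "default",
		"mapreduce.map.memory.mb": "2048",
	}
	fmt.Println(propertyArgs(props))
}
```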
type StepReducerTasksCount ¶
type StepReducerTasksCount interface {
NumberReducerTasks() int
}
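A step type would satisfy StepReducerTasksCount by defining a NumberReducerTasks method. The sketch below mirrors the interface locally so it compiles on its own. The step types `wordCountStep` and `plainStep` and the helper `reducerTasksFor` are hypothetical. Falling back to a default such as Runner.ReducerTasks when a step does not implement the interface is an assumption about how the interface is consumed, not confirmed library behaviour.

```go
package main

import "fmt"

// StepReducerTasksCount mirrors the interface documented above so that this
// sketch is self-contained.
type StepReducerTasksCount interface {
	NumberReducerTasks() int
}

// wordCountStep is a hypothetical step that declares its own reducer count.
type wordCountStep struct{}

// NumberReducerTasks makes wordCountStep satisfy StepReducerTasksCount.
func (wordCountStep) NumberReducerTasks() int { return 8 }

// plainStep is a hypothetical step that does not declare a reducer count.
type plainStep struct{}

// reducerTasksFor returns the step's own count when the step implements
// StepReducerTasksCount, and the fallback otherwise (assumed behaviour).
func reducerTasksFor(step interface{}, fallback int) int {
	if s, ok := step.(StepReducerTasksCount); ok {
		return s.NumberReducerTasks()
	}
	return fallback
}

func main() {
	fmt.Println(reducerTasksFor(wordCountStep{}, 2))
	fmt.Println(reducerTasksFor(plainStep{}, 2))
}
```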