Documentation ¶
Overview ¶
Package traverse provides primitives for concurrent and parallel traversal of slices or user-defined collections.
Example ¶
package main import ( "math/rand" "github.com/grailbio/base/traverse" ) func main() { // Compute N random numbers in parallel. const N = 1e5 out := make([]float64, N) traverse.Parallel.Range(len(out), func(start, end int) error { r := rand.New(rand.NewSource(rand.Int63())) for i := start; i < end; i++ { out[i] = r.Float64() } return nil }) }
Output:
Index ¶
Examples ¶
Constants ¶
This section is empty.
Variables ¶
var Parallel = T{Limit: 2 * runtime.GOMAXPROCS(0)}
Parallel is the default traverser for parallel traversal, intended CPU-intensive parallel computing. Parallel limits the number of concurrent invocations to a small multiple of the runtime's available processors.
Functions ¶
Types ¶
type Reporter ¶
type Reporter interface { // Init is called when processing is about to begin. Parameter // n indicates the number of tasks to be executed by the traversal. Init(n int) // Complete is called after the traversal has completed. Complete() // Begin is called when task i is begun. Begin(i int) // End is called when task i has completed. End(i int) }
A Reporter receives events from an ongoing traversal. Reporters can be passed as options into Traverse, and are used to monitor progress of long-running traversals.
func NewSimpleReporter ¶
NewSimpleReporter returns a new reporter that prints the number of queued, running, and completed tasks to stderr.
func NewTimeEstimateReporter ¶
NewTimeEstimateReporter returns a reporter that reports the number of jobs queued, running, and done, as well as the running time of the Traverse and an estimate for the amount of time remaining. Note: for estimation, it assumes jobs have roughly equal running time and are FIFO-ish (that is, it does not try to account for the bias of shorter jobs finishing first and therefore skewing the average estimated job run time).
type T ¶
type T struct { // Limit is the traverser's concurrency limit: there will be no more // than Limit concurrent invocations per traversal. A limit value of // zero (the default value) denotes no limit. Limit int // Reporter receives status reports for each traversal. It is // intended for users who wish to monitor the progress of large // traversal jobs. Reporter Reporter }
A T is a traverser: it provides facilities for concurrently invoking functions that traverse collections of data.
func (T) Each ¶
Each performs a traversal on fn. Specifically, Each invokes fn(i) for 0 <= i < n, managing concurrency and error propagation. Each returns when the all invocations have completed, or after the first invocation fails, in which case the first invocation error is returned. Each also propagates panics from underlying invocations to the caller.
func (T) Range ¶
Range performs ranged traversal on fn: n is split is split into contiguous ranges, and fn is invoked for each range. The range sizes are determined by the traverser's concurrency limits. Range allows the caller to amortize function call costs, and is typically used when limit is small and n is large, for example on parallel traversal over large collections, where each item's processing time is comparatively small.