coroutine

package module
v0.9.3 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jun 28, 2024 License: Apache-2.0 Imports: 4 Imported by: 2

README

build Go Reference Apache 2 License Discord

coroutine

This project contains a durable coroutine compiler and runtime library for Go.

Note Read our announcement blog post: Fairy tales of workflow orchestration.

Usage

The coroutine package can be used as a simple library to create coroutines in a Go program, allowing the function passed as entry point to the coroutine to be paused at yield points and later resumed by the caller.

When pausing, the coroutine yields a value that is received by the caller, and on resumption the caller can send back a value that the coroutine obtains as result.

Creating Coroutines

The following code example shows how to create a coroutine and yield values to the caller:

// main.go
package main

import "github.com/dispatchrun/coroutine"

func main() {
    coro := coroutine.New[int, any](func() {
        for i := 0; i < 3; i++ {
            coroutine.Yield[int, any](i)
        }
    })

    for coro.Next() {
        println(coro.Recv())
    }
}

Executing the program produces the following output:

$ go run main.go
0
1
2

coroutine.New and coroutine.Yield are the main functions that applications use to create coroutines and declare yield points. An important observation to make here is the fact that the functions have two generic type parameters (we name then R and S) to declare the type of values that the program can receive from and send to the coroutine. The types passed to Yield must correspond to the types used when creating the coroutine, if the types mismatch, the coroutine panics at the yield point.

Terminating Coroutines

Coroutines hold state in order to resume from the yield point and ensure that operations such as deferred function calls will be executed, therefore they need to be driven to completion by calling Next until it returns false, which indicates that the entry point of the coroutine as exited.

Sometimes, a program may need to prematurely interrupt a coroutine before it reached completion; this can be done by calling Stop, which prevents returning from the current yield point. Stop only marks the coroutine as interrupted, it is still necessary for the program to call Next in order to drive the code to completion, running deferred function calls, and returning from the entry point.

Note yielding from a coroutine after it was stopped is a fatal error; therefore, it is not advised to yield from defers as it would result in a crash if the defer was executed after stopping the coroutine.

Often times, the simplest construct to drive coroutine executions is to use the Run function:

package main

import "github.com/dispatchrun/coroutine"

func main() {
    coro := coroutine.New[int, any](func() {
        for i := 0; i < 3; i++ {
            coroutine.Yield[int, any](i)
        }
    })

    // The callback function is invoked for each value received from the
    // coroutine, and returns the value that will be sent back.
    coroutine.Run(coro, func(v int) any {
        println(v)
        return nil
    })
}
Using Coroutines

Coroutines can be a powerful building block to represent cooperative scheduling constructs, the program has complete autonomy when it comes to deciding when a coroutine is resumed and gets to carry on to its next operation.

The part of the program driving the execution of coroutines can be seen as a local scheduler for a sub-part of the code.

Another useful property of coroutines is that, just like functions, they can be composed. A parent coroutine can create more coroutines for which it can drive execution in a subcontext of the program, and yield values to its caller that applied computations from the values received from the sub-routines.

Design Decisions

There are other coroutine implementations in Go that have taken slightly different approaches to express how to yield to the caller; for example, the research on the subject that was laid out by Russ Cox in Coroutines for Go demonstrates how the API could be based on passing a yield function to the coroutine entrypoint.

A quality of this model is it maximizes type safety since the code will not compile if the yield function is misused. It also makes it somewhat explicit which part of the code is a coroutine, since the yield function must be passed through the code up to the location where yield points are reached.

As everything is always a trade off in software engineering, there are also problems that can arise with this model; for example, having to explicitly accept and pass the yield function through the coroutine code can introduce function coloring problems in Go programs. Incrementally introducing coroutines in existing code becomes challenging as all functions on the call stack up to the yield point must be coroutine-aware, and participate in passing the yield point through the call stack. Sometimes, those changes are made even more difficult by the control flow traversing code paths that the programmer does not have control over (e.g., the Go standard library or other dependencies). Such changes can leak deep into the code base, as functions or type signatures may have to be changed, breaking interface implementations that can sometimes only be detected at runtime.

In this coroutine package, we took a different approach and leveraged goroutine local storage (GLS) to transparently pass the coroutine context through the call stack. This model offers more flexibility to incrementally evolve existing code into using coroutines, it also makes the declaration of yield points are a local decision of the function that yields, leaving the rest of the code clear of the responsibility of participating in the coroutine control flow.

A limitation of this model is that it creates implicit coupling between the place where the coroutine in created, and the places where the yield points are declared; the coroutine type must match the type of the yield point, and while in most case static analysis can validate correctness, there are cases where validation may only happen at runtime and requires the code paths to be tested to ensure that no type mismatches would occur on the coroutine code paths.

Durable Coroutines

Coroutines are functions that can be suspended and resumed. Durable coroutines are functions that can be suspended, serialized and resumed in another process.

Warning This section documents highly experimental capabilities of the coroutine package, changes will likely be made, use at your own risks!

The project contains a source-to-source Go compiler named coroc which compiles volatile coroutines into durable versions where the state captured by a coroutine can be encoded into a binary representation that can be saved to a storage medium and later reloaded to restart computation where it was left off.

Generate durable programs

To install the coroc compiler:

go install github.com/dispatchrun/coroutine/compiler/cmd/coroc@latest

Then to compile a package:

coroc ./path/to/package

This will generate files named *_durable.go and set build tags on source files that need to be excluded when building in durable mode. Because the compiler may need to generate coroutines in code paths of the standard Go library, it creates a copy under ./goroot in the module directory.

The standard Go toolchain can then be used to compile the application in durable mode:

GOROOT=$PWD/goroot go build -tags durable .

Pro tip A common pattern is to use a go:generate directive in the main application package to trigger the compilation of the durable files:

//go:generate coroc

package main

func main() {
    ...
}

This creates a tighter integration with the Go toolchain, as the compilation of durable files can be executed with:

go generate
Saving and Restoring

In durable mode, the state of coroutine.Coroutine values can be marshaled into a byte slice for storage beyond the lifetime of the application, and resumed on restart.

The following program is adapted from the previous example to save and restore the coroutine state from a file. In volatile mode, it only prints the first yielded value; but in durable mode, progress made by the coroutine is captured and restored over multiple executions.

//go:generate coroc

package main

import (
    "errors"
    "flag"
    "io/fs"
    "log"
    "os"

    "github.com/dispatchrun/coroutine"
)

func work() {
    for i := 0; i < 3; i++ {
        coroutine.Yield[int, any](i)
    }
}

func main() {
    var state string
    flag.StringVar(&state, "state", "coroutine.state", "Location of the coroutine state file")
    flag.Parse()

    coro := coroutine.New[int, any](work)

    if coroutine.Durable {
        b, err := os.ReadFile(state)
        if err != nil {
            if !errors.Is(err, fs.ErrNotExist) {
                log.Fatal(err)
            }
        } else if err := coro.Context().Unmarshal(b); err != nil {
            log.Fatal(err)
        }

        defer func() {
            if b, err := coro.Context().Marshal(); err != nil {
                log.Fatal(err)
            } else if err := os.WriteFile(state, b, 0666); err != nil {
                log.Fatal(err)
            }
        }()
    }

    if coro.Next() {
        println("yield:", coro.Recv())
    }
}

When building in volatile mode (the default), the program runs a single step of the coroutine and loses its state, each run of the application starts back at the beginning:

$ go build
$ ./main
yield: 0
$ ./main
yield: 0
$ ./main
yield: 0

However, when building in durable mode, the program saves the coroutine state and restores it for each run, it keeps making progress across executions:

$ go generate && GOROOT=$PWD/vendor/goroot go build -tags durable
$ ./main
yield: 0
$ ./main
yield: 1
$ ./main
yield: 2

Warning At this time, the state of a coroutine is bound to a specific version of the program, attempting to resume a state on a different version is not supported.

More examples of how to use durable coroutines can be found in examples.

Extend serialization

coroutine is able to seamlessly serialize and deserialize most types by default. However there are times when you may want to control the serialization of specific types. For example, chan values are not supported, or you may decide that some values need specific logic to be functional upon deserialization. See the coroutine/types package for the tools to take control of serialization of the coroutine state.

Scheduling

Pausing, marshaling, unmarshalling, and resuming durable coroutines is work for a scheduler which is not included in this package. The coroutine project only provides the building blocks needed to create those types of systems.

Note This is an area of development that we are excited about, feel free to reach out if you would like to learn or discuss more in details about it!

Language Support

The coroc compiler currently supports a subset of Go when compiling coroutines to durable mode.

The compiler currently does not support compiling coroutines that contain the go keyword, control structures with goto or fallthrough statements, or for loop post statements with function calls. Those limitations will be lifted in the future but as of now have not proven necessary to support compiling durable coroutines in common Go programs.

Note that none of those restrictions apply to code that is not on the call path of coroutines.

Performance

The code generated by coroc has been tested for correctness but has not been extensively benchmarked, it is expected that the durable form of coroutines will have compute and memory footprint than when running in volatile mode.

Contributing

Pull requests are welcome! Anything that is not a simple fix would probably benefit from being discussed in an issue first.

Remember to be respectful and open minded!

Community

Documentation

Index

Constants

View Source
const Durable = false

Durable is a constant which takes the values true or false depending on whether the program is built with the "durable" tag.

Variables

View Source
var (
	// ErrNotDurable is an error that occurs when attempting to
	// serialize a coroutine that is not durable.
	ErrNotDurable = errors.New("only durable coroutines can be serialized")

	// ErrInvalidState is an error that occurs when attempting to
	// deserialize a coroutine that was serialized in another build.
	ErrInvalidState = errors.New("durable coroutine was serialized in another build")
)

Functions

func Run

func Run[R, S any](c Coroutine[R, S], f func(R) S) R

Run executes a coroutine to completion, calling f for each value that the coroutine yields, and sending back each value that f returns.

If c was constructed with NewWithReturn, the return value of the coroutine is returned by Run. Otherwise, the zero-value is returned and can be ignored by the caller.

func Yield

func Yield[R, S any](v R) S

Yield sends v to the generator and pauses the execution of the coroutine until the Next method is called on the associated generator.

The function panics when called on a stack where no active coroutine exists, or if the type parameters do not match those of the coroutine.

Types

type Context

type Context[R, S any] struct {
	// contains filtered or unexported fields
}

Context is passed to a coroutine and flows through all functions that Yield (or could yield).

func LoadContext

func LoadContext[R, S any]() *Context[R, S]

LoadContext returns the context for the current coroutine.

The function panics when called on a stack where no active coroutine exists, or if the type parameters do not match those of the coroutine.

func (*Context[R, S]) Marshal

func (c *Context[R, S]) Marshal() ([]byte, error)

func (*Context[R, S]) Unmarshal

func (c *Context[R, S]) Unmarshal(b []byte) error

func (*Context[R, S]) Yield

func (c *Context[R, S]) Yield(v R) S

type Coroutine

type Coroutine[R, S any] struct {
	// contains filtered or unexported fields
}

Coroutine instances expose APIs allowing the program to drive the execution of coroutines.

The type parameter R represents the type of values that the program can receive from the coroutine (what it yields), and the type parameter S is what the program can send back to a coroutine yield point.

func New

func New[R, S any](f func()) Coroutine[R, S]

New creates a new coroutine which executes f as entry point.

func NewWithReturn

func NewWithReturn[R, S any](f func() R) Coroutine[R, S]

New creates a new coroutine which executes f as entry point.

func (Coroutine[R, S]) Context

func (c Coroutine[R, S]) Context() *Context[R, S]

Context returns the coroutine's associated Context.

func (Coroutine[R, S]) Done

func (c Coroutine[R, S]) Done() bool

Done returns true if the coroutine completed, either because it was stopped or because its function returned.

func (Coroutine[R, S]) Next

func (c Coroutine[R, S]) Next() bool

Next executes the coroutine until its next yield point, or until completion. The method returns true if the coroutine entered a yield point, after which the program should call Recv to obtain the value that the coroutine yielded, and Send to set the value that will be returned from the yield point.

func (Coroutine[R, S]) Recv

func (c Coroutine[R, S]) Recv() R

Recv returns the last value that the coroutine has yielded. The method must be called only after a call to Next has returned true, or the return value is undefined. Calling the method multiple times after a call to Next returns the same value each time.

func (Coroutine[R, S]) Result

func (c Coroutine[R, S]) Result() R

Result is the return value of the coroutine, if it was constructed with NewWithReturn. Result should only be called once Next returns false, indicating that the coroutine finished executing.

func (Coroutine[R, S]) Send

func (c Coroutine[R, S]) Send(v S)

Send sets the value that will be seen by the coroutine after it resumes from a yield point. Calling the method multiple times before a call to Next does not result in sending multiple values, only the last value sent will be seen by the coroutine.

func (Coroutine[R, S]) Stop

func (c Coroutine[R, S]) Stop()

Stop interrupts the coroutine. On the next call to Next, the coroutine will not return from its yield point; instead, it unwinds its call stack, calling each defer statement in the inverse order that they were declared.

Stop is idempotent, calling it multiple times or after completion of the coroutine has no effect.

This method is just an interrupt mechanism, the program does not have to call it to release the coroutine resources after completion.

Directories

Path Synopsis
gen
Package types contains the infrastructure needed to support serialization of Go types.
Package types contains the infrastructure needed to support serialization of Go types.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL