msg

package

v0.2.1 Latest Latest Go to latest Published: Sep 1, 2023 License: MIT Imports: 1 Imported by: 0

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/swdunlop/llm-go

Links

Open Source Insights

Documentation ¶

Overview ¶

Package msg describes the protocol used between the NATS client and worker.

Index ¶

Constants
type Error
- func (e Error) Error() string
type InterruptRequest
type InterruptResponse
type ListRequest
type ListResponse
type ModelInfo
type OptionInfo
type PredictRequest
type PredictResponse
- func (msg PredictResponse) String() string
type StreamResponse
- func (msg StreamResponse) String() string
type WorkerRequest
type WorkerResponse

Constants ¶

View Source

const (
	ErrUnknown            = iota // omitted error code, indicates an unknown error
	ErrIllegibleRequest          // request was not a valid JSON object
	ErrInvalidRequest            // request is missing required fields or has invalid values
	ErrUnsupportedCommand        // command was not found
	ErrJobNotFound               // job was not found, usually due to interrupt
	ErrShuttingDown              // server is shutting down and will not accept new jobs
	ErrBusy                      // server is busy and cannot accept new jobs at this time
	ErrPredictionFailed          // prediction failed
	ErrInterrupted               // request was interrupted
	ErrHookFailed                // a request hook failed
	ErrInvalidModel              // the model could not be loaded
)

Error codes.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type Error ¶

type Error struct {
	Code int    `json:"code,omitempty"`
	Err  string `json:"error"`
}

Error is used to indicate that a request failed.

func (Error) Error ¶

func (e Error) Error() string

Error implements the error interface by returning the Err field, ignoring the Code field.

type InterruptRequest ¶

type InterruptRequest struct {
}

InterruptRequest is sent to the worker on the worker subject to request an interruption.

type InterruptResponse ¶

type InterruptResponse struct {
}

InterruptResponse is sent as a reply to InterruptRequest once the interruption is complete.

type ListRequest ¶ added in v0.2.1

type ListRequest struct {
}

ListRequest is sent to the worker on the worker subject to request a list of available models.

type ListResponse ¶ added in v0.2.1

type ListResponse struct {
	// Models contains the list of available models.
	Models []ModelInfo `json:"models,omitempty"`
}

ListResponse is sent as a reply to ListRequest to show information about the worker and its models.

type ModelInfo ¶ added in v0.2.1

type ModelInfo struct {
	// Model is the name of the model, as used by PredictRequest.
	Model string `json:"model,omitempty"`

	// Options contains the list of settings that can be used to configure the model and their defaults.
	Options []llm.Option `json:"options,omitempty"`
}

ModelInfo contains information about a model.

type OptionInfo ¶ added in v0.2.1

type OptionInfo struct {
	// Name is the name of the option.
	Name string `json:"name,omitempty"`

	// Default is the default value for the option.
	Default string `json:"default,omitempty"`

	// Description is a human readable description of the option.
	Description string `json:"description,omitempty"`
}

OptionInfo contains information about a model option.

type PredictRequest ¶

type PredictRequest struct {
	// System contains the portion of the input that should be kept regardless of token context limits.  Warning:
	// if the System text is larger than the token context limits, the model will return an error.
	System string `json:"system,omitempty"`

	// Input contains the portion of the input that should be used to predict tokens.  Input may be truncated, losing
	// lines of text from the beginning of the input to fit within the token context limits.
	Input []string `json:"input,omitempty"`

	// Stream identifies a NATS subject where incremental results should be sent as they are produced.  This subject will
	// receive a stream of PredictStream messages in JSON format.
	//
	// No more Stream messages will be sent after the PredictResponse is sent.
	Stream string `json:"stream,omitempty"`

	// Options provides additional options used by the worker to perform the prediction.  This may be omitted or nil.
	Options map[string]any `json:"options,omitempty"`
}

PredictRequest is sent to the worker on the worker subject to request a prediction.

type PredictResponse ¶

type PredictResponse struct {
	// Output contains the predicted text.
	Output string `json:"output,omitempty"`
}

PredictResponse is sent as a reply to PredictRequest once the prediction is complete.

func (PredictResponse) String ¶

func (msg PredictResponse) String() string

String implements the fmt.Stringer and llm.Predictor interface by returning the Output field.

type StreamResponse ¶

type StreamResponse struct {
	// Output contains the predicted text.
	Output string `json:"output,omitempty"`
}

StreamResponse is optionally sent if the Stream subject is provided in a PredictRequest.

func (StreamResponse) String ¶

func (msg StreamResponse) String() string

String implements the fmt.Stringer and llm.Predictor interface by returning the Output field.

type WorkerRequest ¶

type WorkerRequest struct {
	// Job identifies the request, which will be present in responses and can be used to cancel long running requests.
	Job string `json:"job,omitempty"`

	// List is a request to list the available models.
	List *ListRequest `json:"list,omitempty"`

	// Predict is a request to predict the next token.
	Predict *PredictRequest `json:"predict,omitempty"`

	// Interrupt is a request to interrupt a long running request.
	Interrupt *InterruptRequest `json:"interrupt,omitempty"`
}

WorkerRequest is sent to the worker on the worker subject ask it to do something on behalf of the client. Only one of the pointer fields should be non-nil.

type WorkerResponse ¶

type WorkerResponse struct {
	// Job matches the job id from the WorkerRequest.
	Job string `json:"job,omitempty"`

	// List is a response to a ListRequest.
	List *ListResponse `json:"list,omitempty"`

	// Predict is a response to a PredictRequest.
	Predict *PredictResponse `json:"predict,omitempty"`

	// Interrupt is a response to an InterruptRequest.
	Interrupt *InterruptResponse `json:"interrupt,omitempty"`

	// Stream is a response to a PredictRequest that contains incremental results.
	Stream *StreamResponse `json:"stream,omitempty"`

	// Error is a response to any request that failed.
	Error *Error `json:"error,omitempty"`
}

WorkerResponse is sent from the worker on the worker subject to reply to a WorkerRequest. Only one of the pointer fields should be non-nil. An empty WorkerResponse is sent if the response will be streamed.

Source Files ¶

View all Source files

protocol.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL