vision

package

v0.0.9 Latest Latest Go to latest Published: Sep 9, 2022 License: AGPL-3.0 Imports: 36 Imported by: 15

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/viamrobotics/rdk

Links

Open Source Insights

Documentation ¶

Overview ¶

Package vision is the service that allows you to access various computer vision algorithms (like detection, segmentation, tracking, etc) that usually only require a camera or image input.

Index ¶

Constants
Variables
func FindFirstName(r robot.Robot) string
func ImageToFloatBuffer(img image.Image) []float32
func ImageToUInt8Buffer(img image.Image) []byte
func Named(name string) resource.Name
func NewServer(s subtype.Service) pb.VisionServiceServer
func NewTFLiteClassifier(ctx context.Context, conf *VisModelConfig, logger golog.Logger) (classification.Classifier, *inf.TFLiteStruct, error)
func NewTFLiteDetector(ctx context.Context, cfg *VisModelConfig, logger golog.Logger) (objectdetection.Detector, *inf.TFLiteStruct, error)
func WrapWithReconfigurable(s interface{}) (resource.Reconfigurable, error)
type Attributes
type Service
type TFLiteClassifierConfig
type TFLiteDetectorConfig
type VisModelConfig
type VisModelType
type VisOperation

Constants ¶

View Source

const (
	TFLiteDetector   = VisModelType("tflite_detector")
	TFDetector       = VisModelType("tf_detector")
	ColorDetector    = VisModelType("color_detector")
	TFLiteClassifier = VisModelType("tflite_classifier")
	TFClassifier     = VisModelType("tf_classifier")
	RCSegmenter      = VisModelType("radius_clustering_segmenter")
	ObjectSegmenter  = VisModelType("object_segmenter")
)

The set of allowed vision model types.

View Source

const (
	VisDetection      = VisOperation("detection")
	VisClassification = VisOperation("classification")
	VisSegmentation   = VisOperation("segmentation")
)

The set of operations supported by the vision model types.

View Source

const RadiusClusteringSegmenter = "radius_clustering"

RadiusClusteringSegmenter is the name of a segmenter that finds well separated objects on a flat plane.

View Source

const SubtypeName = resource.SubtypeName("vision")

SubtypeName is the name of the type of service.

Variables ¶

View Source

var Subtype = resource.NewSubtype(
	resource.ResourceNamespaceRDK,
	resource.ResourceTypeService,
	SubtypeName,
)

Subtype is a constant that identifies the vision service resource subtype.

Functions ¶

func FindFirstName ¶ added in v0.0.8

func FindFirstName(r robot.Robot) string

FindFirstName returns name of first vision service found.

func ImageToFloatBuffer ¶ added in v0.0.9

func ImageToFloatBuffer(img image.Image) []float32

ImageToFloatBuffer reads an image into a byte slice (buffer) the most common sense way. Left to right like a book; R, then G, then B. No funny stuff. Assumes values between -1 and 1.

func ImageToUInt8Buffer ¶ added in v0.0.9

func ImageToUInt8Buffer(img image.Image) []byte

ImageToUInt8Buffer reads an image into a byte slice in the most common sense way. Left to right like a book; R, then G, then B. No funny stuff. Assumes values should be between 0-255.

func Named ¶

func Named(name string) resource.Name

Named is a helper for getting the named vision's typed resource name. RSDK-347 Implements vision's Named.

func NewServer ¶

func NewServer(s subtype.Service) pb.VisionServiceServer

NewServer constructs a vision gRPC service server.

func NewTFLiteClassifier ¶ added in v0.0.9

func NewTFLiteClassifier(ctx context.Context, conf *VisModelConfig,
	logger golog.Logger,
) (classification.Classifier, *inf.TFLiteStruct, error)

NewTFLiteClassifier creates an RDK classifier given a VisModelConfig. In other words, this function returns a function from image-->[]classifier.Classifications. It does this by making calls to an inference package and wrapping the result.

func NewTFLiteDetector ¶

func NewTFLiteDetector(ctx context.Context, cfg *VisModelConfig, logger golog.Logger) (objectdetection.Detector, *inf.TFLiteStruct, error)

NewTFLiteDetector creates an RDK detector given a DetectorConfig. In other words, this function returns a function from image-->[]objectdetection.Detection. It does this by making calls to an inference package and wrapping the result.

func WrapWithReconfigurable ¶

func WrapWithReconfigurable(s interface{}) (resource.Reconfigurable, error)

WrapWithReconfigurable wraps a vision service as a Reconfigurable.

Types ¶

type Attributes ¶

type Attributes struct {
	ModelRegistry []VisModelConfig `json:"register_models"`
}

Attributes contains a list of the user-provided details necessary to register a new vision service.

type Service ¶

type Service interface {
	// detector methods
	GetDetectorNames(ctx context.Context) ([]string, error)
	AddDetector(ctx context.Context, cfg VisModelConfig) error
	RemoveDetector(ctx context.Context, detectorName string) error
	GetDetectionsFromCamera(ctx context.Context, cameraName, detectorName string) ([]objdet.Detection, error)
	GetDetections(ctx context.Context, img image.Image, detectorName string) ([]objdet.Detection, error)
	// classifier methods
	GetClassifierNames(ctx context.Context) ([]string, error)
	AddClassifier(ctx context.Context, cfg VisModelConfig) error
	RemoveClassifier(ctx context.Context, classifierName string) error
	GetClassificationsFromCamera(ctx context.Context, cameraName, classifierName string, n int) (classification.Classifications, error)
	GetClassifications(ctx context.Context, img image.Image, classifierName string, n int) (classification.Classifications, error)
	// segmenter methods
	GetSegmenterNames(ctx context.Context) ([]string, error)
	GetSegmenterParameters(ctx context.Context, segmenterName string) ([]utils.TypedName, error)
	GetObjectPointClouds(ctx context.Context, cameraName, segmenterName string, params config.AttributeMap) ([]*viz.Object, error)
}

A Service that implements various computer vision algorithms like detection and segmentation.

func FirstFromRobot ¶ added in v0.0.8

func FirstFromRobot(r robot.Robot) (Service, error)

FirstFromRobot returns the first vision service in this robot.

func FromRobot ¶

func FromRobot(r robot.Robot, name string) (Service, error)

FromRobot is a helper for getting the named vision service from the given Robot.

func New ¶

func New(ctx context.Context, r robot.Robot, config config.Service, logger golog.Logger) (Service, error)

New registers new detectors from the config and returns a new object detection service for the given robot.

func NewClientFromConn ¶

func NewClientFromConn(ctx context.Context, conn rpc.ClientConn, name string, logger golog.Logger) Service

NewClientFromConn constructs a new Client from connection passed in.

type TFLiteClassifierConfig ¶ added in v0.0.9

type TFLiteClassifierConfig struct {
	// this should come from the attributes part of the detector config
	ModelPath  string  `json:"model_path"`
	NumThreads int     `json:"num_threads"`
	LabelPath  *string `json:"label_path"`
}

TFLiteClassifierConfig specifies the fields necessary for creating a TFLite classifier.

type TFLiteDetectorConfig ¶

type TFLiteDetectorConfig struct {
	// this should come from the attributes part of the detector config
	ModelPath  string  `json:"model_path"`
	NumThreads int     `json:"num_threads"`
	LabelPath  *string `json:"label_path"`
	ServiceURL *string `json:"service_url"`
}

TFLiteDetectorConfig specifies the fields necessary for creating a TFLite detector.

type VisModelConfig ¶ added in v0.0.9

type VisModelConfig struct {
	Name       string              `json:"name"`
	Type       string              `json:"type"`
	Parameters config.AttributeMap `json:"parameters"`
}

VisModelConfig specifies the name of the detector, the type of detector, and the necessary parameters needed to build the detector.

type VisModelType ¶ added in v0.0.9

type VisModelType string

VisModelType defines what vision models are known by the vision service.

type VisOperation ¶ added in v0.0.9

type VisOperation string

VisOperation defines what types of operations are allowed by the vision service.

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL