Website • Slack • Docs
# Model serving at scale
Cortex is a platform for deploying, managing, and scaling machine learning in production.
## Key features
- Run realtime inference, batch inference, and training workloads.
- Deploy TensorFlow, PyTorch, ONNX, and other models to production.
- Scale to handle production workloads with server-side batching and request-based autoscaling.
- Configure rolling updates and live model reloading to update APIs without downtime.
- Serve models efficiently with multi-model caching and spot / preemptible instances.
- Stream performance metrics and structured logs to any monitoring tool.
- Perform A/B tests with configurable traffic splitting (see the sketch below).
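Traffic splitting, for instance, is configured in the same declarative YAML used for APIs. The sketch below assumes Cortex's `TrafficSplitter` kind and two already-deployed APIs (`text-generator` and a hypothetical `text-generator-v2`); the weights are illustrative.

```yaml
# ab_test.yaml (illustrative sketch)

- name: text-generator-ab
  kind: TrafficSplitter
  apis:
    - name: text-generator     # current API: 90% of traffic
      weight: 90
    - name: text-generator-v2  # hypothetical candidate: 10% of traffic
      weight: 10
```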
## How it works

### Implement a Predictor

```python
# predictor.py

from transformers import pipeline

class PythonPredictor:
    def __init__(self, config):
        # load the text generation model once, at API startup
        self.model = pipeline(task="text-generation")

    def predict(self, payload):
        # generate from the "text" field and return the first sequence
        return self.model(payload["text"])[0]
```
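Because the predictor is plain Python, it can be smoke-tested locally before deploying. A minimal check, assuming `transformers` is installed and `predictor.py` is on the import path (the empty config dict stands in for the API's `config` section):

```python
# local sanity check, not part of the deployed API
from predictor import PythonPredictor

predictor = PythonPredictor(config={})
print(predictor.predict({"text": "hello world"}))
# prints a dict such as {"generated_text": "hello world ..."}
```

The API itself is configured in YAML: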
```yaml
# text_generator.yaml

- name: text-generator
  kind: RealtimeAPI
  predictor:
    type: python
    path: predictor.py
  compute:
    gpu: 1
    mem: 8Gi
  autoscaling:
    min_replicas: 1
    max_replicas: 10
```
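Beyond replica bounds, request-based autoscaling can be tuned per API, for example with a target in-flight request count per replica. The field name below is based on Cortex's documented autoscaling options; verify it against the docs for your version:

```yaml
  autoscaling:
    min_replicas: 1
    max_replicas: 10
    target_replica_concurrency: 4  # assumed knob: desired in-flight requests per replica
```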
### Deploy

```bash
$ cortex deploy text_generator.yaml
# creating http://example.com/text-generator
```
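Once deployed, the CLI can report on the API; `cortex get` shows its current state (the exact output varies by version):

```bash
$ cortex get text-generator
# reports the API's status, endpoint, and replica counts
```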
### Serve prediction requests

```bash
$ curl http://example.com/text-generator \
    -X POST -H "Content-Type: application/json" \
    -d '{"text": "hello world"}'
```
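The endpoint is plain HTTP, so any client works. An equivalent request in Python, assuming the `requests` library and the endpoint URL printed by `cortex deploy`:

```python
import requests

# same request as the curl command above
response = requests.post(
    "http://example.com/text-generator",
    json={"text": "hello world"},  # requests sets the JSON Content-Type header
)
print(response.json())
```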