model-mesh-triton-adapter

command
v0.12.0-rc0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 22, 2024 License: Apache-2.0 Imports: 7 Imported by: 0

README

Model Mesh Triton Adapter

This is an adapter which implements the internal model-mesh model management API for Triton Inference Server.

How to

  1. Clone the repository

    $ git clone https://github.com/kserve/modelmesh-runtime-adapter.git
    $ cd modelmesh-runtime-adapter/model-mesh-triton-adapter
    
  2. Pull Triton Serving Docker Image

    $ docker pull nvcr.io/nvidia/tritonserver:20.09-py3
    
  3. Run Triton Serving Container with model data mounted

    By default, Triton Serving Docker expose Port 8000 for HTTP and Port 8001 for gRPC.

    Using following command to forward container's 8000 to your workstation's 8000 and container's 8001 to your workstation's 8001.

    $ docker run -p 8000:8000 -p 8001:8001 -v $(pwd)/examples/models:/models nvcr.io/nvidia/tritonserver:20.09-py3 tritonserver --model-store=/models --model-control-mode=explicit --strict-model-config=false --strict-readiness=false
    
  4. Setup your Golang, gRPC and Protobuff Development Environment locally

    Follow this gRPC Go Quick Start Guide

  5. Run Triton adapter with:

    $ export ROOT_MODEL_DIR=$(pwd)/examples/models
    $ export CONTAINER_MEM_REQ_BYTES=268435456 # 256MB
    $ go run main.go
    
  6. Test adapter with this client from another terminal:

    $ go run triton/adapter_client/adapter_client.go
    

Documentation

The Go Gopher

There is no documentation for this package.

Directories

Path Synopsis
generated
mocks
Package mock_triton is a generated GoMock package.
Package mock_triton is a generated GoMock package.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL