Documentation ¶
Index ¶
- type EngineStatus
- type EngineTracker
- type P
- func (p *P) AddOrUpdateEngineStatus(srv engineCommunicator, engineStatus *v1.EngineStatus, ...) error
- func (p *P) DumpStatus() *Status
- func (p *P) MaxInProgressTaskDuration() time.Duration
- func (p *P) NumEnginesByTenantID() map[string]int
- func (p *P) NumInProgressTasks() int
- func (p *P) NumQueuedTasks() int32
- func (p *P) ProcessTaskResult(taskResult *v1.TaskResult, clusterInfo *auth.ClusterInfo) error
- func (p *P) RemoveEngine(engineID string, clusterInfo *auth.ClusterInfo) error
- func (p *P) Run(ctx context.Context) error
- func (p *P) SendChatCompletionTask(ctx context.Context, tenantID string, req *v1.CreateChatCompletionRequest, ...) (*http.Response, error)
- func (p *P) SendEmbeddingTask(ctx context.Context, tenantID string, req *v1.CreateEmbeddingRequest, ...) (*http.Response, error)
- type Status
- type TaskStatus
- type TenantStatus
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type EngineStatus ¶
type EngineStatus struct { RegisteredModelIDs []string `json:"registeredModelIds"` InProgressModelIDs []string `json:"inProgressModelIds"` Tasks []*TaskStatus `json:"tasks"` }
EngineStatus is the status of an engine.
type EngineTracker ¶ added in v1.2.0
type EngineTracker struct {
// contains filtered or unexported fields
}
EngineTracker trackes the status of the engines across multiple server pods.
func NewEngineTracker ¶ added in v1.2.0
func NewEngineTracker(store *store.S, localPodName, localPodIP string) *EngineTracker
NewEngineTracker creates a new EngineTracker.
type P ¶
type P struct {
// contains filtered or unexported fields
}
P processes inference tasks.
func (*P) AddOrUpdateEngineStatus ¶
func (p *P) AddOrUpdateEngineStatus( srv engineCommunicator, engineStatus *v1.EngineStatus, clusterInfo *auth.ClusterInfo, ) error
AddOrUpdateEngineStatus adds or updates the engine status.
func (*P) DumpStatus ¶
DumpStatus dumps the status of the processor.
func (*P) MaxInProgressTaskDuration ¶
MaxInProgressTaskDuration returns the maximum duration of in-progress tasks.
func (*P) NumEnginesByTenantID ¶
NumEnginesByTenantID returns the number of engines by tenant ID.
func (*P) NumInProgressTasks ¶
NumInProgressTasks returns the number of in-progress tasks.
func (*P) NumQueuedTasks ¶
NumQueuedTasks returns the number of queued tasks.
func (*P) ProcessTaskResult ¶
func (p *P) ProcessTaskResult( taskResult *v1.TaskResult, clusterInfo *auth.ClusterInfo, ) error
ProcessTaskResult processes the task result.
func (*P) RemoveEngine ¶
func (p *P) RemoveEngine(engineID string, clusterInfo *auth.ClusterInfo) error
RemoveEngine removes the engine.
type Status ¶
type Status struct {
Tenants map[string]*TenantStatus `json:"tenants"`
}
Status is the status of the processor.
type TaskStatus ¶
TaskStatus is the status of a task.
type TenantStatus ¶
type TenantStatus struct {
Engines map[string]*EngineStatus `json:"engines"`
}
TenantStatus is the status of a tenant.