Documentation ¶
Index ¶
- Constants
- Variables
- func GetModelDefCache() *cache.FileCache
- func PopulateExpTrialsMetrics(pgdb *db.PgDB, masterConfig *config.Config, trivialMetrics bool, batches int) error
- type AllocationMetadata
- type ExperimentRequestQuery
- type ExperimentSnapshotShimError
- type LogrusLogFn
- type Master
- type TaskLogBackend
- type TrialLogBackend
Constants ¶
const (
	// MIMEApplicationXTar is Tar's MIME type.
	MIMEApplicationXTar = "application/x-tar"
	// MIMEApplicationGZip is GZip's MIME type.
	MIMEApplicationGZip = "application/gzip"
	// MIMEApplicationZip is Zip's MIME type.
	MIMEApplicationZip = "application/zip"
)
const (
	// InvalidHPKillDelay is the delay before we forcibly kill a trial that said it had an invalid HP.
	InvalidHPKillDelay = 10 * time.Second
)
Variables ¶
var SummaryMetricStatistics = []string{"last", "max", "mean", "min"}
SummaryMetricStatistics lists values possibly queryable within summary metrics.
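One way a caller might use this list is to reject unknown statistic names before building a query. The sketch below is illustrative only: it copies the slice into a local variable so it compiles on its own, and `isSummaryStatistic` is a hypothetical helper, not part of this package.

```go
package main

import "fmt"

// summaryMetricStatistics mirrors the exported SummaryMetricStatistics value.
var summaryMetricStatistics = []string{"last", "max", "mean", "min"}

// isSummaryStatistic is a hypothetical helper that reports whether name is
// one of the listed summary statistics.
func isSummaryStatistic(name string) bool {
	for _, s := range summaryMetricStatistics {
		if s == name {
			return true
		}
	}
	return false
}

func main() {
	fmt.Println(isSummaryStatistic("mean"))   // true
	fmt.Println(isSummaryStatistic("median")) // false
}
```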
var (
	// TrialAvailableSeriesBatchWaitTime is exported to be changed by tests.
	TrialAvailableSeriesBatchWaitTime = 15 * time.Second
)
Functions ¶
func GetModelDefCache ¶
GetModelDefCache returns the FileCache object.
Types ¶
type AllocationMetadata ¶
type AllocationMetadata struct {
	AllocationID     model.AllocationID
	TaskType         model.TaskType
	Username         string
	WorkspaceName    string
	ResourcePool     string
	ExperimentID     int
	Slots            int
	StartTime        time.Time
	EndTime          time.Time
	ImagepullingTime float64
	SlotHours        float64
}
AllocationMetadata captures the historic allocation information for a given task.
type ExperimentRequestQuery ¶
ExperimentRequestQuery contains values for the experiments request queries with defaults already applied. This should be kept in sync with the expected queries from ParseExperimentsQuery.
func ParseExperimentsQuery ¶
func ParseExperimentsQuery(apiCtx echo.Context) (*ExperimentRequestQuery, error)
ParseExperimentsQuery parses queries for the experiments endpoint.
type ExperimentSnapshotShimError ¶
type ExperimentSnapshotShimError struct {
Message string
}
ExperimentSnapshotShimError describes an error encountered while shimming.
func (ExperimentSnapshotShimError) Error ¶
func (e ExperimentSnapshotShimError) Error() string
type LogrusLogFn ¶
type LogrusLogFn func(format string, args ...interface{})
LogrusLogFn is an interface for all the logrus Levelf log functions.
type Master ¶
Master manages the Determined master state.
func (*Master) Info ¶
func (m *Master) Info() aproto.MasterInfo
Info returns this master's information.
func (*Master) ResolveResources ¶
func (m *Master) ResolveResources(
	resourcePool string,
	slots int,
	workspaceID int,
	isSingleNode bool,
) (rm.ResourcePoolName, []pkgCommand.LaunchWarning, error)
ResolveResources validates the resource pool and checks for availability.
func (*Master) Run ¶
Run causes the Determined master to connect to the database and begin listening for HTTP requests.
gRPCLogInitDone is closed when the grpclog package's logger singletons are set. This is just used by tests to soothe -race, since we asynchronously launch a gRPC server and connect with a gRPC client, in the same program, using the same singletons.
type TaskLogBackend ¶
type TaskLogBackend interface {
	TaskLogs(
		taskID model.TaskID,
		limit int,
		filters []api.Filter,
		order apiv1.OrderBy,
		state interface{},
	) ([]*model.TaskLog, interface{}, error)
	AddTaskLogs([]*model.TaskLog) error
	TaskLogsCount(taskID model.TaskID, filters []api.Filter) (int, error)
	TaskLogsFields(taskID model.TaskID) (*apiv1.TaskLogsFieldsResponse, error)
	DeleteTaskLogs(taskIDs []model.TaskID) error
	// MaxTerminationDelay is the max delay before a consumer can be sure all logs have been
	// received. A better interface may be an interface for streaming, rather than helper
	// interfaces to aid streaming, but it's not bad enough to motivate changing it.
	MaxTerminationDelay() time.Duration
}
TaskLogBackend is an interface that task log backends, such as elastic or postgres, must support to provide the features surfaced in our API.
type TrialLogBackend ¶
type TrialLogBackend interface {
	TrialLogs(
		trialID, limit int,
		filters []api.Filter,
		order apiv1.OrderBy,
		state interface{},
	) ([]*model.TrialLog, interface{}, error)
	TrialLogsCount(trialID int, filters []api.Filter) (int, error)
	TrialLogsFields(trialID int) (*apiv1.TrialLogsFieldsResponse, error)
	DeleteTrialLogs(trialIDs []int) error
}
TrialLogBackend is an interface that trial log backends, such as elastic or postgres, must support to provide the features surfaced in our API. This interface is deprecated. It no longer supports adding logs, in favor of unified logs.
Source Files ¶
- api.go
- api_agents.go
- api_auth.go
- api_checkpoint.go
- api_command.go
- api_experiment.go
- api_generic_tasks.go
- api_job.go
- api_logretention.go
- api_master.go
- api_model.go
- api_notebook.go
- api_project.go
- api_resourcepool.go
- api_runs.go
- api_shell.go
- api_task.go
- api_tasks.go
- api_tensorboard.go
- api_trials.go
- api_user.go
- api_workspace.go
- audit.go
- cache.go
- checkpoint_gc.go
- core.go
- core_checkpoint.go
- core_experiment.go
- core_observability.go
- core_searcher.go
- core_task.go
- experiment.go
- experiment_filter.go
- experiment_job_service.go
- populate_metrics.go
- resource_allocation.go
- restore.go
- routes.go
- spec_util.go
- trial.go