taskranker

package module

v1.1.0 Latest Latest Go to latest Published: Jul 10, 2020 License: Apache-2.0 Imports: 9 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/pradykaushik/task-ranker

README ¶

Task Ranker

Rank tasks running as docker containers in a cluster.

Task Ranker runs as a cron job on a specified schedule. Each time the task ranker is run, it fetches data from Prometheus, filters the data as required and then submits it to a task ranking strategy. The task ranking strategy uses the data received to calibrate currently running tasks on the cluster and then rank them accordingly. The results of the strategy are then fed back to the user through callbacks.

You will need to have a working Golang environment running at least 1.12 and a Linux environment.

How To Use?

Run the below command to download and install Task Ranker.

go get github.com/pradykaushik/task-ranker

Environment

Task Ranker can be used in environments where,

Prometheus is used to collect container specific metrics from hosts on the cluster that are running docker containers.
cAdvisor, a docker native metrics exporter is run on the hosts to export resource isolation and usage information of running containers.

See cAdvisor docs for more information on how to monitor cAdvisor with Prometheus.

Container Label Prefixes

CAdvisor prefixes all container labels with container_label_. Given that the Task Ranker only talks to Prometheus, the labels provided should also include these prefixes. For example, let us say that we launch a task in a docker container using the command below.

docker run --label task_id="1234" -t repository/name:version

CAdvisor would then export container_label_task_id as the container label.

Configuration

Task Ranker configuration requires two components to be configured and provided.

DataFetcher - Responsible for fetching data from Prometheus, filtering it using the provided labels and submitting it to the chosen strategy.
- Endpoint: Prometheus HTTP API endpoint.
Ranking Strategy - Uses the data to calibrate currently running tasks and then rank them accordingly.
- Labels: Used for filtering the time series data using the specified label matching operation.
- Receiver of the task ranking results.

Task Ranker is configured as shown below. The below code snippet shows how Task Ranker can be configured to,

fetch time series data from a Prometheus server running at http://localhost:9090.
data is fetched every 5 seconds.
use the cpushares strategy to rank tasks.
filter out metrics where container_label_task_id!="".
filter out metrics where container_label_task_host!="".
use container_label_task_id as the dedicated label to help retrieve the task identifier.
use container_label_task_host as the dedicated label to help retrieve the hostname on which the task is running.
use dummyTaskRanksReceiver as the receiver of ranked tasks.

type dummyTaskRanksReceiver struct{}

func (r *dummyTaskRanksReceiver) Receive(rankedTasks entities.RankedTasks) {
	log.Println(rankedTasks)
}

prometheusDataFetcher, err = prometheus.NewDataFetcher(
    prometheus.WithPrometheusEndpoint("http://localhost:9090"))

tRanker, err = New(
    WithDataFetcher(prometheusDataFetcher),
    WithSchedule("?/5 * * * * *"),
    WithStrategy("cpushares", []*query.LabelMatcher{
        {Type: query.TaskID, Label: "container_label_task_id", Operator: query.NotEqual, Value: ""},
        {Type: query.TaskHostname, Label: "container_label_task_host", Operator: query.Equal, Value: "localhost"},
    }, new(dummyTaskRanksReceiver), 1*time.Second))

You can now also configure the strategies using initialization options. This allows for configuring the time duration of range queries, enabling fine-grained control over the number of data points over which the strategy is applied. See below example for strategy configuration using options.

type dummyTaskRanksReceiver struct{}

func (r *dummyTaskRanksReceiver) Receive(rankedTasks entities.RankedTasks) {
	log.Println(rankedTasks)
}

prometheusDataFetcher, err = prometheus.NewDataFetcher(
    prometheus.WithPrometheusEndpoint("http://localhost:9090"))

tRanker, err = New(
    WithDataFetcher(prometheusDataFetcher),
    WithSchedule("?/5 * * * * *"),
    WithStrategyOptions("cpuutil",
        strategies.WithLabelMatchers([]*query.LabelMatcher{
            {Type: query.TaskID, Label: "container_label_task_id", Operator: query.NotEqual, Value: ""},
            {Type: query.TaskHostname, Label: "container_label_task_host", Operator: query.Equal, Value: "localhost"}}),
        strategies.WithTaskRanksReceiver(new(dummyTaskRanksReceiver)),
        strategies.WithPrometheusScrapeInterval(1*time.Second),
        strategies.WithRange(query.Seconds, 5)))

Dedicated Label Matchers

Dedicated Label Matchers can be used to retrieve the task ID and host information from data retrieved from Prometheus. Strategies can mandate the requirement for one or more dedicated labels.

Currently, the following dedicated label matchers are supported.

TaskID - This is used to flag a label as one that can be used to fetch the unique identifier of a task.
TaskHostname - This is used to flag a label as one that can be used to fetch the name of the host on which the task is running.

Strategies can demand that one or more dedicated labels be provided. For instance, if a strategy ranks all tasks running on the cluster, then it can mandate only TaskID dedicated label. On the other hand if a strategy ranks colocated tasks, then it can mandate both TaskID and TaskHostname dedicated labels.

Dedicated label matchers will need to be provided when using strategies that demand them.
The below code snippet shows how a dedicated label can be provided when configuring the Task Ranker.

WithStrategy("strategy-name", []*query.LabelMatcher{
    {Type: query.TaskID, Label: "taskid_label", Operator: query.NotEqual, Value: ""},
    ... // Other label matchers.
})

Start the Task Ranker

Once the Task Ranker has been configured, then you can start it by calling tRanker.Start().

Test Locally

Setup

Run ./create_test_env to,

bring up a docker-compose installation running Prometheus and cAdvisor.
run tasks in docker containers.

Each container is allocated different cpu-shares. For more information on running Prometheus and cAdvisor locally see here.

Once you have Prometheus and cAdvisor running (test by running curl http://localhost:9090/metrics or use the browser),

Test

Now run the below command to run tests.

go test -v ./...

The task ranking results are displayed on the console. Below is what it will look like.

HOST = localhost
========================================================================
		
[TaskID = <task id>,Hostname = localhost,Weight = <weight>,], Rank = 0
[TaskID = <task id>,Hostname = localhost,Weight = <weight>,], Rank = 1
[TaskID = <task id>,Hostname = localhost,Weight = <weight>,], Rank = 2
...
[TaskID = <task id>,Hostname = localhost,Weight = <weight>,], Rank = n
========================================================================

Tear-Down

Once finished testing, tear down the test environment by running ./tear_down_test_env.

Documentation ¶

Index ¶

type Option
type TaskRanker
- func New(options ...Option) (*TaskRanker, error)

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type Option ¶

type Option func(*TaskRanker) error

func WithDataFetcher ¶

func WithDataFetcher(dataFetcher df.Interface) Option

func WithSchedule ¶

func WithSchedule(specString string) Option

func WithStrategy ¶

func WithStrategy(
	strategy string,
	labelMatchers []*query.LabelMatcher,
	receiver strategies.TaskRanksReceiver,
	prometheusScrapeInterval time.Duration) Option

WithStrategy builds the task ranking strategy associated with the given name using the provided information. For backwards compatibility, strategies that use range queries will use the default duration. If the time duration for the range query needs to be configured, then use WithStrategyOptions(...) to configure the strategy and provide the WithRange(...) option.

func WithStrategyOptions ¶ added in v0.5.0

func WithStrategyOptions(strategy string, strategyOptions ...strategies.Option) Option

WithStrategyOptions builds the strategy associated with the given name using the provided initialization options.

type TaskRanker ¶

type TaskRanker struct {
	// DataFetcher used to pull task/container specific data.
	DataFetcher df.Interface
	// Strategy to use for calibration and ranking of tasks using the data fetched.
	Strategy strategies.Interface
	// Schedule on which the ranker runs. The schedule should follow the cron schedule format.
	// See https://en.wikipedia.org/wiki/Cron.
	// Alternatively, Seconds can also be specified as part of the schedule.
	// See https://godoc.org/github.com/robfig/cron.
	Schedule cron.Schedule
	// contains filtered or unexported fields
}

TaskRanker fetches data pertaining to currently running tasks, deploys a strategy to rank them and then feeds the results back to the caller. Runs as a cron job on the defined schedule.

func New ¶

func New(options ...Option) (*TaskRanker, error)

func (*TaskRanker) Run ¶

func (tRanker *TaskRanker) Run()

func (*TaskRanker) Start ¶

func (tRanker *TaskRanker) Start()

func (*TaskRanker) Stop ¶

func (tRanker *TaskRanker) Stop()

Source Files ¶

View all Source files

ranker.go

Directories ¶

Path	Synopsis
datafetcher
prometheus
entities
query
strategies
factory
util

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL