go-http-metrics
go-http-metrics knows how to measure http metrics in different metric formats, it comes with a middleware that will measure metrics of a Go net/http handler. The metrics measured are based on RED and/or Four golden signals, follow standards and try to be measured in a efficient way.
If you are using a framework that isn't directly compatible with go's http.Handler
interface from the std library, do not worry, there are multiple helpers available to get middlewares fo the most used http Go frameworks. If there isn't you can open an issue or a PR.
Table of contents
Metrics
The metrics obtained with this middleware are the most important ones for a HTTP service.
- Records the duration of the requests(with: code, handler, method).
- Records the count of the requests(with: code, handler, method).
- Records the size of the responses(with: code, handler, method).
Metrics recorder implementations
go-http-metrics is easy to extend to different metric backends by implementing metrics.Recorder
interface.
Framework compatibility middlewares
The middleware is mainly focused to be compatible with Go std library using http.Handler, but it comes with helpers to get middlewares for other frameworks or libraries.
Getting Started
A simple example that uses Prometheus as the recorder with the standard Go handler.
package main
import (
"log"
"net/http"
"github.com/prometheus/client_golang/prometheus/promhttp"
metrics "github.com/slok/go-http-metrics/metrics/prometheus"
"github.com/slok/go-http-metrics/middleware"
)
func main() {
// Create our middleware.
mdlw := middleware.New(middleware.Config{
Recorder: metrics.NewRecorder(metrics.Config{}),
})
// Our handler.
myHandler := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
w.WriteHeader(http.StatusOK)
w.Write([]byte("hello world!"))
})
h := mdlw.Handler("", myHandler)
// Serve metrics.
log.Printf("serving metrics at: %s", ":9090")
go http.ListenAndServe(":9090", promhttp.Handler())
// Serve our handler.
log.Printf("listening at: %s", ":8080")
if err := http.ListenAndServe(":8080", h); err != nil {
log.Panicf("error while serving: %s", err)
}
}
For more examples check the examples. default and custom are the examples for Go net/http std library users.
Prometheus query examples
Get the request rate by handler:
sum(
rate(http_request_duration_seconds_count[30s])
) by (handler)
Get the request error rate:
rate(http_request_duration_seconds_count{code=~"5.."}[30s])
Get percentile 99 of the whole service:
histogram_quantile(0.99,
rate(http_request_duration_seconds_bucket[5m]))
Get percentile 90 of each handler:
histogram_quantile(0.9,
sum(
rate(http_request_duration_seconds_bucket[10m])
) by (handler, le)
)
Options
Middleware Options
The factory options are the ones that are passed in the moment of creating the middleware factory using the middleware.Config
object.
Recorder
This is the implementation of the metrics backend, by default it's a dummy recorder.
GroupedStatus
Storing all the status codes could increase the cardinality of the metrics, usually this is not a common case because the used status codes by a service are not too much and are finite, but some services use a lot of different status codes, grouping the status on the \dxx
form could impact the performance (in a good way) of the queries on Prometheus (as they are already aggregated), on the other hand it losses detail. For example the metrics code code="401"
, code="404"
, code="403"
with this enabled option would end being code="4xx"
label. By default is disabled.
DisableMeasureSize
This setting will disable measuring the size of the responses. By default measuring the size is enabled.
Custom handler ID
One of the options that you need to pass when wrapping the handler with the middleware is handlerID
, this has 2 working ways.
-
If an empty string is passed mdwr.Handler("", h)
it will get the handler
label from the url path. This will create very high cardnialty on the metrics because /p/123/dashboard/1
, /p/123/dashboard/2
and /p/9821/dashboard/1
would have different handler
labels. This method is only recomended when the URLs are fixed (not dynamic or don't have parameters on the path).
-
If a predefined handler ID is passed, mdwr.Handler("/p/:userID/dashboard/:page", h)
this will keep cardinality low because /p/123/dashboard/1
, /p/123/dashboard/2
and /p/9821/dashboard/1
would have the same handler
label on the metrics.
There are different parameters to set up your middleware factory, you can check everything on the docs and see the usage in the examples.
Prometheus recorder options
Prefix
This option will make exposed metrics have a {PREFIX}_
in fornt of the metric. For example if a regular exposed metric is http_request_duration_seconds_count
and I use Prefix: batman
my exposed metric will be batman_http_request_duration_seconds_count
. By default this will be disabled or empty, but can be useful if all the metrics of the app are prefixed with the app name.
DurationBuckets
DurationBuckets are the buckets used for the request duration histogram metric, by default it will use Prometheus defaults, this is from 5ms to 10s, on a regular HTTP service this is very common and in most cases this default works perfect, but on some cases where the latency is very low or very high due the nature of the service, this could be changed to measure a different range of time. Example, from 500ms to 320s Buckets: []float64{.5, 1, 2.5, 5, 10, 20, 40, 80, 160, 320}
. Is not adviced to use more than 10 buckets.
Registry
The Prometheus registry to use, by default it will use Prometheus global registry (the default one on Prometheus library).
Label names
The label names of the Prometheus metrics can be configured using HandlerIDLabel
, StatusCodeLabel
, MethodLabel
...
Benchmarks
pkg: github.com/slok/go-http-metrics/middleware
BenchmarkMiddlewareHandler/benchmark_with_default_settings.-4 1000000 1062 ns/op 256 B/op 6 allocs/op
BenchmarkMiddlewareHandler/benchmark_disabling_measuring_size.-4 1000000 1101 ns/op 256 B/op 6 allocs/op
BenchmarkMiddlewareHandler/benchmark_with_grouped_status_code.-4 1000000 1324 ns/op 256 B/op 7 allocs/op
BenchmarkMiddlewareHandler/benchmark_with_predefined_handler_ID-4 1000000 1155 ns/op 256 B/op 6 allocs/op