README ¶
veneur-proxy
is a proxy that sits two sets of Veneur instances: those that run on all your hosts and those that act as global aggregators.
Setup
ADD DIAGRAM HERE
veneur-proxy
acts as a stateless bridge. The following is a guide for setting up a Veneur pipeline.
- Be running consul in your infrastructure.
- Set up 3 or more Veneur instances as global. Maybe call the consul service
veneur-global-srv
? - Set up 3 or more
veneur-proxy
instances on hosts, use either DNS — we use consul — or load-balancers to expose them. - Configure the
veneur-proxy
instances to find your global instances viaconsul_forward_service_name
configuration option. If you usedveneur-global-srv
as above, then enter that value! - Point your per-host Veneur instances to the proxies via a host and port pair using the
forward_address
configuration option. This step is dependent on the way you chose to load balancer your proxies. - Profit!
How It Works
veneur-proxy
implements the same API as a global Veneur, accepting metrics send from other Veneur instances. It hashes each metric name and tag value/pair combination, using a consistent hash ring to ensure that metrics are always aggregated at the same global Veneur instance.
configuration
Use the consul_refresh_interval
to specify how often Veneur should refresh it's list.
debug
: Enable or disable debug logging with true/false.enable_profiling
: Enable or disable go profiling. Danger, might fill up your disk if not cared for.http_address
: Thehost:port
pair in which this program will listen for HTTP commands.grpc_address
: Thehost:port
pair to listen on for metric forwards over gRPC.consul_refresh_interval
: How often to refresh from Consul's healthy nodes. Value must be parseable by time.ParseDuration (https://golang.org/pkg/time/#ParseDuration)ssf_destination_address
: Thehost:port
address of a Veneur to sendveneur_proxy
's metrics to over SSF.stats_address
: Thehost:port
destination to send metrics to over StatsD whenveneur-proxy
experiences backpressure on submitting tossf_destination_address
.forward_address
: Use a static host for forwarding over HTTP.grpc_forward_address
: Use a static host for forwarding (over gRPC).consul_forward_service_name
: The name of a consul service for consistent forwarding over HTTP.consul_forward_grpc_service_name
: The name of a consul service for consistent forwarding over gRPC.sentry_dsn
: A Sentry DSN to which errors will be sent.
Concerns
- When metrics are accepted, the act of forwarding them is asynchronous. As far as the client is concerned the HTTP operation always succeeds. This tradeoff is because any failures are expected to be short in duration and there is no mechanism for notify clients of failure due to the nature of UDP.
- The list of global servers is locked when refreshing and flushing to avoid race conditions. If your retrieval of consul hosts (see metric
veneur.discoverer.update_duration_ns
) or flushes (see metricveneur.flush.total_duration_ns
) are slow, you see one or the other slow down. - A consistent hash ring is used mitigate the impact of changes in Consul's list of healthy nodes. This is not perfect, and you can expect some churn whenever the list of healthy nodes changes in Consul.
Operation
Replacing A Global Veneur
Using either Consul's health checks or other means, remove the instance you're working on. Within consul_refresh_interval
the proxies should remove the host and rebalance the ring. To add the new host, simply turn it on and wait for it to show up in Consul. veneur-proxy
will do the rest.
Replacing A Proxy Veneur
Using either Consul or some sort of load balancer, remove the proxy instance. Per-instance veneurs should stop flushing to the proxies. After this time you can replace and add a new proxy, as all proxy work is stateless.
Monitoring
Since the proxy's job is to accept and dispatch connections, the important metrics to watch are:
veneur_proxy.proxy.duration_ns.*
- A timer describing the duration of the entire proxy call.veneur_proxy.import.duration_ns.*
- A timer describing the duration of handling the "import" call, which is used to deserialize and process the incoming metrics from a child Veneur.veneur_proxy.forward.duration_ns.*
: A timer for the duration of forwardsveneur_proxy.forward.error_total
: The count of errored forwards
To monitor the health of the forwarded metrics, you might want to look at:
veneur_proxy.forward.content_length_bytes.*
- Length of forwarded request bodies as a histogramveneur_proxy.metrics_by_destination
- A gauge describing the number of metrics that were proxied to each destination instance.
If you use service discovery (e.g. Consul) for forwarding or tracing, these metrics will be useful to you. Each of these is tagged with service
that has a value matching the service name supplied via the config:
veneur_proxy.discoverer.destination_number
- A gauge containing the number of hosts Veneur discovered and added to the hash ring.veneur_proxy.discoverer.errors
- A counter tracking the number of times the service discovery mechanism has failed to return any hosts. Note that Veneur will refuse to update it's list if there are 0 returned hosts and may use stale results until such as as > 1 host is returned.veneur_proxy.discoverer.update_duration_ns
- A timer describing the duration of service discovery calls.
Documentation ¶
There is no documentation for this package.