metricstatsprocessor

v1.71.5 (latest) · Published: Feb 19, 2025 · License: Apache-2.0 · Imports: 16 · Imported by: 0

Repository

github.com/observiq/bindplane-otel-collector

README ¶

Metric Stats Processor

This processor calculates statistics from metrics over a configurable interval. This lets metrics be sampled at a higher rate than they are exported, and reduces the volume of metric data from push-based sources.

Minimum agent versions

Introduced: v1.19.0

Supported pipelines

Metrics

How it works

The user configures the metricstats processor in the desired metrics pipeline.
Every metric that flows through the pipeline is matched against the provided include regex.
If the metric name does not match the include regex, the metric passes through the processor.
If the metric name matches, but the metric is not a gauge or cumulative sum, the metric passes through the processor.
If the metric name does match, and the metric is a gauge or cumulative sum, the metric is added to a statistic based on its attributes. The metric does not continue down the pipeline.
After the configured interval has passed, all calculated metrics are emitted. Calculated metrics are emitted with a name of ${metric_name}.${statistic_type} e.g. if you take the average of the metric system.cpu.utilization, the calculated metric would be system.cpu.utilization.avg.
All calculations are then cleared. A statistic is emitted on the next interval only if another matching metric enters the pipeline in the meantime.

Configuration

Field	Type	Default	Description
`interval`	duration	`1m`	The interval on which to emit calculated metrics.
`include`	regexp	`".*"`	A regex that specifies which metrics to consider for calculation. The default regex matches all metrics.
`stats`	[]string	`["min", "max", "avg"]`	A list of statistics to calculate on each metric. Valid values are: `min`, `max`, `avg`, `first`, `last`.

Example configuration

Reduce volume of log-based metrics

In this example, the throughput of log-based metrics is limited by calculating only the "last" statistic. The most recent data point extracted from the log is emitted at most once per minute.

receivers:
  filelog:
    include:
    - $HOME/example.log
    operators:
    - type: regex_parser
      regex: "^(?P<timestamp>[^ ]+) (?P<number>.*)$$"
      timestamp:
        parse_from: attributes.timestamp
        layout: "%d-%m-%YT%H:%M:%S.%LZ"

  route/extract:

processors:
  metricstats:
    interval: 1m
    include: '^.*$$'
    stats: ["last"]
  metricextract:
    route: extract
    extract: attributes.number
    metric_name: 'log.count'
    metric_unit: '{count}'
    metric_type: gauge_int

exporters:
  nop:
  googlecloud:

service:
  pipelines:
    logs:
      receivers: [filelog]
      processors: [metricextract]
      exporters: [nop]
    metrics:
      receivers: [route/extract]
      processors: [metricstats]
      exporters: [googlecloud]

This configuration extracts metrics from a log file and passes them through the metricstats processor. The processor holds the last data point it receives, then emits it at the end of the one-minute interval as log.count.last and sends it to Google Cloud Monitoring. This limits the throughput to at most one data point per minute for each distinct set of attributes.
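The behavior of the "last" statistic, including the clearing that happens after each emit, can be sketched as follows. `lastStat` is a hypothetical type written for illustration and is not taken from the package's internal stats code.

```go
package main

import "fmt"

// lastStat keeps only the most recent value seen during the current interval.
type lastStat struct {
	value float64
	seen  bool
}

// add records a new data point, overwriting any earlier one.
func (s *lastStat) add(v float64) {
	s.value = v
	s.seen = true
}

// flush returns the held value and whether any data point arrived,
// then clears the state so nothing is emitted for an idle interval.
func (s *lastStat) flush() (float64, bool) {
	v, ok := s.value, s.seen
	*s = lastStat{}
	return v, ok
}

func main() {
	var s lastStat
	for _, v := range []float64{3, 7, 5} {
		s.add(v)
	}

	v, ok := s.flush()
	fmt.Println(v, ok) // 5 true

	v, ok = s.flush()
	fmt.Println(v, ok) // 0 false: no data arrived, so nothing would be emitted
}
```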

Sample CPU utilization at a higher rate

In this example, we sample CPU utilization once per second, but only emit calculated metrics every minute. This gives a higher effective sample rate for CPU utilization without exporting a data point for every sample.

receivers:
  hostmetrics:
    collection_interval: 1s
    scrapers:
      cpu:
        metrics:
          system.cpu.time:
            enabled: false
          system.cpu.utilization:
            enabled: true

processors:
  metricstats:
    interval: 1m
    include: '^.*$$'
    stats: ["avg", "min", "max"]

exporters:
  googlecloud:


service:
  pipelines:
    metrics:
      receivers: [hostmetrics]
      processors: [metricstats]
      exporters: [googlecloud]

This configuration emits the metrics "system.cpu.utilization.max", "system.cpu.utilization.avg", and "system.cpu.utilization.min" every minute and sends them to Google Cloud Monitoring.
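As a rough illustration of how per-second samples collapse into min, max, and avg values, the sketch below keeps one running summary per series. The `sample` and `summary` types and the string series key are hypothetical; the README only states that metrics are grouped by their attributes, so the keying scheme here is an assumption.

```go
package main

import (
	"fmt"
	"sort"
)

// sample is one data point; series stands in for "metric name + attributes".
type sample struct {
	series string
	value  float64
}

// summary accumulates running min, max, and sum for one series.
type summary struct {
	min, max, sum float64
	count         int
}

func (s *summary) add(v float64) {
	if s.count == 0 || v < s.min {
		s.min = v
	}
	if s.count == 0 || v > s.max {
		s.max = v
	}
	s.sum += v
	s.count++
}

// avg assumes at least one sample has been added.
func (s *summary) avg() float64 { return s.sum / float64(s.count) }

// summarize groups samples by series and folds each into a summary.
func summarize(samples []sample) map[string]*summary {
	out := make(map[string]*summary)
	for _, smp := range samples {
		s, ok := out[smp.series]
		if !ok {
			s = &summary{}
			out[smp.series] = s
		}
		s.add(smp.value)
	}
	return out
}

func main() {
	result := summarize([]sample{
		{"cpu=cpu0,state=user", 0.25},
		{"cpu=cpu1,state=user", 0.125},
		{"cpu=cpu0,state=user", 0.75},
		{"cpu=cpu1,state=user", 0.375},
		{"cpu=cpu0,state=user", 0.5},
	})

	// Sort keys so the output order is stable.
	keys := make([]string, 0, len(result))
	for k := range result {
		keys = append(keys, k)
	}
	sort.Strings(keys)
	for _, k := range keys {
		s := result[k]
		fmt.Printf("%s min=%v max=%v avg=%v\n", k, s.min, s.max, s.avg())
	}
}
```

With a one-second collection interval and a one-minute emit interval, each summary would fold roughly sixty samples into three exported data points per series.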

Documentation ¶

Overview ¶

Package metricstatsprocessor provides a processor that samples pdata base level objects.

Index ¶

func NewFactory() processor.Factory
type Config
- func (cfg Config) StatTypes() []stats.StatType
- func (cfg Config) Validate() error

Functions ¶

func NewFactory ¶

func NewFactory() processor.Factory

NewFactory creates a new ProcessorFactory with default configuration

Types ¶

type Config ¶

type Config struct {
	Interval time.Duration `mapstructure:"interval"`
	// Include is a regex that must match the metric name for it to be sampled.
	// Otherwise, the metric is passed through.
	Include string `mapstructure:"include"`
	// List of stats to calculate for each metric
	Stats []stats.StatType `mapstructure:"stats"`
}

Config is the configuration for the processor

func (Config) StatTypes ¶

func (cfg Config) StatTypes() []stats.StatType

StatTypes gets the default stats to calculate if none were specified, otherwise the configured stat types

func (Config) Validate ¶

func (cfg Config) Validate() error

Validate validates the processor configuration

Directories ¶

Path	Synopsis
internal/stats	Package stats implements structs that are used to calculate statistics from datapoints.