ClickHouse Exporter
Status: experimental
Supported pipeline types: logs.
This exporter supports sending OpenTelemetry logs to ClickHouse. It will also support spans and metrics in the future.
ClickHouse is an open-source, high performance columnar OLAP database management system for real-time analytics using SQL.
Throughput can be measured in rows per second or megabytes per second.
If the data is placed in the page cache, a query that is not too complex is processed on modern hardware at a speed of approximately 2-10 GB/s of uncompressed data on a single server.
If 10 bytes of columns are extracted, the speed is expected to be around 100-200 million rows per second.
Note:
Always add batch-processor to collector pipeline, as ClickHouse document says:
We recommend inserting data in packets of at least 1000 rows, or no more than a single request per second. When inserting to a MergeTree table from a tab-separated dump, the insertion speed can be from 50 to 200 MB/s.
User Cases
-
Use Grafana Clickhouse datasource or
vertamedia-clickhouse-datasource to make dashboard.
Support time-series graph, table and logs.
-
Analyze logs via powerful clickhouse SQL.
/* get error count about my service last 1 hour.*/
SELECT count(*)
FROM otel_logs
WHERE SeverityText='ERROR' AND Timestamp >= NOW() - INTERVAL 1 HOUR;
/* find log.*/
SELECT *
FROM otel_logs
WHERE Timestamp >= NOW() - INTERVAL 1 HOUR;
/* find log with specific attribute .*/
SELECT Body
FROM otel_logs
WHERE LogAttributes.Value[indexOf(LogAttributes.Key, 'http_method')] = 'post' AND Timestamp >= NOW() - INTERVAL 1 HOUR;
Configuration options
The following settings are required:
The following settings can be optionally configured:
ttl_days
(defaul t= 0): The data time-to-live in days, 0 means no ttl.
logs_table_name
(default = otel_logs): The table name for logs.
timeout
(default = 5s): The timeout for every attempt to send data to the backend.
sending_queue
queue_size
(default = 5000): Maximum number of batches kept in memory before dropping data.
retry_on_failure
enabled
(default = true)
initial_interval
(default = 5s): The Time to wait after the first failure before retrying; ignored if enabled
is false
max_interval
(default = 30s): The upper bound on backoff; ignored if enabled
is false
max_elapsed_time
(default = 300s): The maximum amount of time spent trying to send a batch; ignored if enabled
is false
Example
receivers:
examplereceiver:
processors:
batch:
timeout: 10s
exporters:
clickhouse:
dsn: tcp://127.0.0.1:9000?database=default
ttl_days: 3
timeout: 5s
retry_on_failure:
enabled: true
initial_interval: 5s
max_interval: 30s
max_elapsed_time: 300s
service:
pipelines:
logs:
receivers: [examplereceiver]
processors: [batch]
exporters: [clickhouse]
Schema
CREATE TABLE IF NOT EXISTS otel_logs (
Timestamp DateTime CODEC(Delta, ZSTD(1)),
TraceId String CODEC(ZSTD(1)),
SpanId String CODEC(ZSTD(1)),
TraceFlags UInt32,
SeverityText LowCardinality(String) CODEC(ZSTD(1)),
SeverityNumber Int32,
Body String CODEC(ZSTD(1)),
ResourceAttributes Nested
(
Key LowCardinality(String),
Value String
) CODEC(ZSTD(1)),
LogAttributes Nested
(
Key LowCardinality(String),
Value String
) CODEC(ZSTD(1)),
INDEX idx_attr_keys ResourceAttributes.Key TYPE bloom_filter(0.01) GRANULARITY 64,
INDEX idx_res_keys LogAttributes.Key TYPE bloom_filter(0.01) GRANULARITY 64
) ENGINE MergeTree()
TTL Timestamp + INTERVAL 3 DAY
PARTITION BY toDate(Timestamp)
ORDER BY (toUnixTimestamp(Timestamp));