Logtrain
Overview
Logtrain is a system for dynamically forwarding and transforming logs, similar to fluentd but a bit more specialized to solve two issues...
- Have very low overhead (e.g., less than 64Mi)
- Dynamically route logs based on various data sources.
Drain Types
Elastic Search
es+https://user:password@host?[auth=apikey|bearer|basic]&[index=...]
es+http://user:password@host?[auth=apikey|bearer|basic]&[index=...]
The bearer token is taken from the password portion of the url.
Http
http://host/path
https://host/path
Syslog
syslog+tls://host:port?[ca=]
syslog+http://host:port
syslog+https://host:port
syslog+tcp://host:port
syslog+udp://
(aliases, syslog://
)
Using Logtrain API
TODO
Using Logtrain with Kubernetes
Running in Kubernetes
Below is the recommended way of deploying Log Train. Note that this must run as a privileged container,
so that it can read files from /var/log/containers
. The daemonset also contains an initContainer that
will change the sysctl fs.inotify.max_user_instances
to 2048
on your nodes (usually from 128).
kubectl apply -f ./deployments/kubernetes/logtrain-serviceaccount.yaml
kubectl apply -f ./deployments/kubernetes/logtrain-service.yaml
kubectl apply -f ./deployments/kubernetes/logtrain-daemonset.yaml
Adding drains in Kubernetes
Once deployed you can use the following annotations on deployments, daemonsets or statefulsets to forward logs.
logtrain.akkeris.io/drains
This annoation is a comma delimited list of drains (See Drain Types above).
logtrain.akkeris.io/hostname
Explicitly set the hostname used when reading in logs from kubernetes, if not set this will default to the name.namespace
.
logtrain.akkeris.io/tag
Explicitly set the tag when reading in logs from kuberntes, if not set this will default to the pod name.
Using Logtrain with Servers
TODO
Advanced Configuration
General
HTTP_PORT
- The port to use for the http server, shared by any http (payload) and http (syslog) inputs.
Postgres (datasource)
Whether to watch a postgres database for information on where to foward logs to.
POSTGRES
- set to true
DATABASE_URL
- The database url to use to listen for drain changes.
Kubernetes (datasource)
Whether to watch kubernetes deployments, statefulsets and daemonsets for annotations indicating
where logs should be forwarded to.
KUBERNETES_DATASOURCE
- set to true
Kubernetes
Whether to watch the KUBERNETES_LOG_PATH
directory for pod logs and forward them.
KUBERNETES
- set to true
KUBERNETES_LOG_PATH
- optional, the path on each node to look for logs. Defaults to /var/log/containers
Envoy/Istio
Whether to open a gRPC access log stream end point for istio/envoy to stream http log traffic to.
ENVOY
- set to true
ENVOY_PORT
- The port number to listen for gRPC access log streams (default is 9001
)
Http (events)
HTTP_EVENTS
- set to true
HTTP_EVENTS_PATH
- optional, The path on the http server to receive http event payloads, defaults to /events
Note, the port is inherited from HTTP_PORT
. The endpoint only allows one per event over the body and must
be the format defined by pkg/output/packet/packet.go.
Syslog (HTTP)
HTTP_SYSLOG
- set to true
HTTP_SYSLOG_PATH
- optional, The path on the http server to receive syslog streams as http, defaults to /syslog
Note, the port is inherited from HTTP_PORT
.
Syslog (TCP)
SYSLOG_TCP
- set to true
SYSLOG_TCP_PORT
- optional, defaults tcp 9002
Syslog (UDP)
SYSLOG_UDP
- set to true
SYSLOG_UDP_PORT
- optional, defaults to 9003
Syslog (TLS)
SYSLOG_TLS
- set to true
SYSLOG_TLS_CERT_PEM
- The PEM encoded certificate
SYSLOG_TLS_CA_PEM
- The PEM encoded certificate authority (optional)
SYSLOG_TLS_KEY_PEM
- The PEM encoded certificate key
SYSLOG_TLS_SERVER_NAME
- The servername the TLS server should use for SNI
SYSLOG_TLS_PORT
- optional, defaults to 9004
AKKERIS=true
- for Akkeris formatting of output
The logtrain has been tested to be below 64MB (avg 59MB) and < 100m (5%)
CPU for 52 pods on a node with 1500+ deployments being watched. For more pods per node
or more deployments than the benchmark expect (and reset any limits/requests) for memory.
While targeting a 64MB top limit, logtrain should have a limit of 128MB.
Troubleshooting
Too many open files error
If you receive an error on startup with too many files open
error message you'll need to increase
the fs.inotify.max_user_instances
and user.max_inotify_instances
. These are generally set to
128
by default, depending how many pods are running this may be insufficient.
sysctl -w fs.inotify.max_user_instances=2048
sysctl -w user.max_inotify_instances=2048
Developing
Building Logtrain
go build -o logtrain cmd/logtrain/main.go
Testing
go test -v .../.
(Note if you're using GoConvey its best to set the parallel packges to 1 via -packages 1
)
Code Coverage
go test -coverprofile cover.out -v ./... && go tool cover -html=cover.out && rm cover.out