microcache

package module
v1.0.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Sep 25, 2024 License: MIT Imports: 16 Imported by: 0

README

Microcache

A non-standard HTTP cache implemented as Go middleware.

GoDoc Go Report Card Code Coverage

HTTP Microcaching is a common strategy for improving the efficiency, availability and response time variability of HTTP web services. These benefits are especially relevant in microservice architectures where a service's synchronous dependencies sometimes become unavailable and it is not always feasible or economical to add a separate caching layer between all services.

To date, very few software packages exist to solve this specific problem. Most microcache deployments make use of existing HTTP caching middleware like NGINX. This presents a challenge. When an HTTP cache exists for the purpose of microcaching between an origin server and a CDN, the origin must choose whether to use standard HTTP caching headers with aggressive short TTLs for the microcache or less aggressive longer TTL headers more suitable to CDNs. The overlap in HTTP header key space prevents these two cache layers from coexisting without some additional customization.

All request specific custom response headers supported by this cache are prefixed with microcache- and scrubbed from the response. Most of the common HTTP caching headers one would expect to see in an http cache are ignored (except Vary). This was intentional and support may change depending on developer feedback. The purpose of this cache is not to act as a substitute for a robust HTTP caching layer but rather to serve as an additional caching layer with separate controls for shorter lived, more aggressive caching measures.

The manner in which this cache operates (writing response bodies to byte buffers) may not be suitable for all applications. Caching should certainly be disabled for any resources serving very large and/or streaming responses. For instance, caching is automatically disabled for all websocket requests.

More info in the docs: https://godoc.org/github.com/kevburnsjr/microcache

Example

package main

import (
	"bytes"
	"log"
	"net/http"
	"time"

	"github.com/kevburnsjr/microcache"
)

func main() {
	cache := microcache.New(microcache.Config{
		Nocache:              true,
		Timeout:              3 * time.Second,
		TTL:                  30 * time.Second,
		StaleIfError:         3600 * time.Second,
		StaleRecache:         true,
		StaleWhileRevalidate: 30 * time.Second,
		CollapsedForwarding:  true,
		HashQuery:            true,
		QueryIgnore:          []string{},
		Exposed:              true,
		SuppressAgeHeader:    false,
		Monitor:              microcache.MonitorFunc(5*time.Second, logStats),
		Driver:               microcache.NewDriverLRU(1e4),
		Compressor:           microcache.CompressorSnappy{},
	})
	defer cache.Stop()

	h := cache.Middleware(handler{})

	http.ListenAndServe(":80", h)
}

type handler struct {
}

var body = bytes.Repeat([]byte("1234567890"), 1e3)

// This example fills up to 1.2GB of memory, so at least 2.0GB of RAM is recommended
func (h handler) ServeHTTP(w http.ResponseWriter, r *http.Request) {

	// Enable cache
	w.Header().Set("microcache-cache", "1")

	// Return a 10 kilobyte response body
	w.Write(body)
}

func logStats(stats microcache.Stats) {
	total := stats.Hits + stats.Misses + stats.Stales
	log.Printf("Size: %d, Total: %d, Hits: %d, Misses: %d, Stales: %d, Backend: %d, Errors: %d\n",
		stats.Size,
		total,
		stats.Hits,
		stats.Misses,
		stats.Stales,
		stats.Backend,
		stats.Errors,
	)
}

Features

May improve service efficiency by reducing origin read traffic

  • ttl - response caching with global or request specific ttl
  • collapsed-forwarding - deduplicate requests for cacheable resources

May improve client facing response time variability

  • stale-while-revalidate - serve stale content while fetching cacheable resources in the background

May improve service availability

  • request-timeout - kill long running requests
  • stale-if-error - serve stale responses on error (or request timeout)
  • stale-recache - recache stale responses following stale-if-error

Supports content negotiation with global and request specific cache splintering

  • vary - splinter requests by request header value
  • vary-query - splinter requests by URL query parameter value

Control Flow Diagram

This diagram illustrates the basic internal operation of the middleware.

microcache-architecture.svg

Compression

The Snappy compressor is recommended to optimize for CPU over memory efficiency compared with gzip

Snappy provides:

  • 14x faster compression over gzip
  • 8x faster expansion over gzip
  • but the result is 1.5 - 2x the size compared to gzip (for specific json examples)

Your mileage may vary. See compare_compression.go to test your specific workloads

> go run tools/compare_compression.go -f large.json
Original: 616,611 bytes of json
zlib   compress 719.853807ms  61,040 bytes (10.1x)
gzip   compress 720.731066ms  61,052 bytes (10.1x)
snappy compress 48.836002ms  106,613 bytes (5.8x)
zlib   expand 211.538416ms
gzip   expand 220.011961ms
snappy expand 26.973263ms

Benchmarks

All benchmarks are lies. Running example code above on 5820k i7 @ 3.9Ghz DDR4. GOMAXPROCS=2, 10KB response.

> gobench -u http://localhost/ -c 10 -t 10
Dispatching 10 clients
Waiting for results...

Requests:                           404120 hits
Successful requests:                404120 hits
Network failed:                          0 hits
Bad requests failed (!2xx):              0 hits
Successful requests rate:            40412 hits/sec
Read throughput:                 410714568 bytes/sec
Write throughput:                  3273453 bytes/sec
Test time:                              10 sec

The intent of this middleware is to serve cached content with minimal overhead. We could probably do some more gymnastics to reduce allocs but overall performance is quite good. It achieves the goal of reducing hit response times to the order of microseconds.

$ go test -bench=. -benchmem
goos: linux
goarch: amd64
pkg: github.com/kevburnsjr/microcache
BenchmarkHits-12                            827476   1449 ns/op    678 B/op   14 allocs/op
BenchmarkNocache-12                        1968576    613 ns/op    312 B/op    6 allocs/op
BenchmarkMisses-12                          307652   3890 ns/op   1765 B/op   37 allocs/op
BenchmarkCompression1kHits-12               648564   1928 ns/op   1384 B/op   15 allocs/op
BenchmarkCompression1kNocache-12           1965351    611 ns/op    312 B/op    6 allocs/op
BenchmarkCompression1kMisses-12             232980   5119 ns/op   3365 B/op   39 allocs/op
BenchmarkParallelCompression1kHits-12      2525994    511 ns/op   1383 B/op   15 allocs/op
BenchmarkParallelCompression1kNocache-12   3876728    335 ns/op    312 B/op    6 allocs/op
BenchmarkParallelCompression1kMisses-12     666580   2226 ns/op   3326 B/op   38 allocs/op
PASS
ok      github.com/kevburnsjr/microcache        7.188s

Release Status

API is stable. 100% test coverage.

At least one large scale deployment of this library has been running in production on a high volume internet facing API at an Alexa Top 500 global website for over a year.

Notes

Vary query by parameter presence as well as value

Modify Monitor.Error to accept request, response and error
Add Monitor.Timeout accepting request, response and error

Separate middleware:
  Sanitize lang header? (first language)
  Sanitize region? (country code)

etag support?
if-modified-since support?
HTCP?
TCI?
Custom rule handling?
  Passthrough: func(r) bool

Documentation

Overview

microcache is a non-standard HTTP microcache implemented as Go middleware.

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

func MonitorFunc

func MonitorFunc(interval time.Duration, logFunc func(Stats)) *monitorFunc

MonitorFunc turns a function into a Monitor

func New

func New(o Config) *microcache

New creates and returns a configured microcache instance

Types

type Compressor

type Compressor interface {

	// Compress compresses a response prior to being saved in the cache and returns a clone
	// usually by compressing the response body
	Compress(Response) Response

	// Expand decompresses a response's body (destructively)
	Expand(Response) Response
}

Compressor is the interface for response compressors

type CompressorGzip

type CompressorGzip struct {
}

CompressorGzip is a gzip compressor

func (CompressorGzip) Compress

func (c CompressorGzip) Compress(res Response) Response

func (CompressorGzip) Expand

func (c CompressorGzip) Expand(res Response) Response

type CompressorSnappy

type CompressorSnappy struct {
}

CompressorSnappy is a Snappy compressor 14x faster compress than gzip 8x faster expand than gzip ~ 1.5 - 2x larger result (see README)

func (CompressorSnappy) Compress

func (c CompressorSnappy) Compress(res Response) Response

func (CompressorSnappy) Expand

func (c CompressorSnappy) Expand(res Response) Response

type Config

type Config struct {
	// Nocache prevents responses from being cached by default
	// Can be overridden by the microcache-cache and microcache-nocache response headers
	Nocache bool

	// Timeout specifies the maximum execution time for backend responses
	// Example: If the underlying handler takes more than 10s to respond,
	// the request is cancelled and the response is treated as 503
	// Recommended: 10s
	// Default: 0
	Timeout time.Duration

	// TTL specifies a default ttl for cached responses
	// Can be overridden by the microcache-ttl response header
	// Recommended: 10s
	// Default: 0
	TTL time.Duration

	// StaleWhileRevalidate specifies a period during which a stale response may be
	// served immediately while the resource is fetched in the background. This can be
	// useful for ensuring consistent response times at the cost of content freshness.
	// More Info: https://tools.ietf.org/html/rfc5861
	// Recommended: 20s
	// Default: 0
	StaleWhileRevalidate time.Duration

	// StaleIfError specifies a default stale grace period
	// If a request fails and StaleIfError is set, the object will be served as stale
	// and the response will be re-cached for the duration of this grace period
	// Can be overridden by the microcache-ttl-stale response header
	// More Info: https://tools.ietf.org/html/rfc5861
	// Recommended: 20s
	// Default: 0
	StaleIfError time.Duration

	// StaleRecache specifies whether to re-cache the response object for ttl while serving
	// stale response on backend error
	// Recommended: true
	// Default: false
	StaleRecache bool

	// CollapsedForwarding specifies whether to collapse duplicate requests
	// This helps prevent servers with a cold cache from hammering the backend
	// Default: false
	CollapsedForwarding bool

	// HashQuery determines whether all query parameters in the request URI
	// should be hashed to differentiate requests
	// Default: false
	HashQuery bool

	// QueryIgnore is a list of query parameters to ignore when hashing
	// Default: nil
	QueryIgnore []string

	// Vary specifies a list of http request headers by which all requests
	// should be differentiated. When making use of this option, it may be a good idea
	// to normalize these headers first using a separate piece of middleware.
	//
	//   []string{"accept-language", "accept-encoding", "xml-http-request"}
	//
	// Default: []string{}
	Vary []string

	// Driver specifies a cache storage driver
	// Default: lru with 10,000 item capacity
	Driver Driver

	// Compressor specifies a compressor to use for reducing the memory required to cache
	// response bodies
	// Default: nil
	Compressor Compressor

	// Monitor is an optional parameter which will periodically report statistics about
	// the cache to enable monitoring of cache size, cache efficiency and error rate
	// Default: nil
	Monitor Monitor

	// Exposed determines whether to add a header to the response indicating the response state
	// Microcache: ( HIT | MISS | STALE )
	// Default: false
	Exposed bool

	// SuppressAgeHeader determines whether to suppress the age header in responses
	// The age header is added by default to all HIT and STALE responses
	// Age: ( seconds )
	// Default: false
	SuppressAgeHeader bool
}

type Driver

type Driver interface {

	// SetRequestOpts stores request options in the request cache.
	// Requests contain request-specific cache configuration based on response headers
	SetRequestOpts(string, RequestOpts) error

	// GetRequestOpts retrieves request options from the request cache
	GetRequestOpts(string) (RequestOpts, bool)

	// Set stores a response object in the response cache.
	// This contains the full response as well as an expiration date.
	Set(string, Response) error

	// Get retrieves a response object from the response cache
	Get(string) (Response, bool)

	// Remove removes a response object from the response cache.
	// Required by HTTP spec to purge cached responses after successful unsafe request.
	Remove(string) error

	// GetSize returns the number of objects stored in the cache
	GetSize() int
}

Driver is the interface for cache drivers

type DriverARC

type DriverARC struct {
	RequestCache  *lru.ARCCache
	ResponseCache *lru.ARCCache
}

DriverARC is a driver implementation using github.com/hashicorp/golang-lru ARCCache is a thread-safe fixed size Adaptive Replacement Cache (ARC). It requires more ram and cpu than straight LRU but can be more efficient https://godoc.org/github.com/hashicorp/golang-lru#ARCCache

func NewDriverARC

func NewDriverARC(size int) DriverARC

NewDriverARC returns an ARC driver. size determines the number of items in the cache. Memory usage should be considered when choosing the appropriate cache size. The amount of memory consumed by the driver will depend upon the response size. Roughly, memory = cacheSize * averageResponseSize / compression ratio ARC caches have additional CPU and memory overhead when compared with LRU ARC does not support eviction monitoring

func (DriverARC) Get

func (c DriverARC) Get(hash string) (res Response, collision bool)

func (DriverARC) GetRequestOpts

func (c DriverARC) GetRequestOpts(hash string) (req RequestOpts, collision bool)

func (DriverARC) GetSize

func (c DriverARC) GetSize() int

func (DriverARC) Remove

func (c DriverARC) Remove(hash string) error

func (DriverARC) Set

func (c DriverARC) Set(hash string, res Response) error

func (DriverARC) SetRequestOpts

func (c DriverARC) SetRequestOpts(hash string, req RequestOpts) error

type DriverLRU

type DriverLRU struct {
	RequestCache  *lru.Cache
	ResponseCache *lru.Cache
}

DriverLRU is a driver implementation using github.com/hashicorp/golang-lru

func NewDriverLRU

func NewDriverLRU(size int) DriverLRU

NewDriverLRU returns the default LRU driver configuration. size determines the number of items in the cache. Memory usage should be considered when choosing the appropriate cache size. The amount of memory consumed by the driver will depend upon the response size. Roughly, memory = cacheSize * averageResponseSize / compression ratio

func (DriverLRU) Get

func (c DriverLRU) Get(hash string) (res Response, collision bool)

func (DriverLRU) GetRequestOpts

func (c DriverLRU) GetRequestOpts(hash string) (req RequestOpts, collision bool)

func (DriverLRU) GetSize

func (c DriverLRU) GetSize() int

func (DriverLRU) Remove

func (c DriverLRU) Remove(hash string) error

func (DriverLRU) Set

func (c DriverLRU) Set(hash string, res Response) error

func (DriverLRU) SetRequestOpts

func (c DriverLRU) SetRequestOpts(hash string, req RequestOpts) error

type DriverRistretto

type DriverRistretto struct {
	Cache *ristretto.Cache[string, any]
}

DriverRistretto is a driver implementation using github.com/dgraph-io/ristretto

func NewDriverRistretto

func NewDriverRistretto(requests, size int64) DriverRistretto

NewDriverRistretto returns the default Ristretto driver configuration. requests should be the number of items you expect to keep in the cache when full. Estimating this on the higher side is better. size determines the maximum number of bytes in the cache.

func (DriverRistretto) Get

func (d DriverRistretto) Get(hash string) (res Response, collision bool)

func (DriverRistretto) GetRequestOpts

func (d DriverRistretto) GetRequestOpts(hash string) (req RequestOpts, collision bool)

func (DriverRistretto) GetSize

func (d DriverRistretto) GetSize() int

func (DriverRistretto) Remove

func (d DriverRistretto) Remove(hash string) error

func (DriverRistretto) Set

func (d DriverRistretto) Set(hash string, res Response) error

func (DriverRistretto) SetRequestOpts

func (d DriverRistretto) SetRequestOpts(hash string, req RequestOpts) error

type Microcache

type Microcache interface {
	Middleware(http.Handler) http.Handler
	Start()
	Stop()
	// contains filtered or unexported methods
}

type Monitor

type Monitor interface {
	GetInterval() time.Duration
	Log(Stats)
	Hit()
	Miss()
	Stale()
	Backend()
	Error()
	Collision()
}

Monitor is an interface for collecting metrics about the microcache

type RequestOpts

type RequestOpts struct {
	// contains filtered or unexported fields
}

RequestOpts stores per-request cache options. This is necessary to allow custom response headers to be evaluated, cached and applied prior to response object retrieval (ie. microcache-vary, microcache-nocache, etc)

type Response

type Response struct {
	// contains filtered or unexported fields
}

Response is used both as a cache object for the response and to wrap http.ResponseWriter for downstream requests.

func (*Response) Header

func (res *Response) Header() http.Header

func (*Response) Write

func (res *Response) Write(b []byte) (int, error)

func (*Response) WriteHeader

func (res *Response) WriteHeader(code int)

type Stats

type Stats struct {
	Size       int
	Hits       int
	Misses     int
	Stales     int
	Backend    int
	Errors     int
	Collisions int
}

Directories

Path Synopsis
examples
tools

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL