Sandglass
Sandglass is a distributed, horizontally scalable, persistent, delayed message queue. It was developed to support asynchronous tasks, but it handles synchronous tasks as well, and it supports the competing consumers pattern.
Features
- Horizontal scalability
- Highly available
- Persistent storage
- Roughly strong ordering with a single consumer in a consumer group
- Round robin consumption between multiple consumers in a consumer group (loses ordering)
- Produce message to be consumed in the future
- Acknowledge each message individually
- Automatic consumer offset tracking
Project status
EXPERIMENTAL: This is a prototype and should not be used in production in its current form.
See the TODO section below for more information.
Installation
You can grab binaries here. If you're using macOS, you can also use Homebrew to install sandglass:
$ brew install celrenheit/taps/sandglass
Getting started
NOTE: All data will be stored in /tmp/node1. If you wish to change this, copy demo/node1.yaml
and modify it accordingly.
First, let's launch sandglass server:
$ sandglass --config https://raw.githubusercontent.com/celrenheit/sandglass/master/demo/node1.yaml
We are using a single-node cluster here; this is not recommended for production.
In another terminal window, create a payments topic:
$ sandctl topics create payments --num_partitions 300 --replication_factor 1
...produce 10,000 messages:
$ sandctl produce payments '{"hello" : "world"}' -n 10000
...and consume from the payments topic:
$ sandctl consume payments
(or, if you wish to watch messages come in live, you can use sandctl consume -f payments)
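There is also a Go client for producing and consuming programmatically. The sketch below is only an illustration of what that might look like: the import path, option names and method signatures are assumptions for the sake of example, not the actual client API; check the sandglass client repository for the real interface.

package main

// Illustrative sketch only: the client package path, options and methods
// below are assumed for the sake of example and may not match the real
// sandglass client API.
import (
    "context"
    "log"

    sg "github.com/celrenheit/sandglass-client/go/sg" // assumed import path
)

func main() {
    // Connect to the single-node cluster started above (the address is
    // assumed to match the gRPC address in demo/node1.yaml).
    client, err := sg.NewClient(sg.WithAddresses("127.0.0.1:7170"))
    if err != nil {
        log.Fatal(err)
    }
    defer client.Close()

    // Produce a message to the payments topic (assumed method name).
    err = client.ProduceMessage(context.Background(), "payments", []byte(`{"hello":"world"}`))
    if err != nil {
        log.Fatal(err)
    }
}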
Architecture
General
+-----------------+
| |
+--------------------------+ +------> Consumer |
| | | | |
| Sandglass Cluster | | +-----------------+
| | |
| +-----+ Round robin consumption
+-----------------+ | +------------------+ | |
| | | | | | | +-----------------+
| Producer +-------> | | | | | |
| | | | Broker 1 | | +------> Consumer |
+-----------------+ | | | | | |
| | | | +-----------------+
| +------------------+ |
+-----------------+ | |
| | | +------------------+ |
| Producer +-------> | | | +-----------------+
| | | | | | | |
+-----------------+ | | Broker 2 | +-----+------> Consumer |
| | | | | | |
| | | | | +-----------------+
+-----------------+ | +------------------+ | |
| | | | | Failover consumption
| Producer +-------> +------------------+ | | (NOT DONE YET)
| | | | | | |
+-----------------+ | | | | | +-----------------+
| | Broker 3 | | | | |
| | | | +------+ Consumer |
| | | | | |
| +------------------+ | +-----------------+
| |
| |
+--------------------------+
Topics
There are two kinds of topics: timer topics, in which messages are sorted by time in the message log, and compacted topics, in which messages are indexed by key.
A topic has a number of partitions.
Data is written into a single partition: either the destination partition is specified by the producer, or we fall back to choosing the destination partition using a consistent hashing algorithm.
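As an illustration of the fallback case, hash-based partition selection could look like the sketch below. It uses the jump consistent hash; the exact algorithm sandglass uses may differ.

package main

import (
    "fmt"
    "hash/fnv"
)

// jumpHash is Lamping & Veach's jump consistent hash: it maps a 64-bit key
// to one of numBuckets buckets, moving only ~1/n of the keys when a bucket
// is added. It stands in here for whatever consistent hashing scheme
// sandglass actually uses.
func jumpHash(key uint64, numBuckets int) int {
    var b, j int64 = -1, 0
    for j < int64(numBuckets) {
        b = j
        key = key*2862933555777941757 + 1
        j = int64(float64(b+1) * (float64(int64(1)<<31) / float64((key>>33)+1)))
    }
    return int(b)
}

// pickPartition hashes the message key and picks a destination partition.
func pickPartition(msgKey string, numPartitions int) int {
    h := fnv.New64a()
    h.Write([]byte(msgKey))
    return jumpHash(h.Sum64(), numPartitions)
}

func main() {
    // With the 300-partition payments topic created in Getting started:
    fmt.Println(pickPartition("order-42", 300))
}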
Each message produced to a partition is written to a Write Ahead Log (WAL) and to the message log.
The WAL is used for the replication logic; it is sorted in the order in which messages were produced.
The message log is used for message consumption; it is sorted mainly by time (please refer to sandflake ids for the exact composition).
The content of the message is stored in the message log and not in the WAL (in the WAL, only the keys matter). This way, the message log enables fast consumption and avoids random reads.
This will probably change so that the WAL becomes the only source of truth instead of storing the content in the message log. This will of course have an impact, because it transfers random reads onto the consumption path. Ultimately, we are going to have to store the content of the message in both logs for better performance at the cost of disk space.
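Conceptually, the write path described above appends every produced message to two logs with different sort keys: the WAL entry carries only keys and a reference (enough for replication), while the message log entry carries the payload. Below is a minimal in-memory sketch of that idea, not the actual storage code.

package main

import "fmt"

// entry is a simplified key/value record; the real implementation persists
// these on disk.
type entry struct {
    key   string
    value []byte
}

// partition keeps the two logs described above.
type partition struct {
    wal        []entry // sorted by produce order (index); no payload stored
    messageLog []entry // sorted by time (offset); stores the payload
    nextIndex  uint64
}

// produce appends to both logs: the WAL entry only references the message
// log key, so consumption can scan the message log sequentially without
// random reads.
func (p *partition) produce(offset string, payload []byte) {
    p.wal = append(p.wal, entry{
        key:   fmt.Sprintf("%020d", p.nextIndex),
        value: []byte(offset), // reference only, not the content
    })
    p.messageLog = append(p.messageLog, entry{key: offset, value: payload})
    p.nextIndex++
}

func main() {
    p := &partition{}
    // "offset" would be a time-sortable sandflake ID in the real system.
    p.produce("sandflake-id-000001", []byte(`{"hello":"world"}`))
    fmt.Println(len(p.wal), len(p.messageLog))
}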
A message is composed of the following fields:
- topic
- partition
- index <- position in the WAL
- offset <- position in the message log for timer topics
- key <- position in the message log for compacted topics
- value <- your payload
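For illustration, those fields map onto a struct along these lines. This is a sketch based on the list above; the field types are assumptions, not the actual definition in the codebase.

package main

import "fmt"

// Message is a sketch of the fields listed above; the field types are assumed.
type Message struct {
    Topic     string
    Partition string // partition identifier
    Index     uint64 // position in the WAL
    Offset    []byte // position in the message log (timer topics)
    Key       []byte // position in the message log (compacted topics)
    Value     []byte // the payload
}

func main() {
    m := Message{Topic: "payments", Value: []byte(`{"hello":"world"}`)}
    fmt.Println(m.Topic, string(m.Value))
}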
TODO
- Clean up all the mess
- Fix replication and reassign partitions correctly when a node goes down
- Save all the registered nodes and do not rely on gossip, so that topic creation works even if there are not enough nodes
- More TODOs in TODO section (#inception)
- Make everything more robust...