RocketKV
Minimalistic, highly performant key-value store, written in Go.
Usage
- Install
go install github.com/intob/rocketkv@latest
- (Optional) Define a config file, e.g. config.json (see Config)
- Start
./rocketkv -c cfg.json
Config
This project uses Viper.
Unless you explicitly provide a config file with -c, the config file should be named config, with one of the supported extensions, in one of the supported formats, e.g. TOML, YAML, JSON, or CONF.
# /tmp/rocketkv/config.toml
# network & auth
network = "tcp"
address = ":8100"
auth = "supersecretsecret,wait,it'sinthereadme"
# general
segments = 16 # make 256 blocks (16 parts * 16 blocks)
buffersize = 2000000 # 2MB
scanperiod = 10
# persistence
persist = true
dir = "/etc/rocketkv"
writeperiod = 10
[tls]
cert = "path/to/x509/cert.pem"
key = "path/to/x509/key.pem"
For periods, the unit of time is one second. I will add support for parsing time strings.
Within each part, the number of blocks created is equal to the part count, so 8 parts will result in 64 blocks in total.
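For reference, loading the config with Viper typically looks like the sketch below. This is illustrative, not the project's exact code; the helper name and search path are assumptions.

import "github.com/spf13/viper"

// loadConfig honours an explicit -c path; otherwise Viper searches for a
// file named "config" with any supported extension (hypothetical helper).
func loadConfig(explicitPath string) error {
	if explicitPath != "" {
		viper.SetConfigFile(explicitPath)
		return viper.ReadInConfig()
	}
	viper.SetConfigName("config") // matches config.toml, config.yaml, config.json, ...
	viper.AddConfigPath(".")
	return viper.ReadInConfig()
}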
Play
- Install CLI tool, rkteer
go install github.com/intob/rkteer@latest
- Bind to your rocketkv instance
rkteer bind [NETWORK] [ADDRESS] --a [AUTHSECRET]
- Call set, get, del, list, or count
./rkteer set coffee beans
status: OK
./rkteer get coffee
beans
In progress
- Support for horizontal scaling
To do
- Re-partitioning
- Test membership using Bloom filter before GET
Keys
Keys can be specified in two forms: bare & namespaced.
Bare
A bare key has no namespace prefix. It is simply a string with no / path separator.
Namespaced
Grouping keys is achieved by namespacing. This is done by prefixing the key with a path.
randomnamespaced/examplekey
|---namespace----|---name---|
Note that the namespace includes the final /.
All keys for a given namespace will land in the same block. This greatly improves performance for collecting & listing multiple keys, because only a single block must be searched.
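This works because, for a namespaced key, only the namespace is hashed (see Partition:Block:Key mapping below). A minimal sketch of deriving the hashed portion of a key; the helper name is hypothetical, not the project's actual code.

import "strings"

// hashablePart returns the portion of a key that determines its block:
// the namespace (including the trailing "/") when present, otherwise
// the whole bare key.
func hashablePart(key string) string {
	if i := strings.LastIndex(key, "/"); i >= 0 {
		return key[:i+1]
	}
	return key
}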
Segmentation
To reduce load on the file system and decrease blocking, the dataset is split into 2 layers. Each layer contains the configured number of segments.
Partitions
This is the top layer.
When initialising a dataset, the number of partitions (parts) created will be equal to the configured Parts.Count property.
Identifying the partition corresponding to a key is the first step to locating (or placing) a key.
Blocks
Each part is split into blocks. The number of blocks in each part is equal to the number of parts. So 8 parts will result in 64 blocks.
Each block has its own mutex & map of keys.
When a key is written to or deleted, the parent block is flagged as changed.
If persistence is enabled in the config via "Parts.Persist": true, then each block is written to the file system periodically, when changed.
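A sketch of this changed-flag flush cycle, using illustrative types and gob encoding as an assumption (not the project's actual structures or on-disk format):

import (
	"encoding/gob"
	"os"
	"sync"
	"time"
)

type persistentBlock struct {
	mu      sync.Mutex
	entries map[string][]byte
	changed bool   // set by writes & deletes
	path    string // file backing this block
}

// flushLoop writes the block to disk once per period, but only when
// a write or delete has flagged it as changed since the last flush.
func (b *persistentBlock) flushLoop(period time.Duration) {
	for range time.Tick(period) {
		b.mu.Lock()
		if !b.changed {
			b.mu.Unlock()
			continue
		}
		f, err := os.Create(b.path)
		if err == nil {
			gob.NewEncoder(f).Encode(b.entries)
			f.Close()
			b.changed = false
		}
		b.mu.Unlock()
	}
}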
Partition:Block:Key mapping
Distance from key to a partition or block is calculated using Hamming distance.
If the key contains a namespace, only the namespace is hashed.
d := hash(key) ^ blockId // or partId
The lookup process goes as follows:
- Find closest part
- Find closest block in part
This 2-step approach scales well for large datasets where many blocks are desired to reduce blocking.
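As a sketch, assuming 64-bit hashes and ids (the pseudocode above shows the raw XOR; the Hamming distance is the popcount of that XOR), the closest-match search might look like this. Names here are illustrative, not the project's actual code.

import "math/bits"

// hamming is the Hamming distance between a key hash and an id:
// the number of set bits in their XOR.
func hamming(keyHash, id uint64) int {
	return bits.OnesCount64(keyHash ^ id)
}

// closest returns the id with the smallest Hamming distance to keyHash.
// It is run twice: first over the partition ids, then over the chosen
// partition's block ids.
func closest(keyHash uint64, ids []uint64) uint64 {
	best, bestDist := ids[0], hamming(keyHash, ids[0])
	for _, id := range ids[1:] {
		if d := hamming(keyHash, id); d < bestDist {
			best, bestDist = id, d
		}
	}
	return best
}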
Re-partitioning (to do)
Each time the partition list is loaded, it must be compared to the configured partition count. If they do not match, a re-partitioning process must occur before serving connections.
- Create new manifest (partition:block list) in sub-directory
- Create new Store
- For each current part, re-map all keys to their new part
- Write each part after all keys are re-mapped
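A sketch of the re-mapping step, assuming a put function that re-derives a key's part & block from its hash (illustrative only, since this feature is still to do):

// remap copies every key from the old blocks into a new layout sized
// for the new partition count; put re-derives each key's part & block.
func remap(oldBlocks []map[string][]byte, put func(key string, value []byte)) {
	for _, blk := range oldBlocks {
		for k, v := range blk {
			put(k, v)
		}
	}
}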
Key expiry
The expires time is evaluated periodically. The period between scans can be configured using ExpiryScanPeriod, given as a number of seconds.
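A sketch of such a scan loop, assuming a block holds a mutex-guarded map of entries carrying an expiry timestamp; the types and names are illustrative, not the project's actual structures.

import (
	"sync"
	"time"
)

type entry struct {
	Value   []byte
	Expires int64 // unix seconds; 0 means no expiry
}

type block struct {
	mu      sync.Mutex
	entries map[string]entry
}

// scanExpired removes expired keys from the block once per period.
func (b *block) scanExpired(period time.Duration) {
	for now := range time.Tick(period) {
		b.mu.Lock()
		for k, e := range b.entries {
			if e.Expires != 0 && e.Expires <= now.Unix() {
				delete(b.entries, k)
			}
		}
		b.mu.Unlock()
	}
}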
Protocol
Msg
A normal operation is transmitted in the serialized form of protocol.Msg.
type Msg struct {
	Op      byte
	Status  byte
	Key     string
	Value   []byte
	Expires int64
}
Serialization
|       0       |       1       |       2       |       3       |
|0 1 2 3 4 5 6 7|0 1 2 3 4 5 6 7|0 1 2 3 4 5 6 7|0 1 2 3 4 5 6 7|
+---------------+---------------+---------------+---------------+
|     < OP >    |  < STATUS >   |  < EXPIRES UNIX UINT64        |
+---------------+---------------+                               +
|                                                               |
+                               +---------------+---------------+
|                             > |      < KEY LEN UINT16 >       |
+---------------+---------------+---------------+---------------+
| KEY ...                                                       |
+---------------------------------------------------------------+
| VALUE ...                                                     |
+---------------------------------------------------------------+
Op codes
| Byte | Meaning |
|------|---------|
| 0x01 | Close   |
| 0x02 | Auth    |
| 0x10 | Ping    |
| 0x11 | Pong    |
| 0x20 | Get     |
| 0x30 | Set     |
| 0x31 | SetAck  |
| 0x40 | Del     |
| 0x41 | DelAck  |
| 0x50 | List    |
Status codes
| Byte | Rune | Meaning      |
|------|------|--------------|
| 0x5F | _    | OK           |
| 0x2F | /    | StreamEnd    |
| 0x2E | .    | NotFound     |
| 0x21 | !    | Error        |
| 0x23 | #    | Unauthorized |
Endianness
Big endian
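As a sketch, a Msg could be encoded to this layout as follows, assuming the value simply occupies the remainder of the message (the diagram gives no value-length field). This is illustrative, not necessarily the project's actual serializer.

import (
	"bytes"
	"encoding/binary"
)

// encode writes a Msg in the wire layout described above,
// with all integers big endian.
func encode(m *Msg) []byte {
	buf := new(bytes.Buffer)
	buf.WriteByte(m.Op)
	buf.WriteByte(m.Status)
	binary.Write(buf, binary.BigEndian, uint64(m.Expires))
	binary.Write(buf, binary.BigEndian, uint16(len(m.Key)))
	buf.WriteString(m.Key)
	buf.Write(m.Value)
	return buf.Bytes()
}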
Scaling (in progress)
The aim is to support horizontal scaling to increase availability & load capacity.
For now, we will assume that each node is aware of every other node by configuration. Dynamic service discovery will follow later. Therefore, adding a node involves updating the configuration of all other nodes.
The current solution involves both the client & server.
Client
When a client wants to read or write a key, it executes the following process (sketched in code after the write steps).
Read
- Hash key
- Calculate closest node using the Rendezvous hash (Hamming)
- Request key from closest node
- Fallback to next node, recurring until successful or end of node-list is reached
Write
- Hash key
- Calculate closest 3 nodes using the Rendezvous hash (Hamming)
- Send the request concurrently to each node
- The operation can be considered complete when the desired number of nodes have acknowledged the request (sent an OK response). 1 node is not sufficient for consistency; 2 of 3 nodes is sufficient for eventual consistency.
- If a node does not respond, it can optionally be marked by the client as 'down' for a defined period, to prevent future requests timing-out.
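A sketch of the client-side node selection, reusing the Hamming-distance idea from the mapping section; the uint64 node ids and the function name are assumptions, not the project's actual code.

import (
	"math/bits"
	"sort"
)

// closestNodes returns node ids ordered by Hamming distance to keyHash.
// A read walks this list from the front; a write targets the first three.
func closestNodes(keyHash uint64, nodeIDs []uint64) []uint64 {
	ordered := append([]uint64(nil), nodeIDs...)
	sort.Slice(ordered, func(i, j int) bool {
		di := bits.OnesCount64(keyHash ^ ordered[i])
		dj := bits.OnesCount64(keyHash ^ ordered[j])
		return di < dj
	})
	return ordered
}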
RocketKV
The solution for eventual consistency is a little simpler for RocketKV. Eventual consistency is achieved by periodically replicating blocks to all other known nodes.
Low latency & consistency are achieved because the key:node mapping is deterministic.
For now, the modified date is used to determine causality. As long as nodes have somewhat synchronised clocks, this is perfectly adequate.
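A sketch of merging a replicated entry by modified date (last write wins); the replica type and Modified field are assumptions for illustration.

type replica struct {
	Value    []byte
	Modified int64 // unix seconds
}

// merge keeps the copy with the newer Modified timestamp; ties keep
// the local copy. Requires roughly synchronised clocks across nodes.
func merge(local, remote replica) replica {
	if remote.Modified > local.Modified {
		return remote
	}
	return local
}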
Service discovery (later)
Each RocketKV node and all clients would query a single service or cluster.
The single role of this service is to tell clients & RocketKV nodes which nodes currently exist, and their health.
Updates
Adding a node to the RocketKV network is as simple as spinning it up & making sure that this service knows about it.
A simple solution for automating this would be to include a key-pair in the configuration of each node. A new node can then securely notify this service of its presence.
An even simpler (but less secure) method would be the use of a shared secret.