fdbq

command module
v0.2.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jun 9, 2023 License: MIT Imports: 1 Imported by: 0

README

FDBQ

FDBQ provides a query language and an alternative client API for Foundation DB. Some things this project aims to achieve are:

  • Provide a query language for FDB.
  • Provide a textual description of key-value schemas.
  • Provide a Go API which is structurally equivalent to the query language.
  • Simplify the ergonomics of the FoundationDB API.
    • Gracefully handle multi-transaction range-reads.
    • Gracefully handle transient errors.
  • Provide an environment for exploring FDB data.
  • Import/Export subsets of FDB data.

Building & Running

Without Docker

With the Foundation DB client library (>= v6.2.0) and Go (>= v1.20) installed, you can simply run go build in the root of this repo. This will create an fdbq binary in the root of the repo.

Docker Environment

Building, linting, and testing can all be performed in a Docker environment. This allows any host to perform these operations with only Docker as a dependency. The build.sh script can be used to perform these operations. This is the same script used by the CI/CD workflow of this repo.

To build, lint, & test the current state of the codebase, run ./build.sh --verify. To learn more about the build script, run ./build.sh --help.

Docker Image

FDBQ is available as a Docker image for executing queries. The first argument passed to the container is the contents of the cluster file. The remaining arguments are passed to the FDBQ binary.

# 'my_cluster:baoeA32@172.20.3.33:4500' is used as the contents
# for the cluster file. '-log' and '/my/dir{<>}=42' are passed
# as args to the FDBQ binary.
docker run docker.io/janderland/fdbq 'my_cluster:baoeA32@172.20.3.33:4500' -log '/my/dir{<>}=42'

Within the cluster file contents (first argument), any instances of a hostname wrapped in curly braces (e.g. '{my_hostname}') are replaced by the equivalent IP address. FDB doesn't support connecting to a cluster via hostnames, so this functional provides a workaround. This can simplify connecting to a Docker instance of FDB.

docker network create my_net
docker run --network my_net --name fdb -d foundationdb/foundationdb

# The substring '{fdb}' in the first argument will be replaced with
# the IP address of the FDB container started above before the cluster
# file is written to disk.
docker run --network my_net docker.io/janderland/fdbq 'docker:docker@{fdb}:4500' -log '/my/dir{<>}=42'

Query Language

Here is the syntax definition for the query language. Currently, FDBQ is focused on reading & writing key-values created using the directory and tuple layers. Reading or writing keys of arbitrary byte strings is not supported.

FDBQ queries are a textual representation of a specific key-value or a schema describing the structure of many key-values. These queries have the ability to write a key-value, read one or more key-values, and list directories.

Components & Structure

This section will explain the components and structure of an FDBQ query. The semantic meaning of these queries will be explained below in the Kinds of Queries section.

Primitives

FDBQ utilizes textual representations of the element types supported by the tuple layer. These types are known as primitives. Besides as tuple elements, primitives can also be used as the value portion of a key-value.

Type Example
nil nil
int -14
uint 7
bool true
float 33.4
string "string"
bytes 0xa2bff2438312aac032
uuid 5a5ebefd-2193-47e2-8def-f464fc698e31

When primitives are used as tuple elements, they are encoded using the tuple layer. When they are used as the value portion of a key-value, they are encoded by FDBQ as outlined below.

Type Encoding
nil nil
int 64-bit, endianness configurable
uint 64-bit, endianness configurable
bool single bit, 0 means false
float IEEE 754, endianness configurable
string ASCII byte string
bytes As provided
uuid 16-byte string

Ideally, the encoding of these primitives would align with common community practices to maximize usefulness. Let me know if you believe it doesn't.

Even though a big int encoding is supported by the tuple layer, FDBQ does not currently support using big ints.

Directories

A directory is specified as a sequence of strings, each prefixed by a forward slash:

/my/dir/path_way

The strings of the directory do not need quotes if they only contain alphanumericals, underscores, dashes, or periods. To use other symbols, the strings must be quoted:

/my/"dir@--\o/"/path_way

The quote character may be backslash escaped:

/my/"\"dir\""/path_way
Tuples

A tuple is specified as a sequence of elements, separated by commas, wrapped in a pair of curly braces. The elements may be a tuple or any of the primitive types.

{"one", 2, 0x03, { "subtuple" }, 5825d3f8-de5b-40c6-ac32-47ea8b98f7b4}

The last element of a tuple may be the ... token.

{0xFF, "thing", ...}

Any combination of spaces, tabs, and newlines is allowed after the opening
brace and commas.

{
  1,
  2,
  3,
}
Key-Values

A key-value is specified as a directory, tuple, equal symbol, and value appended together:

/my/dir{"this", 0}=0xabcf03

The value following the equal symbol may be any of the primitives or a tuple:

/my/dir{22.3, -8}={"another", "tuple"}

The value can also be the clear token.

/some/where{"home", "town", 88.3}=clear
Variables

A variable may be used in place of a directory element, tuple element, or value.

/my/dir/<>{"first", <>, "third"}=<>

If the variable is a tuple element or value, it may contain a list of primitive types separated by pipes, except for the nil type. The variable may also contain the any type which is equivalent to specifying every type. Specifying no types is also equivalent to specifying the any type.

/my/dir{"that", <int|float|bytes>}=<any>
Kinds of Queries

This section showcases the various kinds of FDBQ queries, their semantic meaning, and the equivalent FDB API calls implemented in Go.

Set

Set queries write a single key-value. The query must not contain the clear or ... tokens, nor a variable.

/my/dir{"hello", "world"}=42
db.Transact(func(tr fdb.Transaction) (interface{}, error) {
  dir, err := directory.CreateOrOpen(tr, []string{"my", "dir"}, nil)
  if err != nil {
    return nil, err
  }

  val := make([]byte, 8)
  binary.LittleEndian.PutUint64(val, 42)
  tr.Set(dir.Pack(tuple.Tuple{"hello", "world"}), val)
  return nil, nil
})
Clear

Clear queries delete a single key-value. The query must contain the clear token as it's value and must not contain the ... token or variables.

/my/dir{"hello", "world"}=clear
db.Transact(func(tr fdb.Transaction) (interface{}, error) {
  dir, err := directory.Open(tr, []string{"my", "dir"}, nil)
  if err != nil {
    if errors.Is(err, directory.ErrDirNotExists) {
      return nil, nil
    }
    return nil, err
  }

  tr.Clear(dir.Pack(tuple.Tuple{"hello", "world"}))
  return nil, nil
})
Read Single Key

Read-single queries read a single key-value. These queries must not have the ... token or a variable in their key. The value must be a variable.
Deserialization of the value is attempted for each type in the order specified by the variable. The first successful deserialization is used as the output. If the value cannot be deserialized as any of the types specified then the key-value is not returned or an error is returned, depending on configuration.

/my/dir{99.8, 7dfb10d1-2493-4fb5-928e-889fdc6a7136}=<int|string>
db.Transact(func(tr fdb.Transaction) (interface{}, error) {
  dir, err := directory.Open(tr, []string{"my", "dir"}, nil)
  if err != nil {
    if errors.Is(err, directory.ErrDirNotExists) {
      return nil, nil
    }
    return nil, err
  }

  val := tr.MustGet(dir.Pack(tuple.Tuple{99.8,
    tuple.UUID{0x7d, 0xfb, 0x10, 0xd1, 0x24, 0x93, 0x4f, 0xb5, 0x92, 0x8e, 0x88, 0x9f, 0xdc, 0x6a, 0x71, 0x36}))
  
     
  if len(val) == 8 {
      return binary.LittleEndian.Uint64(val), nil
  }
  return string(val), nil
})

As a shorthand, these query may be specified without the = token or value. This implies an empty variable as the value. In the code block below, the three queries are equivalent.

/my/dir{99.8, 7dfb10d1-2493-4fb5-928e-889fdc6a7136}
/my/dir{99.8, 7dfb10d1-2493-4fb5-928e-889fdc6a7136}=<>
/my/dir{99.8, 7dfb10d1-2493-4fb5-928e-889fdc6a7136}=<any>
Read Range of Keys

Read-many queries read a range of values based on a key prefix. These queries have a ... token or a variable in their key. If a key-value is encountered which does not match the schema defined by the query then the key-value is not returned or an error is returned, depending on configuration. These queries are implemented using FDB's range-read mechanism with additional filtering performed on the client. Care must be taken with these queries as they may result in large amounts of data being sent to the client and most of the data being filtered out.

/people{3392, <string|int>, <>}={<uint>, ...}
db.ReadTransact(func(tr fdb.ReadTransaction) (interface{}, error) {
  dir, err := directory.Open(tr, []string{"people"}, nil)
  if err != nil {
    if errors.Is(err, directory.ErrDirNotExists) {
      return nil, nil
    }
    return nil, err
  }

  rng, err := fdb.PrefixRange(dir.Pack(tuple.Tuple{3392}))
  if err != nil {
    return nil, err
  }

  var results []fdb.KeyValue
  iter := tr.GetRange(rng, fdb.RangeOptions{}).Iterator()
  for iter.Advance() {
    kv := iter.MustGet()

    tup, err := dir.Unpack(kv.Key)
    if err != nil {
      return nil, err
    }

    if len(tup) != 3 {
      return nil, fmt.Errorf("invalid kv: %v", kv)
    }

    switch tup[0].(type) {
    default:
      return nil, fmt.Errorf("invalid kv: %v", kv)
    case string | int64:
    }

    val, err := tuple.Unpack(kv.Value)
    if err != nil {
      return nil, fmt.Errorf("invalid kv: %v", kv)
    }
    if len(val) == 0 {
      return nil, fmt.Errorf("invalid kv: %v", kv)
    }
    if _, isInt := val[0].(uint64); !isInt {
      return nil, fmt.Errorf("invalid kv: %v", kv)
    }

    results = append(results, kv)
  }
  return results, nil
})
List Directory Paths

If only a directory is provided as a query, then the directory layer is queried. Empty variables may be included as placeholders for any directory name.

/root/<>/items/<>
db.ReadTransact(func(tr fdb.ReadTransaction) (interface{}, error) {
  root, err := directory.Open(tr, []string{"root"}, nil)
  if err != nil {
    if errors.Is(err, directory.ErrDirNotExists) {
      return nil, nil
    }
    return nil, err
  }

  oneDeep, err := root.List(tr, nil)
  if err != nil {
    return nil, err
  }

  var results [][]string
  for _, dir1 := range oneDeep {
    twoDeep, err := root.List(tr, []string{dir1, "items"})
    if err != nil {
      return nil, err
    }

    for _, dir2 := range twoDeep {
      results = append(results, []string{"root", dir1, dir2})
    }
  }
  return results, nil
})

Documentation

Overview

fdbq is a query language for Foundation DB

Usage:

fdbq [flags] query ...

Flags:

-b, --bytes            print full byte strings instead of just their length
-c, --cluster string   path to cluster file
-h, --help             help for fdbq
    --limit int        limit the number of KVs read in range-reads
-l, --little           encode/decode values as little endian instead of big endian
    --log              perform debug logging
-r, --reverse          query range-reads in reverse order
-s, --strict           throw an error if a KV is read which doesn't match the schema
-w, --write            allow write queries

Directories

Path Synopsis
Package engine executes queries.
Package engine executes queries.
facade
Package facade provide interfaces for FDB APIs.
Package facade provide interfaces for FDB APIs.
stream
Package stream performs range-reads and filtering.
Package stream performs range-reads and filtering.
internal
app
Package keyval contains types representing key-values and related utilities.
Package keyval contains types representing key-values and related utilities.
class
Package class classifies a key-value by the kind of operation it represents.
Package class classifies a key-value by the kind of operation it represents.
compare
Package compare validates tuples against a schema.
Package compare validates tuples against a schema.
convert
Package convert converts between FDBQ and FDB types.
Package convert converts between FDBQ and FDB types.
operation
Operation generates the interfaces and glue-code for the visitor pattern.
Operation generates the interfaces and glue-code for the visitor pattern.
values
Package values serializes and deserializes values.
Package values serializes and deserializes values.
Package parser converts query strings into key-values.
Package parser converts query strings into key-values.
format
Package format converts key-values into query strings.
Package format converts key-values into query strings.
scanner
Package scanner tokenizes query strings.
Package scanner tokenizes query strings.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL