blobmap

Version: v0.0.1 | Published: Sep 4, 2024 | License: GPL-3.0 | Imports: 6 | Imported by: 2

Repository

github.com/draganm/blobmap

README ¶

Blobmap

Blobmap is a specialized data structure for efficient storage and retrieval of Binary Large Objects (Blobs) within a contiguous keyspace of 64-bit unsigned integers (uint64). The keyspace begins at a specified value n and spans m consecutive keys, covering the range from n to n + m - 1.
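The key arithmetic implied by this range can be sketched as follows. This is an illustration only, not the package's code: indexForKey and errKeyOutOfRange are hypothetical names, and the real package may report out-of-range keys differently.

```go
package main

import (
	"errors"
	"fmt"
)

// errKeyOutOfRange is a hypothetical error value used only in this sketch.
var errKeyOutOfRange = errors.New("key out of range")

// indexForKey maps a key in [firstKey, firstKey+numKeys-1] to a zero-based slot.
func indexForKey(key, firstKey, numKeys uint64) (uint64, error) {
	// The lower bound is checked first so the subtraction cannot wrap around.
	if key < firstKey || key-firstKey >= numKeys {
		return 0, errKeyOutOfRange
	}
	return key - firstKey, nil
}

func main() {
	idx, err := indexForKey(1002, 1000, 5)
	fmt.Println(idx, err) // 2 <nil>

	_, err = indexForKey(1005, 1000, 5)
	fmt.Println(err) // key out of range
}
```

Comparing key-firstKey against numKeys, rather than computing firstKey+numKeys, keeps the check correct even when the keyspace ends at the largest uint64 value.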

The data is stored in a file that is memory-mapped read-only. Because the keys are consecutive, a key maps directly to a position in the offset table, so a lookup takes constant time (O(1)) regardless of how many blobs the file holds.

File Format

Structure Overview

1. Header

The file begins with a header containing metadata critical for blob management:

Number of blobs: The total count of blobs in the file.
Key offset: The starting key n for the keyspace.

2. Offset Table

An array of offset records follows the header. Each record stores the end offset of a blob, encoded as a 64-bit big-endian integer. The start of each blob is implicitly defined by the end offset of the preceding blob.

3. Blob Data

The blobs themselves are stored sequentially in the file. The data for each blob can be accessed by determining its byte range from the offset records.

4. Integrity Check

The file concludes with an xxHash checksum, covering all preceding data. This can be used to verify the integrity of the blobmap during reads.

Example Layout

+----------------+----------------------+-------------------+-------------+
|     Header     |    Offset Table      |    Blob Data      |   xxHash    |
+----------------+----------------------+-------------------+-------------+
| Num of Blobs   | End Offset of Blob 1 | Blob 1 Data Bytes | Hash Value  |
| Key Offset (n) | End Offset of Blob 2 | Blob 2 Data Bytes |             |
|                | End Offset of Blob 3 | Blob 3 Data Bytes |             |
+----------------+----------------------+-------------------+-------------+

Header: Stores the number of blobs and the starting key.
Offset Table: Defines the end offsets of each blob.
Blob Data: Contains the actual binary data of each blob, laid out sequentially.
xxHash: Provides a checksum to ensure data integrity.

Blob Access

To access a specific blob, compute its byte range using the corresponding offsets in the table:

The start of blob i is the end offset of blob i-1 (or immediately after the offset table for the first blob).
The end of blob i is the offset at position i in the table.

Locating a blob therefore takes at most two reads from the offset table and no searching, however many blobs the file contains.

Documentation ¶

Index ¶

type Builder
- func NewBuilder(fileName string, firstKey, numberOfKeys uint64) (*Builder, error)
- func (b *Builder) Add(key uint64, value []byte) error
- func (b *Builder) Build() error
type Reader
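- func Open(fileName string) (*Reader, error)
- func (r *Reader) Close() error
- func (r *Reader) FirstKey() uint64
- func (r *Reader) LastKey() uint64
- func (r *Reader) Read(key uint64) ([]byte, error)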

Types ¶

type Builder ¶

type Builder struct {
	// contains filtered or unexported fields
}

func NewBuilder ¶

func NewBuilder(fileName string, firstKey, numberOfKeys uint64) (*Builder, error)

func (*Builder) Add ¶

func (b *Builder) Add(key uint64, value []byte) error

func (*Builder) Build ¶

func (b *Builder) Build() error
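
The Builder methods carry no doc comments, so the following is only a sketch of one plausible way to use them. It assumes that Add is called once per key and that Build finalizes the file; adding keys in ascending order is a cautious choice here, not a documented requirement. The blobAdder interface copies the two method signatures above, so a *Builder would satisfy it, while fakeBuilder is a hypothetical in-memory stand-in that lets the sketch run without the package.

```go
package main

import "fmt"

// blobAdder lists the Add and Build signatures documented for Builder.
type blobAdder interface {
	Add(key uint64, value []byte) error
	Build() error
}

// writeConsecutive adds values[i] under key firstKey+i, then calls Build.
func writeConsecutive(b blobAdder, firstKey uint64, values [][]byte) error {
	for i, v := range values {
		key := firstKey + uint64(i)
		if err := b.Add(key, v); err != nil {
			return fmt.Errorf("add key %d: %w", key, err)
		}
	}
	return b.Build()
}

// fakeBuilder is a hypothetical stand-in that records what it was asked to do.
type fakeBuilder struct {
	keys  []uint64
	built bool
}

func (f *fakeBuilder) Add(key uint64, value []byte) error {
	f.keys = append(f.keys, key)
	return nil
}

func (f *fakeBuilder) Build() error {
	f.built = true
	return nil
}

func main() {
	fb := &fakeBuilder{}
	err := writeConsecutive(fb, 1000, [][]byte{[]byte("a"), []byte("b")})
	fmt.Println(fb.keys, fb.built, err) // [1000 1001] true <nil>
}
```

With the real package, the value returned by NewBuilder(fileName, firstKey, numberOfKeys) would take the place of fakeBuilder, with firstKey matching the one passed to writeConsecutive and numberOfKeys matching len(values).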

type Reader ¶

type Reader struct {
	// contains filtered or unexported fields
}

func Open ¶

func Open(fileName string) (*Reader, error)

func (*Reader) Close ¶

func (r *Reader) Close() error

func (*Reader) FirstKey ¶

func (r *Reader) FirstKey() uint64

func (*Reader) LastKey ¶

func (r *Reader) LastKey() uint64

func (*Reader) Read ¶

func (r *Reader) Read(key uint64) ([]byte, error)
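
A sketch of walking the whole keyspace through the Reader methods. The blobReader interface copies the FirstKey, LastKey and Read signatures above, so a *Reader would satisfy it; totalBytes and fakeReader are hypothetical names introduced for illustration. The sketch assumes LastKey is inclusive, in line with the n to n + m - 1 range in the README, and it treats an inverted range as empty because the behaviour for a file with zero keys is not documented.

```go
package main

import "fmt"

// blobReader lists three of the method signatures documented for Reader.
type blobReader interface {
	FirstKey() uint64
	LastKey() uint64
	Read(key uint64) ([]byte, error)
}

// totalBytes reads every key from FirstKey to LastKey and sums the blob sizes.
func totalBytes(r blobReader) (uint64, error) {
	first, last := r.FirstKey(), r.LastKey()
	if last < first {
		return 0, nil // defensive: treat an inverted range as empty
	}
	var total uint64
	// Break on key == last instead of looping while key <= last, so the
	// loop still terminates when last is the largest uint64 value.
	for key := first; ; key++ {
		blob, err := r.Read(key)
		if err != nil {
			return 0, fmt.Errorf("read key %d: %w", key, err)
		}
		total += uint64(len(blob))
		if key == last {
			break
		}
	}
	return total, nil
}

// fakeReader is a hypothetical in-memory stand-in for a blobmap file.
type fakeReader struct {
	first uint64
	blobs [][]byte
}

func (f fakeReader) FirstKey() uint64 { return f.first }
func (f fakeReader) LastKey() uint64  { return f.first + uint64(len(f.blobs)) - 1 }
func (f fakeReader) Read(key uint64) ([]byte, error) {
	if key < f.first || key-f.first >= uint64(len(f.blobs)) {
		return nil, fmt.Errorf("key %d out of range", key)
	}
	return f.blobs[key-f.first], nil
}

func main() {
	r := fakeReader{first: 1000, blobs: [][]byte{[]byte("a"), nil, []byte("xyz")}}
	total, err := totalBytes(r)
	fmt.Println(total, err) // 4 <nil>
}
```

With the real package, the value returned by Open(fileName) would replace fakeReader, and the caller would also call Close on it when done.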
