postgresql

package

Version: v1.7.0 (latest) | Published: Nov 25, 2024 | License: Apache-2.0 | Imports: 33 | Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version

Repository

github.com/google/trillian

README

PostgreSQL storage implementation

Origin

This storage implementation, added in PR #3644, began as a fork of the MySQL storage implementation after PR #3201 was merged.

Motivation

Sectigo operates Certificate Transparency logs that run Trillian with MariaDB, using the MySQL storage implementation. One log's MariaDB database suffered unrecoverable corruption as a result of disk space exhaustion, and another log has struggled to sequence entries quickly enough. Sectigo has more experience with PostgreSQL, believes that PostgreSQL databases are not vulnerable to corruption due to disk space exhaustion, and anticipates that PostgreSQL can achieve significantly greater sequencing throughput.

Database driver

The pgx driver is used directly, rather than through the standard database/sql interface. This gives better performance and provides access to PostgreSQL-specific features such as COPY.

Major changes compared to the MySQL storage implementation

Implemented bulk processing to greatly improve performance, making use of COPY, temporary tables, and database functions.
Switched to batched queuing, for further performance gains.
Removed SQL statement caching, because pgx does this itself automatically.
Removed several vestigial features (e.g., pre-#3201 subtree revisions).
Forked storage/testdb to storage/postgresql/testdbpgx, because the former only supports the database/sql interface.

Bulk processing

The QueueLeaves, AddSequencedLeaves, UpdateSequencedLeaves, and storeSubtrees functions all operate on sets of records. The individual INSERT statements inherited from the MySQL storage implementation have been replaced by the use of PostgreSQL's COPY interface, which bulk-loads data more efficiently and in far fewer network round trips.
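As an illustration of how a set of records can be fed to COPY through pgx, the following sketch defines a row source with the Next, Values, and Err methods that pgx's CopyFromSource interface requires. The queuedLeaf type and its fields are hypothetical and are not taken from the Trillian code; the block uses only the standard library so that it runs without a database.

```go
package main

import "fmt"

// queuedLeaf is a hypothetical record type used only for this sketch.
type queuedLeaf struct {
	TreeID       int64
	IdentityHash []byte
	Value        []byte
}

// leafSource streams a slice of leaves one row at a time. Its method set
// (Next, Values, Err) matches what pgx's CopyFromSource interface expects.
type leafSource struct {
	leaves []queuedLeaf
	idx    int
}

func newLeafSource(leaves []queuedLeaf) *leafSource {
	// Start before the first row; Next advances to it.
	return &leafSource{leaves: leaves, idx: -1}
}

// Next advances to the next row and reports whether one exists.
func (s *leafSource) Next() bool {
	s.idx++
	return s.idx < len(s.leaves)
}

// Values returns the column values for the current row.
func (s *leafSource) Values() ([]any, error) {
	l := s.leaves[s.idx]
	return []any{l.TreeID, l.IdentityHash, l.Value}, nil
}

// Err reports any error hit while iterating; an in-memory slice has none.
func (s *leafSource) Err() error { return nil }

func main() {
	src := newLeafSource([]queuedLeaf{
		{TreeID: 1, IdentityHash: []byte{0x01}, Value: []byte("leaf one")},
		{TreeID: 1, IdentityHash: []byte{0x02}, Value: []byte("leaf two")},
	})
	rows := 0
	for src.Next() {
		if _, err := src.Values(); err != nil {
			panic(err)
		}
		rows++
	}
	fmt.Println("rows streamed:", rows)
}
```

With pgx, a source like this would be passed as the final argument to CopyFrom, alongside a table identifier and a list of column names, so that the whole set travels to the server in one COPY operation instead of one INSERT per record.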

QueueLeaves, AddSequencedLeaves, and storeSubtrees each bulk-load data into temporary tables that are bound to a single transaction. This approach enables each function to perform its processing efficiently, after which the processed data is written to the real tables.
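One way PostgreSQL can bind a temporary table to a single transaction is the ON COMMIT DROP clause of CREATE TEMP TABLE. The sketch below only builds such a statement as a string; the table and column names are hypothetical, and whether the Trillian code uses exactly this clause is an assumption.

```go
package main

import (
	"fmt"
	"strings"
)

// column describes one column of a hypothetical temporary table.
type column struct {
	Name string
	Type string
}

// tempTableDDL builds a CREATE TEMP TABLE statement whose table is dropped
// automatically when the enclosing transaction commits or rolls back.
// Names are interpolated unquoted, so they must be trusted constants.
func tempTableDDL(table string, cols []column) string {
	defs := make([]string, len(cols))
	for i, c := range cols {
		defs[i] = c.Name + " " + c.Type
	}
	return fmt.Sprintf("CREATE TEMP TABLE %s (%s) ON COMMIT DROP",
		table, strings.Join(defs, ", "))
}

func main() {
	fmt.Println(tempTableDDL("temp_queue_leaves", []column{
		{Name: "tree_id", Type: "BIGINT"},
		{Name: "leaf_identity_hash", Type: "BYTEA"},
		{Name: "leaf_value", Type: "BYTEA"},
	}))
}
```

Within one transaction, the statement would be executed first, the batch would then be bulk-loaded into the temporary table with COPY, and the processed rows would finally be written to the real tables before the commit.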

QueueLeaves and AddSequencedLeaves each use a corresponding PL/pgSQL function to perform multiple processing steps involving the temporary tables, which includes the leaf deduplication logic. This could all instead have been implemented as multiple SQL statements called from the Go code, but the approach taken reduces the number of network round trips and the amount of data being transferred to and from the database, and therefore improves performance.
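To make the idea of set-based leaf deduplication concrete, here is a small in-memory model written in Go. It is not the PL/pgSQL function and does not reflect its actual steps; the dedupe function, the use of strings for identity hashes, and the stored set are all hypothetical stand-ins for what the database does against its tables.

```go
package main

import "fmt"

// dedupe splits a batch of leaf identity hashes into those that are new and
// the batch positions of those that duplicate either an already stored leaf
// or an earlier entry in the same batch.
func dedupe(stored map[string]struct{}, batch []string) (fresh []string, dups []int) {
	seen := make(map[string]struct{}, len(batch))
	for i, h := range batch {
		_, inStore := stored[h]
		_, inBatch := seen[h]
		if inStore || inBatch {
			dups = append(dups, i)
			continue
		}
		seen[h] = struct{}{}
		fresh = append(fresh, h)
	}
	return fresh, dups
}

func main() {
	stored := map[string]struct{}{"a": {}}
	fresh, dups := dedupe(stored, []string{"a", "b", "b", "c"})
	fmt.Println("new leaves:", fresh)
	fmt.Println("duplicate positions:", dups)
}
```

Running the equivalent logic inside the database means that only the batch itself and the per-leaf outcome need to cross the network, which is the round-trip saving described above.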

AddSequencedLeaves avoids having to use (and sometimes roll back) savepoints, which further improves performance compared to the equivalent MySQL implementation.

Documentation

Overview

Package postgresql provides a PostgreSQL-based storage layer implementation.

Index

func GetDatabase() (*pgxpool.Pool, error)
func NewAdminStorage(db *pgxpool.Pool) *postgresqlAdminStorage
func NewLogStorage(db *pgxpool.Pool, mf monitoring.MetricFactory) storage.LogStorage
func OpenDB(dbURL string) (*pgxpool.Pool, error)

Functions

func GetDatabase

func GetDatabase() (*pgxpool.Pool, error)

GetDatabase returns an instance of the PostgreSQL database, or creates one.

TODO(robstradling): Make the dependency of the PostgreSQL quota provider on the PostgreSQL storage provider explicit.

func NewAdminStorage

func NewAdminStorage(db *pgxpool.Pool) *postgresqlAdminStorage

NewAdminStorage returns a PostgreSQL storage.AdminStorage implementation backed by db.

func NewLogStorage

func NewLogStorage(db *pgxpool.Pool, mf monitoring.MetricFactory) storage.LogStorage

NewLogStorage creates a storage.LogStorage instance backed by the given PostgreSQL connection pool. It assumes that storage.AdminStorage is backed by the same PostgreSQL database.

func OpenDB

func OpenDB(dbURL string) (*pgxpool.Pool, error)

OpenDB opens a database connection pool for all PostgreSQL-based storage implementations.
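OpenDB receives its connection details as a single URL string. The sketch below shows one way to assemble such a string with the standard library's net/url package so that special characters in the password are percent-encoded. It assumes that OpenDB accepts the postgresql:// URL form that pgx parses; the buildDBURL helper, host, database name, credentials, and sslmode parameter are all made up for illustration.

```go
package main

import (
	"fmt"
	"net/url"
)

// buildDBURL is a hypothetical helper that assembles a PostgreSQL connection
// URL, percent-encoding the credentials and query parameters as needed.
func buildDBURL(user, password, hostPort, database string, params map[string]string) string {
	q := url.Values{}
	for k, v := range params {
		q.Set(k, v)
	}
	u := url.URL{
		Scheme:   "postgresql",
		User:     url.UserPassword(user, password),
		Host:     hostPort,
		Path:     "/" + database,
		RawQuery: q.Encode(), // Encode sorts parameters by key.
	}
	return u.String()
}

func main() {
	dbURL := buildDBURL("trillian", "p@ss/word", "db.example.com:5432",
		"trillian_db", map[string]string{"sslmode": "require"})
	fmt.Println(dbURL)
}
```

The resulting string could then be passed to OpenDB, and the returned pool handed to NewLogStorage and NewAdminStorage so that both share the same database.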

Directories

Path	Synopsis
testdbpgx	Package testdbpgx creates new PostgreSQL databases for tests.
