Documentation ¶
Overview ¶
Package rsync provides an implementation of the rsync algorithm as described in Andrew Tridgell's thesis (https://www.samba.org/~tridge/phd_thesis.pdf) and the rsync technical report (https://rsync.samba.org/tech_report). Rsync algorithmic functionality is provided by the Engine type, and a transport protocol for pipelined rsync operations is provided by the Transmit function and Receiver types.
Index ¶
- Constants
- func DecodeToReceiver(decoder Decoder, count uint64, receiver Receiver) error
- func OptimalBlockSizeForBase(base io.Seeker) (uint64, error)
- func OptimalBlockSizeForBaseLength(baseLength uint64) uint64
- func Transmit(root string, paths []string, signatures []Signature, receiver Receiver) error
- type BlockHash
- type Decoder
- type Encoder
- type Engine
- func (e *Engine) BytesSignature(base []byte, blockSize uint64) Signature
- func (e *Engine) Deltafy(target io.Reader, base Signature, maxDataOpSize uint64, ...) error
- func (e *Engine) DeltafyBytes(target []byte, base Signature, maxDataOpSize uint64) []Operation
- func (e *Engine) Patch(destination io.Writer, base io.ReadSeeker, signature Signature, ...) error
- func (e *Engine) PatchBytes(base []byte, signature Signature, delta []Operation) ([]byte, error)
- func (e *Engine) Signature(base io.Reader, blockSize uint64) (Signature, error)
- type Monitor
- type Operation
- type OperationTransmitter
- type Receiver
- func NewEncodingReceiver(encoder Encoder) Receiver
- func NewMonitoringReceiver(receiver Receiver, paths []string, monitor Monitor) Receiver
- func NewPreemptableReceiver(receiver Receiver, run context.Context) Receiver
- func NewReceiver(root string, paths []string, signatures []Signature, sinker Sinker) (Receiver, error)
- type ReceiverStatus
- func (*ReceiverStatus) Descriptor() ([]byte, []int)
- func (s *ReceiverStatus) EnsureValid() error
- func (m *ReceiverStatus) GetPath() string
- func (m *ReceiverStatus) GetReceived() uint64
- func (m *ReceiverStatus) GetTotal() uint64
- func (*ReceiverStatus) ProtoMessage()
- func (m *ReceiverStatus) Reset()
- func (m *ReceiverStatus) String() string
- func (m *ReceiverStatus) XXX_DiscardUnknown()
- func (m *ReceiverStatus) XXX_Marshal(b []byte, deterministic bool) ([]byte, error)
- func (dst *ReceiverStatus) XXX_Merge(src proto.Message)
- func (m *ReceiverStatus) XXX_Size() int
- func (m *ReceiverStatus) XXX_Unmarshal(b []byte) error
- type Signature
- type Sinker
- type Transmission
Constants ¶
const ( // DefaultBlockSize is the default block size that will be used if a zero // value is passed into Engine.Signature for the blockSize parameter. DefaultBlockSize = 1 << 13 // DefaultMaximumDataOperationSize is the default maximum data size // permitted per operation. The optimal value for this isn't at all // correlated with block size - it's just what's reasonable to hold // in-memory and pass over the wire in a single transmission. This value // will be used if a zero value is passed into Engine.Deltafy for the // maxDataOpSize parameter. DefaultMaximumDataOperationSize = 1 << 16 )
Variables ¶
This section is empty.
Functions ¶
func DecodeToReceiver ¶
DecodeToReceiver decodes messages from the specified Decoder and forwards them to the specified receiver. It must be passed the number of files to be received so that it knows when forwarding is complete. It is designed to be used with an encoding receiver, such as that returned by NewEncodingReceiver. It finalizes the provided receiver before returning.
func OptimalBlockSizeForBase ¶
OptimalBlockSizeForBase is a convenience function that will determine the optimal block size for a base that implements io.Seeker. It calls down to OptimalBlockSizeForBaseLength. After determining the base's length, it will attempt to reset the base to its original position.
func OptimalBlockSizeForBaseLength ¶
OptimalBlockSizeForBaseLength uses a simpler heuristic to choose a block size based on the base length. It starts by choosing the optimal block length using the formula given in the rsync thesis. It then enforces that the block size is within a sensible range. TODO: Should we add rounding to "nice" values, e.g. the nearest multiple of 1024 bytes? Would this improve read throughput?
Types ¶
type BlockHash ¶
type BlockHash struct { // Weak is the weak hash for the block. Weak uint32 // Strong is the strong hash for the block. Strong [sha1.Size]byte }
BlockHash represents a pair of weak and strong hash for a base block.
type Decoder ¶
type Decoder interface { // Decode decodes a value and should adhere to gob semantics. Decode(value interface{}) error }
Decoder is the decoding interface used by DecodeToReceiver. It should adhere to gob semantics.
type Encoder ¶
type Encoder interface { // Encode encodes a value and should adhere to gob semantics. It should not // retain the value across calls because the value's internal buffers will // be re-used. Encode(value interface{}) error }
Encoder is the encoding interface used by encoding receivers. It should adhere to gob semantics.
type Engine ¶
type Engine struct {
// contains filtered or unexported fields
}
Engine provides rsync functionality without any notion of transport. It is designed to be re-used to avoid heavy buffer allocation.
func (*Engine) BytesSignature ¶
BytesSignature computes the signature for a byte slice.
func (*Engine) Deltafy ¶
func (e *Engine) Deltafy(target io.Reader, base Signature, maxDataOpSize uint64, transmit OperationTransmitter) error
Deltafy computes delta operations to reconstitute the target data stream using the base stream (based on the provided base signature). It streams operations to the provided transmission function. The internal engine buffer will be resized to the sum of the maximum data operation size plus the block size, and retained for the lifetime of the engine, so a reasonable value should be provided. The data buffer passed to the transmission function is reused, so the transmission function should transmit or make a copy of the data before returning.
func (*Engine) DeltafyBytes ¶
DeltafyBytes computes delta operations for a byte slice. Unlike the streaming Deltafy method, it returns a slice of operations, which should be reasonable since the target data can already fit into memory.
func (*Engine) Patch ¶
func (e *Engine) Patch(destination io.Writer, base io.ReadSeeker, signature Signature, operation Operation) error
Patch applies a single operation against a base stream to reconstitute the target into the destination stream.
func (*Engine) PatchBytes ¶
Patch applies a series of operations against a base byte slice to reconstitute the target byte slice.
type Monitor ¶
type Monitor func(*ReceiverStatus) error
Monitor is the interface that monitors must implement to capture status information from a monitoring receiver. The argument provided to this function will be allocated on each update and can be kept by the monitoring callback. There's no point in attempting to re-use the allocated argument because (a) it would be complicated and the callback would most likely just copy it anyway and (b) it will only be allocated once per received file, and the per-file allocations are already significantly higher.
type Operation ¶
type Operation struct { // Data contains data for data operations. If its length is 0, the operation // is assumed to be a non-data operation. Operation transmitters and // receivers may thus treat a length-0 buffer as semantically equivalent to // a nil buffer and utilize that fact to efficiently re-use buffer capacity, // e.g. by truncating the buffer and doing a gob receive into it. Data []byte // Start is the 0-indexed starting block for block operations. Start uint64 // Count is the number of blocks for block operations. Count uint64 }
Operation represents an rsync operation, which can be either a data operation or a block operation.
type OperationTransmitter ¶
OperationTransmitter transmits an operation. Operation data buffers are re-used between calls to the transmitter, so the transmitter should not return until it has either transmitted the data buffer (if any) or copied it for later transmission.
type Receiver ¶
type Receiver interface { // Receive processes a single message in a transmission stream. Receive(message Transmission) error // contains filtered or unexported methods }
Receiver manages the streaming reception of multiple files. It should be used in conjunction with the Transmit function.
func NewEncodingReceiver ¶
NewEncodingReceiver creates a new receiver that handles messages by encoding them with the specified Encoder. It is designed to be used with DecodeToReceiver.
func NewMonitoringReceiver ¶
NewMonitoringReceiver wraps a receiver and provides monitoring information via a callback.
func NewPreemptableReceiver ¶
NewPreemptableReceiver wraps a receiver and aborts on Receive if the specified context has been cancelled.
type ReceiverStatus ¶
type ReceiverStatus struct { // Path is the path currently being received. Path string `protobuf:"bytes,1,opt,name=path,proto3" json:"path,omitempty"` // Received is the number of paths that have already been received. Received uint64 `protobuf:"varint,2,opt,name=received,proto3" json:"received,omitempty"` // Total is the total number of paths expected. Total uint64 `protobuf:"varint,3,opt,name=total,proto3" json:"total,omitempty"` XXX_NoUnkeyedLiteral struct{} `json:"-"` XXX_unrecognized []byte `json:"-"` XXX_sizecache int32 `json:"-"` }
ReceivingStatus encodes that status of an rsync receiver.
func (*ReceiverStatus) Descriptor ¶
func (*ReceiverStatus) Descriptor() ([]byte, []int)
func (*ReceiverStatus) EnsureValid ¶
func (s *ReceiverStatus) EnsureValid() error
EnsureValid ensures that ReceiverStatus' invariants are respected.
func (*ReceiverStatus) GetPath ¶
func (m *ReceiverStatus) GetPath() string
func (*ReceiverStatus) GetReceived ¶
func (m *ReceiverStatus) GetReceived() uint64
func (*ReceiverStatus) GetTotal ¶
func (m *ReceiverStatus) GetTotal() uint64
func (*ReceiverStatus) ProtoMessage ¶
func (*ReceiverStatus) ProtoMessage()
func (*ReceiverStatus) Reset ¶
func (m *ReceiverStatus) Reset()
func (*ReceiverStatus) String ¶
func (m *ReceiverStatus) String() string
func (*ReceiverStatus) XXX_DiscardUnknown ¶
func (m *ReceiverStatus) XXX_DiscardUnknown()
func (*ReceiverStatus) XXX_Marshal ¶
func (m *ReceiverStatus) XXX_Marshal(b []byte, deterministic bool) ([]byte, error)
func (*ReceiverStatus) XXX_Merge ¶
func (dst *ReceiverStatus) XXX_Merge(src proto.Message)
func (*ReceiverStatus) XXX_Size ¶
func (m *ReceiverStatus) XXX_Size() int
func (*ReceiverStatus) XXX_Unmarshal ¶
func (m *ReceiverStatus) XXX_Unmarshal(b []byte) error
type Signature ¶
type Signature struct { // BlockSize is the block size used to compute the signature. BlockSize uint64 // LastBlockSize is the size of the last block in the signature. LastBlockSize uint64 // Hashes are the hashes of the blocks in the base. Hashes []BlockHash }
Signature represents an rsync base signature. It encodes the block size used to generate the signature, the size of the last block in the signature (which may be smaller than a full block), and the hashes for the blocks of the file.
type Sinker ¶
type Sinker interface { // Sink should return a new io.WriteCloser for staging the given path. Each // result it returns will be closed before Sink is invoked again. Sink(path string) (io.WriteCloser, error) }
Sinker provides the interface for a receiver to store incoming files.
type Transmission ¶
type Transmission struct { // Done indicates that the operation stream for the current file is // finished. If set, there will be no operation in the response, but there // may be an error. Done bool // Operation is the next operation in the stream for the current file. Operation Operation // Error indicates that a non-terminal error has occurred. It will only be // present if Done is true. Error string }
Transmission represents a single message in a transmission stream. Its internals are public to allow for transmission using a reflection-based encoder (such as gob), but it should otherwise be treated as an opaque type with a private implementation.