Documentation ¶
Overview ¶
Package window implements windowing constructs. In the world of data processing on an unbounded stream, Windowing is a concept of grouping data using temporal boundaries. We use event-time to discover temporal boundaries on an unbounded, infinite stream and Watermark to ensure the datasets within the boundaries are complete. A reduce function can be applied on this group of data.
Windows are of different types, quite popular ones are Fixed windows and Sliding windows. Sessions are managed via little less popular windowing strategy called Session windows. Windowing is implemented as a two stage process,
- Assign windows - assign the event to a window
- Merge windows - group all the events that below to the same window
The two stage approach is required because assignment of windows could happen as elements are streaming in, but merging could happen before the data materialization happens. This is important esp. when we handle session windows where a new event can change the end time of the window.
For simplicity, we will be truncating the windows' boundaries to the nearest time unit (say, 1 minute windows will be truncated to 0th second). Truncating window time to the nearest boundary will help us do mapping with constant time without affecting the correctness, except for the very first materialization of result (e.g., we started at 9:00.11 and the result will be materialized at 9:01.00 and not at 9:01:11).
Windows may be either aligned (e.g., Fixed, Sliding), i.e. applied across all the data for the window of time in question, or unaligned, (e.g., Session) i.e. applied across only specific subsets of the data (e.g. per key) for the given window of time.
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type IntervalWindow ¶
type IntervalWindow struct { // Start is start time of the boundary which is inclusive. Start time.Time // End is the end time of the boundary and is exclusive. End time.Time }
IntervalWindow has the window boundary details.
type Option ¶ added in v0.6.1
func WithWindowDuration ¶ added in v0.6.1
WithWindowDuration sets the window duration
func WithWindowType ¶ added in v0.6.1
func WithWindowType(wt dfv1.WindowType) Option
WithWindowType sets the window type
type Options ¶ added in v0.6.1
type Options struct {
// contains filtered or unexported fields
}
func DefaultOptions ¶ added in v0.6.1
func DefaultOptions() *Options
type Windower ¶
type Windower interface { // AssignWindow assigns the event to the window based on give window configuration. AssignWindow(eventTime time.Time) []*IntervalWindow }
Windower assigns the element to zero or more windows.