windowed_wordcount

command
v2.6.0-RC1+incompatible
Published: Jul 31, 2018 License: Apache-2.0, BSD-3-Clause, MIT Imports: 13 Imported by: 0

Documentation

Overview

windowed_wordcount counts words in text, and can run over either unbounded or bounded input collections.

This example is the last in a series of four successively more detailed 'word count' examples. First take a look at minimal_wordcount, wordcount, and debugging_wordcount.

Basic concepts, also covered in the preceding examples: reading text files; counting a PCollection; writing to GCS; executing a Pipeline both locally and using a selected runner; defining DoFns; user-defined PTransforms; defining pipeline options.

New Concepts:

  1. Unbounded and bounded pipeline input modes
  2. Adding timestamps to data
  3. Windowing
  4. Re-using PTransforms over windowed PCollections
  5. Accessing the window of an element
