Documentation ¶
Overview ¶
windowed_wordcount counts words in text, and can run over either unbounded or bounded input collections.
This example is the last in a series of four successively more detailed 'word count' examples. First take a look at minimal_wordcount, wordcount, and debugging_wordcount.
Basic concepts, also in the preceeding examples: Reading text files; counting a PCollection; writing to GCS; executing a Pipeline both locally and using a selected runner; defining DoFns; user-defined PTransforms; defining pipeline options.
New Concepts:
- Unbounded and bounded pipeline input modes
- Adding timestamps to data
- Windowing
- Re-using PTransforms over windowed PCollections
- Accessing the window of an element
Click to show internal directories.
Click to hide internal directories.