twitter_kafka_connector

command module
v0.0.0-...-d1baf05 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Aug 17, 2019 License: GPL-3.0 Imports: 16 Imported by: 0

README

Twitter kafka connector

This is a small go application that allows you to stream tweets into a kafka cluster. You can use this to learn KSQL, Kafka Streams or any other streaming processing framework.

The application relies on the public stream api and thus has certain limitation. See status/filter limitations: https://developer.twitter.com/en/docs/tweets/filter-realtime/overview

Configuration

All settings can be set via a config.yml in /etc/kafka_twitter_connector, ./ or environment variables. Note that env vars have to be upper case.

Kafka settings
  • kafka_brokers: The kafka brokers, to which the tweets should be streamed. By default "localhost:9092"
  • schema_registry: The schema registry host. By default "localhost:8081
  • tweet_topic: The topic to which the should be streamed. By default "tweets"
Twitter settings

You will need to request a developer account and create an app to get the following oauth settings.

  • client_key: The client API key
  • client_secret: The client Secret key
  • access_token: The access token key
  • access_secret: The access secret key
Twitter filters

You use the following settings to filter the stream. Note that all filters are concatenated by OR, this is a limitation of the public streaming API. Example tracking "dog" and "cat" means all posts with containg "dog" or "cat" will be streamed to kafka.

You also need to set at least one filter in order for the stream to work.

If you use environment variables, use a space to separate multiple items, e.g. `export keyword="kitten cat"

  • keywords: Track the given keywords

Documentation

The Go Gopher

There is no documentation for this package.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL