apidAnalytics

v0.0.0-...-03e0658
Published: Jan 4, 2018 License: Apache-2.0 Imports: 17 Imported by: 0

README

ApidAnalytics

This is a core plugin for apid. It collects analytics data for runtime traffic from Micro and Enterprise Gateway and publishes it to Apigee.

Configuration
| name | description |
| --- | --- |
| apidanalytics_base_path | string. default: /analytics |
| apidanalytics_data_path | string. default: /ax |
| apidanalytics_collection_interval | int. seconds. default: 120 |
| apidanalytics_upload_interval | int. seconds. default: 5 |
| apidanalytics_uap_server_base | string. url. required. |
| apidanalytics_use_caching | boolean. default: true |
| apidanalytics_buffer_channel_size | int. number of slots. default: 100 |
| apidanalytics_cache_refresh_interval | int. seconds. default: 1800 |
Startup Procedure
  1. Initialize the crash recovery, upload, and buffering managers. Together they buffer analytics messages to local files and periodically upload those files to S3/GCS using a signed URL received from the uapCollectionEndpoint exposed via the edgex proxy
  2. Create a listener for Apigee-Sync event
    1. Each time a Snapshot is received, create an in-memory cache for data scope
    2. Each time a changeList is received, if data_scope info changed, then insert or delete the info for the changed scope in tenantCache
  3. Initialize the POST /analytics/{scope_uuid} and POST /analytics APIs
  4. Upon receiving requests
    1. Validate and enrich each batch of analytics records. If scope_uuid is given, then that is used to validate. If scope_uuid is not provided, then the payload should have organization and environment. The org/env is then used to validate the scope for this cluster.
    2. If valid, then publish records to an internal buffer channel
  5. Buffering Logic
    1. Buffering manager creates listener on the internal buffer channel and thus consumes messages as soon as they are put on the channel
    2. Based on the current timestamp, either an existing directory is used to save these messages or a new timestamp directory is created
    3. If a new directory is created, then an event will be published on the closeBucketEvent Channel at the expected directory closing time
    4. The messages are stored in a file under tmp/<timestamp_directory>
    5. At each collection interval, the routine listening on the closeBucketEvent channel closes the files in tmp and moves the directory to the staging directory
  6. Upload Manager
    1. The upload manager periodically checks the staging directory to look for new folders
    2. When a new folder arrives here, it means all files under it are closed and ready to be uploaded
    3. Tenant info is extracted from the directory name and the files are sequentially uploaded to S3/GCS
    4. Based on the upload status
      1. If the upload succeeds, the directory is deleted from staging and previously failed uploads are retried
      2. If the upload fails, it is retried 3 times before the directory is moved to the failed directory
  7. Crash Recovery is a one-time activity performed when the plugin starts, to cleanly handle files left open by a previous apid stop or crash
Exposed API
POST /analytics/{bundle_scope_uuid}
POST /analytics

Complete spec is listed in api.yaml
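
The difference between the two endpoints comes down to how the tenant is resolved during validation. The sketch below illustrates that rule under stated assumptions: the `tenant` and `scopeCache` types, the `resolve` method, and the sample values are hypothetical, and the real request and payload shapes are defined in api.yaml, not here.

```go
package main

import (
	"errors"
	"fmt"
)

// tenant is a hypothetical record of the organization and environment
// that a data scope belongs to.
type tenant struct {
	Org string
	Env string
}

// scopeCache is a hypothetical in-memory cache keyed by scope UUID.
type scopeCache map[string]tenant

// resolve applies the validation rule described in the startup procedure:
// a scope UUID, when given, is looked up directly; otherwise the payload
// must carry organization and environment, which must match a known scope.
func (c scopeCache) resolve(scopeUUID, org, env string) (tenant, error) {
	if scopeUUID != "" {
		t, ok := c[scopeUUID]
		if !ok {
			return tenant{}, errors.New("unknown scope_uuid")
		}
		return t, nil
	}
	if org == "" || env == "" {
		return tenant{}, errors.New("organization and environment are required without scope_uuid")
	}
	for _, t := range c {
		if t.Org == org && t.Env == env {
			return t, nil
		}
	}
	return tenant{}, errors.New("no scope for this organization and environment in this cluster")
}

func main() {
	cache := scopeCache{"scope-1": {Org: "acme", Env: "prod"}} // sample data

	// POST /analytics/{bundle_scope_uuid}: the path parameter identifies the scope.
	fmt.Println(cache.resolve("scope-1", "", ""))

	// POST /analytics: organization and environment come from the payload.
	fmt.Println(cache.resolve("", "acme", "prod"))
	fmt.Println(cache.resolve("", "acme", "test"))
}
```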
