cdf

command module
v0.0.0-...-2ce81c6 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Oct 16, 2017 License: MIT Imports: 13 Imported by: 0

README

College de France automated audio transcripts

Worker and elasticsearch for automated College de France audio transcripts

Build Status GoDoc Go Report Card

Worker

The worker periodically polls datastore for scheduled transcriptions, if any it downloads the mp3 files from the College de France website, converts them to FLAC, stores them in a Google Storage bucket, sends a Speech to Text request, stores the transcription in the same storage bucket, and index the transcripts in an elasticsearch instance running in the same Kubernetes cluster.

A periodic job also runs to compute overall statistics about the transcriptions due to limitations of the datastore in this regard.

Elasticsearch

Elasticsearch runs as a single (thus "yellow") master&data node in a Kubernetes cluster, it does full text indexing of the transcripts using the French analyzer.

Documentation

The Go Gopher

There is no documentation for this package.

Directories

Path Synopsis
Package main computes stats about lessons in datastore and saves them back in it.
Package main computes stats about lessons in datastore and saves them back in it.
io

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL