sdk

package module
v1.6.2 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Nov 7, 2024 License: MIT Imports: 10 Imported by: 1

README

Deepgram Go SDK

Discord

Official Go SDK for Deepgram. Start building with our powerful transcription & speech understanding API.

SDK Documentation

This SDK implements the Deepgram API found at https://developers.deepgram.com.

Documentation for specifics about the structs, interfaces, and functions of this SDK can be found here: Go SDK Documentation

For documentation relating to Speech-to-Text from Live/Streaming Audio:

For documentation relating to Speech-to-Text (and Intelligence) from PreRecorded Audio:

For documentation relating to Text-to-Speech:

For documentation relating to Text Intelligence:

For documentation relating to Manage API:

Getting an API Key

🔑 To access the Deepgram API you will need a free Deepgram API Key.

Installation

To incorporate this SDK into your project's go.mod file, run the following command from your repo:

go get github.com/deepgram/deepgram-go-sdk

Requirements

Go (version ^1.19)

Quickstarts

This SDK aims to reduce complexity and abtract/hide some internal Deepgram details that clients shouldn't need to know about. However you can still tweak options and settings if you need.

Speech-to-Text from Live/Streaming Audio Quickstart

You can find a walkthrough on our documentation site. Transcribing Live Audio can be done using the following sample code:

// options
transcriptOptions := &interfaces.LiveTranscriptionOptions{
    Language:    "en-US",
    Punctuate:   true,
    Encoding:    "linear16",
    Channels:    1,
    Sample_rate: 16000,
}

// create the client
dgClient, err := client.NewWebSocketWithDefaults(ctx, transcriptOptions, callback)
if err != nil {
    log.Println("ERROR creating LiveTranscription connection:", err)
    os.Exit(1)
}

// call connect!
wsconn := dgClient.Connect()
if wsconn == nil {
    log.Println("Client.Connect failed")
    os.Exit(1)
}
Speech-to-Text from PreRecorded Audio Quickstart

You can find a walkthrough on our documentation site. Transcribing Pre-Recorded Audio can be done using the following sample code:

// context
ctx := context.Background()

//client
c := client.NewRESTWithDefaults()
dg := prerecorded.New(c)

// transcription options
options := &interfaces.PreRecordedTranscriptionOptions{
    Punctuate:  true,
    Diarize:    true,
    Language:   "en-US",
}

// send URL
URL := "https://my-domain.com/files/my-conversation.mp3"
res, err := dg.FromURL(ctx, URL, options)
if err != nil {
    log.Fatalf("FromURL failed. Err: %v\n", err)
    os.Exit(1)
}
Text-to-Speech WebSocket Quickstart

You can find a walkthrough on our documentation site. Transcribing Live Audio can be done using the following sample code:

// set the TTS options
ttsOptions := &interfaces.SpeakOptions{
    Model: "aura-asteria-en",
}

// create the callback
callback := MyCallback{}

// create a new stream using the NewStream function
dgClient, err := speak.NewWebSocketWithDefaults(ctx, ttsOptions, callback)
if err != nil {
    fmt.Println("ERROR creating TTS connection:", err)
    os.Exit(1)
}

// connect the websocket to Deepgram
bConnected := dgClient.Connect()
if !bConnected {
    fmt.Println("Client.Connect failed")
    os.Exit(1)
}
Text-to-Speech REST Quickstart

You can find a walkthrough on our documentation site. Transcribing Live Audio can be done using the following sample code:

// set the Transcription options
options := &interfaces.SpeakOptions{
    Model: "aura-asteria-en",
}

// create a Deepgram client
c := client.NewRESTWithDefaults()
dg := api.New(c)

// send/process file to Deepgram
res, err := dg.ToSave(ctx, "Hello, World!", textToSpeech, options)
if err != nil {
    fmt.Printf("FromStream failed. Err: %v\n", err)
    os.Exit(1)
}

Examples

There are examples for *every- API call in this SDK. You can find all of these examples in the examples folder at the root of this repo.

These examples provide:

Speech-to-Text - Live Audio / WebSocket:

Speech-to-Text - PreRecorded / REST:

Speech-to-Text - Live Audio:

Text-to-Speech - WebSocket

Text-to-Speech - REST

Management API exercise the full CRUD operations for:

To run each example set the DEEPGRAM_API_KEY as an environment variable, then cd into each example folder and execute the example: go run main.go.

Logging

This SDK provides logging as a means to troubleshoot and debug issues encountered. By default, this SDK will enable Information level messages and higher (ie Warning, Error, etc) when you initialize the library as follows:

client.InitWithDefault();

To increase the logging output/verbosity for debug or troubleshooting purposes, you can set the TRACE level but using this code:

// init library
client.Init(client.InitLib{
    LogLevel: client.LogLevelTrace,
})

Testing

TBD

Backwards Compatibility

Older SDK versions will receive Priority 1 (P1) bug support only. Security issues, both in our code and dependencies, are promptly addressed. Significant bugs without clear workarounds are also given priority attention.

Development and Contributing

Interested in contributing? We ❤️ pull requests!

To make sure our community is safe for all, be sure to review and agree to our Code of Conduct. Then see the Contribution guidelines for more information.

Getting Help

We love to hear from you so if you have questions, comments or find a bug in the project, let us know! You can either:

Documentation

The Go Gopher

There is no documentation for this package.

Directories

Path Synopsis
examples
pkg
api/analyze/v1
This package defines the Analyze API for Deepgram
This package defines the Analyze API for Deepgram
api/listen/v1/rest
This package defines the Pre-recorded API for Deepgram
This package defines the Pre-recorded API for Deepgram
api/listen/v1/rest/interfaces
This package provides the types for the Deepgram PreRecorded API.
This package provides the types for the Deepgram PreRecorded API.
api/listen/v1/websocket/interfaces
This package defines interfaces for the live API
This package defines interfaces for the live API
api/live/v1
*********** WARNING *********** This package provides the Live API
*********** WARNING *********** This package provides the Live API
api/live/v1/interfaces
*********** WARNING *********** This package defines interfaces for the live API
*********** WARNING *********** This package defines interfaces for the live API
api/manage/v1
Balances API: https://developers.deepgram.com/reference/get-all-balances https://developers.deepgram.com/reference/get-balance
Balances API: https://developers.deepgram.com/reference/get-all-balances https://developers.deepgram.com/reference/get-balance
api/manage/v1/interfaces
This package provides the types for the Deepgram Manage API.
This package provides the types for the Deepgram Manage API.
api/prerecorded/v1
*********** WARNING *********** This package provides the PreRecorded API
*********** WARNING *********** This package provides the PreRecorded API
api/prerecorded/v1/interfaces
This package provides the types for the Deepgram PreRecorded API.
This package provides the types for the Deepgram PreRecorded API.
api/speak/v1
*********** WARNING *********** This package provides the API for Speak on a REST interface
*********** WARNING *********** This package provides the API for Speak on a REST interface
api/speak/v1/rest
This package defines the Speak REST API for Deepgram
This package defines the Speak REST API for Deepgram
api/speak/v1/websocket/interfaces
This package defines interfaces for the live API
This package defines interfaces for the live API
api/version
This package handles the versioning in the API for the various clients (prerecorded, live, etc.)
This package handles the versioning in the API for the various clients (prerecorded, live, etc.)
audio/common
Implementation of a common functions used in portaudio
Implementation of a common functions used in portaudio
audio/microphone
Implementation of a microphone using portaudio
Implementation of a microphone using portaudio
audio/microphone/interfaces
Microphone defines a interface for a Microphone implementation
Microphone defines a interface for a Microphone implementation
audio/replay
Implementation for a replay device.
Implementation for a replay device.
client/analyze
This package points to the latest version of the analyze/read client
This package points to the latest version of the analyze/read client
client/analyze/v1
This package provides the analyze/read client implementation for the Deepgram API
This package provides the analyze/read client implementation for the Deepgram API
client/common/v1
This package provides the live/streaming client implementation for the Deepgram API
This package provides the live/streaming client implementation for the Deepgram API
client/common/v1/interfaces
This package defines interfaces for the live API
This package defines interfaces for the live API
client/interfaces/v1
This package contains the interface to manage the prerecorded and live/stream interfaces for the Deepgram API
This package contains the interface to manage the prerecorded and live/stream interfaces for the Deepgram API
client/listen
This package provides the prerecorded client implementation for the Deepgram API
This package provides the prerecorded client implementation for the Deepgram API
client/listen/v1/rest
This package provides the prerecorded client implementation for the Deepgram API
This package provides the prerecorded client implementation for the Deepgram API
client/listen/v1/websocket
This package provides the live/streaming client implementation for the Deepgram API
This package provides the live/streaming client implementation for the Deepgram API
client/live
*********** WARNING *********** This package provides the live client implementation for the Deepgram API
*********** WARNING *********** This package provides the live client implementation for the Deepgram API
client/manage
This package contains the code for the Keys APIs in the Deepgram Manage API
This package contains the code for the Keys APIs in the Deepgram Manage API
client/manage/v1
This package contains the code for the Keys APIs in the Deepgram Manage API
This package contains the code for the Keys APIs in the Deepgram Manage API
client/prerecorded
*********** WARNING *********** This package provides the prerecorded client implementation for the Deepgram API
*********** WARNING *********** This package provides the prerecorded client implementation for the Deepgram API
client/rest
*********** WARNING *********** This package provides a generic reusable REST client
*********** WARNING *********** This package provides a generic reusable REST client
client/rest/v1
This package implements a reusable REST client
This package implements a reusable REST client
client/rest/v1/debug
This package provides API debugging at the REST/HTTP level
This package provides API debugging at the REST/HTTP level
client/speak
This package provides the speak client implementation for the Deepgram API
This package provides the speak client implementation for the Deepgram API
client/speak/v1/rest
This package provides the speak client implementation for the Deepgram API
This package provides the speak client implementation for the Deepgram API
client/speak/v1/websocket
This package provides the speak/streaming client implementation for the Deepgram API
This package provides the speak/streaming client implementation for the Deepgram API
common
This package contains the initialization code for the Deepgram Go SDK
This package contains the initialization code for the Deepgram Go SDK
tests

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL