timit

package
v0.9.17
Warning

This package is not in the latest version of its module.

Published: Jan 8, 2022 License: BSD-3-Clause Imports: 6 Imported by: 0

Documentation

Overview

Package timit provides the phones of the TIMIT database. For recognition testing, the full set of 61 phones is typically reduced to 39 by folding confusable sounds into a single group, e.g. "sh" and "zh". See "Speaker-Independent Phone Recognition Using Hidden Markov Models" by Kai-Fu Lee and Hsiao-Wuen Hon, IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 37, 1989, for the original set and the collapsing to 39 phones. Many later studies use the 39-phone set.

Constants

This section is empty.

Variables

var PhoneCats = []string{"iy", "ih", "eh", "ae", "ix", "ah", "uw", "uh", "ao", "ey",
	"ay", "oy", "aw", "ow", "l", "r", "y", "w", "er", "m", "n", "ng",
	"ch", "jh", "dh", "b", "d", "dx", "g", "p", "t", "k", "z", "zh", "v", "f", "th", "s",
	"hh", "pcl", "q"}

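PhoneCats lists one label per reduced phone category. PhoneCats and the Phones map must maintain the same order: the value that Phones assigns to a category label is that label's index in PhoneCats.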

var PhoneList = []string{"iy", "ih", "eh", "ae", "ix", "ah", "ax", "ax-h", "uw", "ux", "uh", "ao", "aa", "ey",
	"ay", "oy", "aw", "ow", "l", "el", "r", "y", "w", "er", "axr", "m", "em", "n", "nx", "en", "ng",
	"eng", "ch", "jh", "dh", "b", "d", "dx", "g", "p", "t", "k", "z", "zh", "v", "f", "th", "s", "sh",
	"hh", "hv", "cl", "pcl", "tcl", "kcl", "qcl", "vcl", "bcl", "dcl", "gcl", "epi", "sil", "h#", "#h", "pau"}

PhoneList is the full list of phones. Some phones get folded together and the reduced set is the PhoneCats variable.

var Phones = map[string]int{
	"iy":   0,
	"ih":   1,
	"eh":   2,
	"ae":   3,
	"ix":   4,
	"ah":   5,
	"ax":   5,
	"ax-h": 5,
	"uw":   6,
	"ux":   6,
	"uh":   7,
	"ao":   8,
	"aa":   8,
	"ey":   9,
	"ay":   10,
	"oy":   11,
	"aw":   12,
	"ow":   13,
	"l":    14,
	"el":   14,
	"r":    15,
	"y":    16,
	"w":    17,
	"er":   18,
	"axr":  18,
	"m":    19,
	"em":   19,
	"n":    20,
	"nx":   20,
	"en":   20,
	"ng":   21,
	"eng":  21,
	"ch":   22,
	"jh":   23,
	"dh":   24,
	"b":    25,
	"d":    26,
	"dx":   27,
	"g":    28,
	"p":    29,
	"t":    30,
	"k":    31,
	"z":    32,
	"zh":   33,
	"sh":   33,
	"v":    34,
	"f":    35,
	"th":   36,
	"s":    37,
	"hh":   38,
	"hv":   38,
	"pcl":  39,
	"tcl":  39,
	"kcl":  39,
	"bcl":  39,
	"dcl":  39,
	"gcl":  39,
	"h#":   39,
	"pau":  39,
	"epi":  39,
	"q":    40,
}
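The ordering requirement between PhoneCats and Phones can be checked mechanically. The following is a minimal sketch, not code from this package: `checkOrder` is a hypothetical helper, and the tables in `main` are only the first six categories copied from the listings above, together with the phones that fold into them.

```go
package main

import "fmt"

// checkOrder reports an error unless every label in cats maps, through
// phones, back to its own position in cats, and every value in phones is a
// valid index into cats.
func checkOrder(cats []string, phones map[string]int) error {
	for i, c := range cats {
		idx, ok := phones[c]
		if !ok {
			return fmt.Errorf("category %q is missing from the map", c)
		}
		if idx != i {
			return fmt.Errorf("category %q is at index %d but maps to %d", c, i, idx)
		}
	}
	for p, idx := range phones {
		if idx < 0 || idx >= len(cats) {
			return fmt.Errorf("phone %q maps to out-of-range index %d", p, idx)
		}
	}
	return nil
}

func main() {
	// Excerpt only: the first six categories and their folded phones.
	cats := []string{"iy", "ih", "eh", "ae", "ix", "ah"}
	phones := map[string]int{
		"iy": 0, "ih": 1, "eh": 2, "ae": 3, "ix": 4,
		"ah": 5, "ax": 5, "ax-h": 5,
	}
	if err := checkOrder(cats, phones); err != nil {
		fmt.Println("order mismatch:", err)
		return
	}
	fmt.Println("tables are consistent")
}
```

A check like this could run in a unit test so that a reordered slice or a mistyped index is caught before lookups silently return the wrong category.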

Functions

func IdxFmSnd

func IdxFmSnd(s string, id string) (v int, ok bool)

IdxFmSnd returns the slice index of the sound s, if found. id is ignored if the corpus doesn't have subsets of sounds.
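The function body is not shown on this page, so the following is only a sketch of how a lookup with this signature might behave, assuming it is a plain lookup in the Phones map. `idxFromSound` is a hypothetical name, and `phones` is a short excerpt of the Phones map rather than the full table.

```go
package main

import "fmt"

// phones is an excerpt of the Phones map documented above.
var phones = map[string]int{"zh": 33, "sh": 33, "hh": 38, "hv": 38, "q": 40}

// idxFromSound mirrors the documented signature of IdxFmSnd. The id
// argument is accepted but unused, on the assumption that TIMIT has no
// subsets of sounds.
func idxFromSound(s string, id string) (v int, ok bool) {
	v, ok = phones[s]
	return v, ok
}

func main() {
	for _, s := range []string{"sh", "zh", "hv", "xx"} {
		idx, ok := idxFromSound(s, "")
		fmt.Println(s, idx, ok)
	}
}
```

Returning `ok` separately matters here because 0 is a valid category index ("iy" in the full map), so the zero value alone cannot signal a missing phone.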

func LoadTimes

func LoadTimes(fn string, names []string) ([]speech.SpeechUnit, error)

LoadTimes loads both the timing and the transcription data for TIMIT files, so the names slice is unused.

func LoadTranscription

func LoadTranscription(fn string) ([]string, error)

LoadTranscription is a no-op for TIMIT; LoadTimes does the work of both.

func SndFmIdx

func SndFmIdx(idx int, id string) (phone string, ok bool)

SndFmIdx returns the sound for index idx, if found in the map of sounds of the corpus. id is ignored if the corpus doesn't have subsets of sounds.
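As a sketch of the reverse direction, the example below resolves an index to a category label with a bounds-checked slice access. This is an assumption about behavior, not the package's actual implementation, which is not shown on this page. `sndFromIdx` is a hypothetical name; `phoneCats` is copied from the PhoneCats listing above, which has 41 entries.

```go
package main

import "fmt"

// phoneCats is copied from the PhoneCats listing above.
var phoneCats = []string{"iy", "ih", "eh", "ae", "ix", "ah", "uw", "uh", "ao", "ey",
	"ay", "oy", "aw", "ow", "l", "r", "y", "w", "er", "m", "n", "ng",
	"ch", "jh", "dh", "b", "d", "dx", "g", "p", "t", "k", "z", "zh", "v", "f", "th", "s",
	"hh", "pcl", "q"}

// sndFromIdx mirrors the documented signature of SndFmIdx. The id argument
// is accepted but unused, on the assumption that TIMIT has no subsets of
// sounds.
func sndFromIdx(idx int, id string) (phone string, ok bool) {
	if idx < 0 || idx >= len(phoneCats) {
		return "", false
	}
	return phoneCats[idx], true
}

func main() {
	// 33 is the value the Phones map gives to both "zh" and "sh".
	phone, ok := sndFromIdx(33, "")
	fmt.Println(phone, ok) // zh true

	_, ok = sndFromIdx(len(phoneCats), "")
	fmt.Println(ok) // false
}
```

Under this sketch the mapping is lossy by design: a phone such as "sh" goes to index 33, but index 33 comes back as "zh", the representative label of the folded group.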

Types

This section is empty.
