DocumentCode
3294410
Title
Language identification using shifted delta cepstra
Author
Kohler, M.A. ; Kennedy, M.
Author_Institution
Dept. of Defense, Fort Meade, MD, USA
Volume
3
fYear
2002
fDate
4-7 Aug. 2002
Abstract
A variety of speech identification technologies currently use Gaussian mixture models. Until recently, however, they were considered inferior to parallel phone recognition language modeling for identifying the language of a speaker. Experiments in the last year have shown that Gaussian mixture models can provide high performance language identification when shifted delta cepstra are used as the feature set. Not only can they achieve comparative or even superior performance to parallel phone recognition language modeling, Gaussian mixture models also require less computation. Performance can be further improved by altering the shifted delta cepstra parameters and the number of mixtures. The optimal parameter set varies depending on the languages to be identified. This paper describes a method for finding the optimal parameters for identifying a set of languages, specifies these parameters for a language identification task, and provides a performance comparison.
Keywords
Gaussian distribution; speech recognition; Gaussian mixture models; language identification; optimal parameter set; optimal parameters; parallel phone recognition language modeling; shifted delta cepstra; speech identification technologies; Algorithm design and analysis; Cepstral analysis; Concurrent computing; Humans; Natural languages; Speech processing; Speech recognition; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2002. MWSCAS-2002. The 2002 45th Midwest Symposium on
Print_ISBN
0-7803-7523-8
Type
conf
DOI
10.1109/MWSCAS.2002.1186972
Filename
1186972
Link To Document