DocumentCode
2704614
Title
Language Recognition with Word Lattices and Support Vector Machines
Author
Campbell, W.M. ; Richardson, F. ; Reynolds, Douglas A.
Author_Institution
MIT Lincoln Lab., Lexington, MA, USA
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
Language recognition is typically performed with methods that exploit phonotactics - a phone recognition language modeling (PRLM) system. A PRLM system converts speech to a lattice of phones and then scores a language model. A standard extension to this scheme is to use multiple parallel phone recognizers (PPRLM). In this paper, we modify this approach in two distinct ways. First, we replace the phone tokenizer by a powerful speech-to-text system. Second, we use a discriminative support vector machine for language modeling. Our goals are twofold. First, we explore the ability of a single speech-to-text system to distinguish multiple languages. Second, we fuse the new system with an SVM PRLM system to see if it complements current approaches. Experiments on the 2005 NIST language recognition corpus show the new word system accomplishes these goals and has significant potential for language recognition.
Keywords
natural language processing; speech processing; speech recognition; language recognition; multiple parallel phone recognizers; phone recognition language modeling; phone tokenizer; phonotactics; speech-to-text system; support vector machines; word lattices; Decoding; Lattices; NIST; Natural languages; Power system modeling; Speaker recognition; Speech processing; Speech recognition; Support vector machines; Target recognition; natural languages; speech processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.367238
Filename
4218269
Link To Document