DocumentCode
2265475
Title
Dictionary learning for spontaneous speech recognition
Author
Slobada, T. ; Waibel, Alex
Author_Institution
Univ. of Karlsruhe, Germany
Volume
4
fYear
1996
fDate
3-6 Oct 1996
Firstpage
2328
Abstract
Spontaneous speech adds a variety of phenomena to a speech recognition task: false starts, human and nonhuman noises, new words, and alternative pronunciations. All of these phenomena have to be tackled when adapting a speech recognition system for spontaneous speech. We focus on how to automatically expand and adapt phonetic dictionaries for spontaneous speech recognition. Especially for spontaneous speech it is important to choose the pronunciations of a word according to the frequency in which they appear in the database rather than the “correct” pronunciation as might be found in a lexicon. Therefore, we proposed a data driven approach to add new pronunciations to a given phonetic dictionary (T. Slobada, 1995) in a way that they model the given occurrences of words in the database. We show how this algorithm can be extended to produce alternative pronunciations for word tuples and frequently misrecognized words. We also discuss how further knowledge can be incorporated into the phoneme recognizer in a way that it learns to generalize from pronunciations which were found previously. The experiments have been performed on the German Spontaneous Scheduling Task (GSST), using the speech recognition engine of JANUS 2, the spontaneous speech to speech translation system of the Interactive Systems Laboratories at Carnegie Mellon and Karlsruhe University (A. Waibel et al., 1996; M. Woszcyna et al., 1994)
Keywords
database management systems; glossaries; language translation; natural languages; speech processing; speech recognition; word processing; German Spontaneous Scheduling Task; JANUS 2; data driven approach; database; dictionary learning; misrecognized words; phoneme recognizer; phonetic dictionaries; pronunciations; speech recognition engine; speech recognition system; speech recognition task; spontaneous speech recognition; spontaneous speech to speech translation system; word tuples; Acoustics; Databases; Dictionaries; Engines; Frequency; Humans; Interactive systems; Laboratories; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607274
Filename
607274
Link To Document