Investigating stochastic speech understanding

Author

Bonneau-Maynard, Héléne ; Lefevre, Francois

Author_Institution

Lab. d´´Informatique pour la Mecanique et les Sci. de l´´Ingenieur, CNRS, Orsay, France

fYear

2001

fDate

2001

Firstpage

260

Lastpage

263

Abstract

The need for human expertise in the development of a speech understanding system can be greatly reduced by the use of stochastic techniques. However corpus-based techniques require the annotation of large amounts of training data. Manual semantic annotation of such corpora is tedious, expensive, and subject to inconsistencies. This work investigates the influence of the training corpus size on the performance of the understanding module. The use of automatically annotated data is also investigated as a means to increase the corpus size at a very low cost. First, a stochastic speech understanding model developed using data collected with the LIMSI ARISE dialog system is presented. Its performance is shown to be comparable to that of the rule-based caseframe grammar currently used in the system. In a second step, two ways of reducing the development cost are pursued: (1) reducing of the amount of manually annotated data used to train the stochastic models and (2) using automatically annotated data in the training process.

Keywords

interactive systems; natural language interfaces; speech recognition; speech-based user interfaces; stochastic processes; LIMSI ARISE dialog system; automatically annotated data; development cost reduction; performance; speech understanding system; training corpus size; Costs; Data mining; Humans; Natural languages; Performance evaluation; Speech analysis; Stochastic processes; Stochastic systems; Telephony; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on

Print_ISBN

0-7803-7343-X

Type

conf

DOI

10.1109/ASRU.2001.1034637

Filename

1034637