Title :
Understanding and improving speech recognition performance through the use of diagnostic tools
Author :
Eide, Ellen ; Gish, Herbert ; Jeanrenaud, Philippe ; Mielke, Angela
Author_Institution :
BBN Syst. & Technol. Corp., Cambridge, MA, USA
Abstract :
The goal of this work is to highlight aspects of an experiment other than the word error rate. When a speech recognition experiment is performed, the word error rate provides no insight into the factors responsible for the recognition errors. We begin this paper by describing an experiment which contrasts the language of conversational speech with that of text from the Wall Street Journal. The remainder of the paper is devoted to the description of a more general approach to performance diagnosis which identifies significant sources of error in a given experiment. The technique is based on the use of binary classification trees; we refer to the results of our analyses as diagnostic trees. Beyond providing understanding, diagnostic trees allow for improvements in the performance of a recognizer through the use of feedback provided by quantifying confidence in the recognition
Keywords :
error analysis; feedback; speech recognition; trees (mathematics); Wall Street Journal; binary classification trees; conversational speech language; diagnostic tools; diagnostic trees; error sources; feedback; performance diagnosis; recognition confidence; recognition errors; speech recognition experiment; speech recognition performance; text; word error rate; Acoustic testing; Classification tree analysis; Databases; Decoding; Error analysis; Feedback; Length measurement; Microphones; Natural languages; Read only memory; Speech recognition; Switches;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479404