DocumentCode
3312319
Title
Development of the LEMS speech recognizer: improving performance using feature-sets
Author
Mashao, Daniel J.
Author_Institution
Div. of Eng., Brown Univ., Providence, RI, USA
fYear
1997
fDate
9-10 Sep 1997
Firstpage
157
Lastpage
160
Abstract
This paper discusses the work performed at the Laboratory of Engineering Man-Machine Systems (LEMS) to build a state-of-the-art speech recognizer. In a period of about four years the performance of the speech recognizer has been improved from 83% to 92%, representing a 53% reduction in error rate. These performance gains were obtained by improving the feature-set algorithms and the HMM models. The main change in the feature-set was switching from the once popular LPC-based methods to a novel DFT-based method. The parameterized DFT-based method improved performance and confirmed what has been generally accepted that mel-scale warping is superior for machine speech recognition. Performance gains were also achieved by using the semi-continuous HMM model instead of the fast discrete HMM system. This change appears to offer a fixed 2-3% recognition rate improvement
Keywords
discrete Fourier transforms; feature extraction; hidden Markov models; speech recognition; HMM models; LEMS speech recognizer; Laboratory of Engineering Man-Machine Systems; error rate reduction; feature-set algorithms; machine speech recognition; mel-scale warping; parameterized DFT-based method; performance; recognition rate; semi-continuous HMM model; Databases; Error analysis; Hidden Markov models; Laboratories; Man machine systems; Performance gain; Signal design; Speech recognition; Systems engineering and theory; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications and Signal Processing, 1997. COMSIG '97., Proceedings of the 1997 South African Symposium on
Conference_Location
Grahamstown
Print_ISBN
0-7803-4173-2
Type
conf
DOI
10.1109/COMSIG.1997.630001
Filename
630001
Link To Document