Title :
Development of the LEMS speech recognizer: improving performance using feature-sets
Author :
Mashao, Daniel J.
Author_Institution :
Div. of Eng., Brown Univ., Providence, RI, USA
Abstract :
This paper discusses the work performed at the Laboratory of Engineering Man-Machine Systems (LEMS) to build a state-of-the-art speech recognizer. In a period of about four years the performance of the speech recognizer has been improved from 83% to 92%, representing a 53% reduction in error rate. These performance gains were obtained by improving the feature-set algorithms and the HMM models. The main change in the feature-set was switching from the once popular LPC-based methods to a novel DFT-based method. The parameterized DFT-based method improved performance and confirmed what has been generally accepted that mel-scale warping is superior for machine speech recognition. Performance gains were also achieved by using the semi-continuous HMM model instead of the fast discrete HMM system. This change appears to offer a fixed 2-3% recognition rate improvement
Keywords :
discrete Fourier transforms; feature extraction; hidden Markov models; speech recognition; HMM models; LEMS speech recognizer; Laboratory of Engineering Man-Machine Systems; error rate reduction; feature-set algorithms; machine speech recognition; mel-scale warping; parameterized DFT-based method; performance; recognition rate; semi-continuous HMM model; Databases; Error analysis; Hidden Markov models; Laboratories; Man machine systems; Performance gain; Signal design; Speech recognition; Systems engineering and theory; Vocabulary;
Conference_Titel :
Communications and Signal Processing, 1997. COMSIG '97., Proceedings of the 1997 South African Symposium on
Conference_Location :
Grahamstown
Print_ISBN :
0-7803-4173-2
DOI :
10.1109/COMSIG.1997.630001