DocumentCode :
1255572
Title :
Speech recognition
Author :
Alotaibi, Yousef A. ; Shahsavari, M.M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Florida Inst. of Technol., Melbourne, FL, USA
Volume :
17
Issue :
1
fYear :
1998
Firstpage :
23
Lastpage :
28
Abstract :
The authors have designed, successfully trained and tested an Arabic speech recognition system. This system was implemented using C++ programming language on Windows 95. It can be partitioned into five main modules. These modules are the front-end, feature extraction, training, pattern recognition and decision making and display. The front-end module functions as signal preparation and calibration. This includes: setting the signal sampling rate, removing the DC component from the signal, setting the scaling factor of the signal and detecting the endpoints of the utterance. The endpoint task removes the non-speech signal portions created by the speaker´s pauses. This reduces the system computation time needed and the memory requirements. The feature extraction module is mainly a digital signal processor. The training module is the one that finds the best templates for every word or sound (phonemes) in the system´s database. In short, this module needs to be executed only one time before users can utilize the system. The next module is the pattern recognition module. Its function is to compare the given utterance (test utterance) to all the stored templates (the reference module). The decision and display module functions as an interface between the user and the hidden system modules. In other words, after getting the recognition module results, this module displays the best candidate(s) and/or their likelihood percentage. The error rates are computed and displayed in this module
Keywords :
feature extraction; modules; natural languages; pattern recognition; speech processing; speech recognition; Arabic speech recognition system; C++ programming language; Windows 95; decision and display module; digital signal processor; distance measure; error rates; feature extraction module; front-end module; pattern recognition module; reference module; scaling factor; signal preparation; signal sampling rate; speech recognition; stored templates; system architecture; test utterance; training module; utterance endpoints detection; Calibration; Computer languages; Decision making; Displays; Feature extraction; Pattern recognition; Signal detection; Signal sampling; Speech recognition; System testing;
fLanguage :
English
Journal_Title :
Potentials, IEEE
Publisher :
ieee
ISSN :
0278-6648
Type :
jour
DOI :
10.1109/45.652853
Filename :
652853
Link To Document :
بازگشت