مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

1255572

Title :

Speech recognition

Author :

Alotaibi, Yousef A. ; Shahsavari, M.M.

Author_Institution :

Dept. of Electr. & Comput. Eng., Florida Inst. of Technol., Melbourne, FL, USA

Volume :

Issue :

fYear :

1998

Firstpage :

Lastpage :

Abstract :

The authors have designed, successfully trained and tested an Arabic speech recognition system. This system was implemented using C++ programming language on Windows 95. It can be partitioned into five main modules. These modules are the front-end, feature extraction, training, pattern recognition and decision making and display. The front-end module functions as signal preparation and calibration. This includes: setting the signal sampling rate, removing the DC component from the signal, setting the scaling factor of the signal and detecting the endpoints of the utterance. The endpoint task removes the non-speech signal portions created by the speaker´s pauses. This reduces the system computation time needed and the memory requirements. The feature extraction module is mainly a digital signal processor. The training module is the one that finds the best templates for every word or sound (phonemes) in the system´s database. In short, this module needs to be executed only one time before users can utilize the system. The next module is the pattern recognition module. Its function is to compare the given utterance (test utterance) to all the stored templates (the reference module). The decision and display module functions as an interface between the user and the hidden system modules. In other words, after getting the recognition module results, this module displays the best candidate(s) and/or their likelihood percentage. The error rates are computed and displayed in this module

Keywords :

feature extraction; modules; natural languages; pattern recognition; speech processing; speech recognition; Arabic speech recognition system; C++ programming language; Windows 95; decision and display module; digital signal processor; distance measure; error rates; feature extraction module; front-end module; pattern recognition module; reference module; scaling factor; signal preparation; signal sampling rate; speech recognition; stored templates; system architecture; test utterance; training module; utterance endpoints detection; Calibration; Computer languages; Decision making; Displays; Feature extraction; Pattern recognition; Signal detection; Signal sampling; Speech recognition; System testing;

fLanguage :

English

Journal_Title :

Potentials, IEEE

Publisher :

ieee

ISSN :

0278-6648

Type :

jour

DOI :

10.1109/45.652853

Filename :

652853

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1255572