DocumentCode :
3303133
Title :
Isolated uttered words recognition based on GMM/HMM algorithms using SoPC/Nios II processor build on Altera Cyclone II FPGA chip
Author :
Abbas, Eyad I. ; Refeis, Alaa Abdulhussain
Author_Institution :
Dept. of Electr. Eng., Univ. of Technol., Baghdad, Iraq
fYear :
2012
fDate :
7-8 Nov. 2012
Firstpage :
1
Lastpage :
8
Abstract :
This paper introduced an approach to design and implement an embedded SoPC (System on a Programmable Chip) technique with Altera Nios II processor for real-time speech recognition system by developing hardware/software with minimum usage of resources (hardware components) and relatively small size software to reduce memory utilization. This is achieved by using Mel Frequency Cepstral Coefficients (MFCCs) technique as speech signal feature extraction (observation vector). Using Gaussian Mixture Model (GMM) to model the observation vector of voice information. Finally, this model passed to the Hidden Markov Model (HMM) as probabilistic model to process the GMM statistically to make decision on utterance words recognition, whether a single or composite, one or more syllable words (i.e. one, six, target). The total framework was implemented on Altera Cyclone II EP2C70F896C6N FPGA chip sitting on ALTERA DE2-70 Development Board. The utility software which are used as tools for design and development hardware/software are Quartus II 11.0sp1 (32-Bit) and Nios II 11.0spl IDE/C++ respectively. Each word model (template) stored as Transition Matrix, Diagonal Covariance Matrices, and Mean Vectors in the system memory. Each word model utilizes only 4.45Kbytes regardless of the spoken word length. Accuracy of the recognition words (digit/0 to digit/10) given 100% for the individual speaker. Training and recognition software has size (code + initialized data) equal to 312Kbytes.
Keywords :
Gaussian processes; cepstral analysis; covariance matrices; embedded systems; feature extraction; field programmable gate arrays; hidden Markov models; mixture models; speech recognition; system-on-chip; utility programs; ALTERA DE2-70 development board; Altera Cyclone-II EP2C70F896C6N FPGA chip; Altera Nios-II processor; C++ language; GMM algorithm; Gaussian mixture model; HMM algorithm; IDE; MFCC technique; Nios-II-11.0sp1; Quartus-II-11.0sp1; decision making; diagonal covariance matrices; embedded SoPC technique; hardware components; hardware-software design; hardware-software development; hidden Markov model; isolated uttered word recognition; mean vectors; mel frequency cepstral coefficient technique; memory utilization reduction; minimum resource usage; observation vector model; probabilistic model; real-time speech recognition system; recognition software; speech signal feature extraction; spoken word length; statistical processing; syllable words; system memory; system-on-a-programmable chip technique; templates; training software; transition matrix; utility software; voice information; word model; Feature extraction; Filter banks; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Vectors; Gaussian Mixture Model (GMM); Hidden Markov Model (HMM); Mel Frequency Cepstral Coefficients (MFCC); Nios II processor; System on a Programmable Chip (SoPC);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering Sciences (FNCES), 2012 First National Conference for
Conference_Location :
Baghdad
Print_ISBN :
978-1-4673-5033-4
Type :
conf
DOI :
10.1109/NCES.2012.6740477
Filename :
6740477
Link To Document :
بازگشت