Title :
Implementation of a phoneme recognition system using zero-crossing and magnitude sum function
Author :
Manikandan, J. ; Venkataramani, B. ; Preeti, P. ; Sananda, G. ; Sadhana, K.V.
Author_Institution :
Dept. of ECE, Nat. Inst. of Technol., Trichy, India
Abstract :
A number of techniques have been proposed in the literature for phoneme based speech recognition system. In this paper, a technique for automatic phoneme recognition using zero-crossings (ZC) and magnitude sum function (MSF) is proposed. The number of zero-crossings and magnitude sum function per frame are extracted and a minimum distance classifier is proposed to recognize the phonemes in each frame with these features. In order to increase the recognition accuracy of phonemes, a finite state machine is also proposed. The performance of the proposed phoneme recognition system is evaluated using TTS database and compared with the system using Linear Predictive Coefficients(LPC) feature inputs. Phoneme recognition accuracies of 70.93% and 55.25% are obtained for the system using LPC and the one using ZC along with MSF respectively. However, using the finite state machine proposed in this paper, 100% recognition accuracy is obtained for both the techniques. The computational costs required for recognizing various sentences using both of the feature extraction techniques are evaluated. It is observed that the proposed technique requires about 9.3 times lower computational cost than the one using LPC. The proposed phoneme recognition system is also implemented on an Altera Cyclone II FPGA using Nios II soft-core processor and custom instructions. The custom instructions for floating point arithmetic and Minimum distance classifier provide an acceleration factor of 41 and 1.87 respectively. The technique proposed here is also applicable for speech inputs from other database.
Keywords :
field programmable gate arrays; linear predictive coding; pattern classification; speech recognition; Altera Cyclone II FPGA; Nios II; distance classifier; linear predictive coefficients; magnitude sum function; phoneme recognition system; soft-core processor; speech recognition system; zero-crossing; Automata; Computational efficiency; Cyclones; Feature extraction; Field programmable gate arrays; Floating-point arithmetic; Linear predictive coding; Spatial databases; Speech recognition; Speech synthesis; FPGA; LPC; Magnitude Sum function; Phoneme Recognition; Zero-crossings;
Conference_Titel :
TENCON 2009 - 2009 IEEE Region 10 Conference
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-4546-2
Electronic_ISBN :
978-1-4244-4547-9
DOI :
10.1109/TENCON.2009.5395954