DocumentCode :
2752356
Title :
Using neural networks and LPCC to improve speech recognition
Author :
Zbancioc, Marius ; Costin, Madalin
Volume :
2
fYear :
2003
fDate :
0-0 2003
Firstpage :
445
Abstract :
Linear Predictive Coding (LPC), powerful speech analysis technique, is very useful for encoding speech at a low bit rate and provides extremely accurate estimates of speech parameters - based on the assumption that speech signal is produced by a buzzer at the end of the tube (the glottis produces the buzz, characterized by its intensity and frequency, and the vocal tract forms the tube, characterized by resonance frequencies (formants) according to Calliope(1989), is very efficient for the vocalic areas. The model is less efficient for transient, unvowel or not stationary regions according to R. Lawrence and B. Hwang Juang (1993). A Radial Basis Function network is able to recognize in a satisfying percent a set of phonemes pronounced by different speakers, using LPC sets as input.
Keywords :
linear predictive coding; radial basis function networks; speech recognition; LPCC; Levinson-Durbin recursion; linear predictive coding; neural networks; radial basis function network; resonance frequencies; speech analysis technique; speech encoding; speech parameters; speech recognition; speech signal; vocal tract;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on
Print_ISBN :
0-7803-7979-9
Type :
conf
DOI :
10.1109/SCS.2003.1227085
Filename :
5731318
Link To Document :
بازگشت