DocumentCode
430843
Title
Spoken digits recognition by subspace decomposition method
Author
Kusakari, K. ; Kurihara, K. ; Murakami, T. ; Ishida, Y.
Author_Institution
Department of Electronics and Communication, Meiji University, 1-1-1 Higashimita, Tama-ku, Kawasaki-shi, Kanagawa, 214-8571, Japan
Volume
A
fYear
2004
fDate
24-24 Nov. 2004
Firstpage
72
Lastpage
75
Abstract
In this paper, we propose a method for spoken digits recognition using dynamic programming (DP) matching combined with subspace decomposition, which linearly separates phonetic information from speech data based on principal component analysis (PCA). This method is capable of more robust speech recognition of less reference speech patterns. The use of the spectral envelope by linear predictive coding (LPC) in speech recognition is unable to avoid errors in recognition due to the uncertainty of personalities, the dynamic variation of features, and so on. By using the subspace method, the proposed method eliminates these problems and enables good recognition results of less standard speech patterns. We use DP matching in recognizing, because it is capable of more efficient pattern matching by normalizing the length of vowels. Simulation results show that the proposed method, using projection onto phonetic subspace with less speaker information, is superior to the conventional method using spectral envelopes, which is obtained by LPC, and DP matching. Projection onto phonetic subspace is a kind of feature vector that contains less speaker information.
Keywords
Dynamic programming; Euclidean distance; Linear predictive coding; Pattern matching; Pattern recognition; Principal component analysis; Robustness; Speech analysis; Speech recognition; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2004. 2004 IEEE Region 10 Conference
Conference_Location
Chiang Mai
Print_ISBN
0-7803-8560-8
Type
conf
DOI
10.1109/TENCON.2004.1414359
Filename
1414359
Link To Document