Title :
Digit recognition using wavelet and SVM in Brazilian Portuguese
Author :
De Bresolin, Adriano Andrade ; Neto, Adrião Duarte Dória ; Alsina, Pablo Javier
Author_Institution :
Technol. Fed. Univ. of the Parana, Curitiba
fDate :
March 31 2008-April 4 2008
Abstract :
In this paper we used WPT (wavelet packet transform) and neural classifier SVM (support vector machine) to recognize spoken digits from 0 to 9 in Brazilian Portuguese. The main objective this work is to find out the Wavelet mother that better represents the speech signal in Brazilian Portuguese. The results obtained were compared with MFCC (Mel frequency cepstral coefficients). We carried out sixteen experiments with different Wavelets in dependent- case and four experiments in independent-case. The database was recorded in three months with 82 eighteen-to- forty years old male speakers. The SVM was used as a classifier in a "one vs. all" strategy. Best results have been obtained using Wavelets Daubechies 5, Meyer and Coiflet 5. Finally, we used a neural network MLP (multi layer perceptron) in order to improve the SVM results.
Keywords :
multilayer perceptrons; natural language processing; speech recognition; support vector machines; wavelet transforms; Brazilian Portuguese; Mel frequency cepstral coefficients; SVM; digit recognition; multilayer perceptron; neural classifier; neural network MLP; speech recognition; support vector machine; wavelet packet transform; Band pass filters; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Spatial databases; Speech recognition; Support vector machine classification; Support vector machines; Wavelet packets; Wavelet transforms; Multilayer Perceptrons; Neural Networks (SVM); Speech Recognition; Wavelet Transforms;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517917