مرکز منطقه ای اطلاع رساني علوم و فناوري - Discrete Wavelet Transform for automatic speaker recognition

DocumentCode :

3254915

Title :

Discrete Wavelet Transform for automatic speaker recognition

Author :

Král, Pavel

Author_Institution :

Dept. Inf. & Comput. Sci., Univ. of West Bohemia, Plzeñ, Czech Republic

Volume :

fYear :

2010

fDate :

16-18 Oct. 2010

Firstpage :

3514

Lastpage :

3518

Abstract :

This paper deals with automatic speaker recognition. We consider here a context independent speaker recognition task with a closed set of speakers. We have shown in [1] a comparative study about the most frequently used parametrization/classification methods for the Czech language. Wavelet Transform (WT) is a modern parametrization method successfully used for some signal processing tasks. WT often outperforms parametrizations based on Fourier Transform, due to its capability to represent the signal precisely, in both frequency and time domains. The main goal of this paper is thus to use and evaluate several Wavelet Transforms instead of the conventional parametrizations that were used previously as a parametrization method of automatic speaker recognition. All experiments are performed on two Czech speaker corpora that contain speech of ten and fifty Czech native speakers, respectively. Three discrete wavelet families with different number of coefficients have been used and evaluated: Daubechies, Symlets and Coiflets with two classifiers: Gaussian Mixture Model (GMM) and Multi-Layer Perceptron (MLP). We show that recognition accuracy of wavelet parametrizations is very good and sometimes outperform the best parametrizations that were presented in our previous work.

Keywords :

Fourier transforms; Gaussian processes; discrete wavelet transforms; multilayer perceptrons; speaker recognition; Coiflets; Czech language; Czech speaker corpora; Daubechies; Fourier transform; Gaussian mixture model; Symlets; automatic speaker recognition; context independent speaker recognition task; discrete wavelet transform; multilayer perceptron; parametrization-classification methods; signal processing; Accuracy; Hidden Markov models; Speaker recognition; Speech; Training data; Wavelet transforms;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Image and Signal Processing (CISP), 2010 3rd International Congress on

Conference_Location :

Yantai

Print_ISBN :

978-1-4244-6513-2

Type :

conf

DOI :

10.1109/CISP.2010.5646691

Filename :

5646691

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3254915