DocumentCode
1749680
Title
Fractal dimension applied to speaker identification
Author
Petry, A. ; Barone, Dante A. C.
Author_Institution
Instituto de Informatica, Univ. Fed. do Rio Grande do Sul, Porto Alegre
Volume
1
fYear
2001
fDate
2001
Firstpage
405
Abstract
Reports the results obtained in a speaker identification system based on Bhattacharrya distance, which combines LP-derived cepstral coefficients, with a nonlinear dynamic feature namely fractal dimension. The nonlinear dynamic analysis starts with the phase space reconstruction, and the fractal dimension of the correspondent attractor trajectory is estimated. This analysis is performed in every speech window, providing a measure of a time-dependent fractal dimension. The corpus used in the tests is composed by 37 different speakers, and the best results are obtained when the fractal dimension is included, suggesting that the information added with this feature was not present so far
Keywords
fractals; prediction theory; probability; speaker recognition; time series; Bhattacharrya distance; LP-derived cepstral coefficients; attractor trajectory; fractal dimension; nonlinear dynamic analysis; nonlinear dynamic feature; phase space reconstruction; speaker identification; speech window; Cepstral analysis; Data mining; Delay effects; Fractals; Multidimensional systems; Nonlinear dynamical systems; Phase estimation; Speaker recognition; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940853
Filename
940853
Link To Document