DocumentCode :
2173924
Title :
Improving text-independent phonetic segmentation based on the Microcanonical Multiscale Formalism
Author :
Khanagha, Vahid ; Daoudi, Khalid ; Pont, Oriol ; Yahia, Hussein
Author_Institution :
GEOSTAT Team, INRIA Bordeaux Sud-Ouest, Talence, France
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4484
Lastpage :
4487
Abstract :
In an earlier work, we proposed a novel phonetic segmentation method based on speech analysis under the Microcanonical Multiscale Formalism (MMF). The latter relies on the computation of local geometrical parameters, singularity exponents (SE). We showed that SE convey valuable information about the local dynamics of speech that can readily and simply used to detect phoneme boundaries. By performing error analysis of our original algorithm, in this paper we propose a 2-steps technique which better exploits SE to improve the segmentation accuracy. In the first step, we detect the boundaries of the original signal and of a low-pass filtred version, and we consider the union of all detected boundaries as candidates. In the second step, we use a hypothesis test over the local SE distribution of the original signal to select the final boundaries. We carry out a detailed evaluation and comparison over the full training set of the TIMIT database which could be useful to other researchers for comparison purposes. The results show that the new algorithm not only outperforms the original one, but also is significantly much more accurate than state-of-the-art ones.
Keywords :
error analysis; low-pass filters; speech processing; 2-step technique; MMF; TIMIT database; error analysis; local SE distribution; local geometrical parameters; low-pass filtred version; microcanonical multiscale formalism; speech analysis; text-independent phonetic segmentation; Accuracy; Algorithm design and analysis; Databases; Fractals; Motion segmentation; Speech; Training; complex signals and systems; multiscale signal processing; non-linear speech processing; phonetic segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947350
Filename :
5947350
Link To Document :
بازگشت