Title :
Wavelet method for breath detection in audio signals
Author :
Igras, Magdalena ; Ziolko, Bartosz
Author_Institution :
Dept. of Electron., AGH Univ. of Sci. & Technol., Kraków, Poland
Abstract :
An algorithm for automatic detection of breath events in a speech signal is suggested in this paper. The issues of breath events occurrences in recordings are discussed as well as their statistical parameters. Also the role of breath pauses for signalizing punctuation and emotional or physical state of the speaker, in both spontaneous and read speech, is described. Wavelet parameters of energy in frequency subbands are obtained from discrete packet wavelet decomposition with mel-scale as patterns of breaths. In the beginning of the detection procedure, preliminary hypotheses of breath are indicated in the analyzed speech signal using temporal features. Then, discrete wavelet transform parameters are calculated. Dynamic time warping is applied to establish final recognition. The algorithm will be used for automatic speech recognition for measuring similarity between training and test features vectors.
Keywords :
audio signal processing; discrete wavelet transforms; feature extraction; speech recognition; statistical analysis; audio signals; automatic breath event detection; automatic speech recognition; discrete packet wavelet decomposition; discrete wavelet transform parameters; dynamic time warping; frequency subbands; mel-scale; read speech; speaker emotional state; speaker physical state; speech signal analysis; statistical parameters; temporal features; test feature vectors; training feature vectors; wavelet energy parameters; Abstracts; Discrete wavelet transforms; Speech; DWT; breath detection; breath pauses; event spotting in audio;
Conference_Titel :
Multimedia and Expo (ICME), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
DOI :
10.1109/ICME.2013.6607428