Title :
Phase difference of filter-stable part-tones as acoustic feature
Author :
Tüske, Zoltán ; Drepper, Friedhelm R. ; Schlüter, Ralf
Author_Institution :
Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
Abstract :
A part-tone decomposition of voiced sections of speech is introduced, which is adapted with high accuracy to the frequency of the glottal oscillator of the speaker. The iterative replacement of the center filter frequency contours (chosen locally as linear chirps) of the non-stationary bandpass filters converges extremely fast and leads to the extraction of filter-stable part-tones with uncorrupted phases. In contrast to the phases of a frequency decomposition with a priori defined, constant filter frequencies, the phase differences of filter-stable part-tones promise to become a useful supplement to the amplitude-based acoustic features used for conventional automatic speech recognition. The derived phase features are tested in vowel classification experiments based on the phonetically rich TIMIT database.
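The idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It makes several simplifying assumptions: the test signal is synthetic with a constant fundamental (the paper uses locally linear chirps), the bandpass filter is a Gaussian-windowed heterodyne filter, and the relative phase between the second and first part-tone is taken as phi_2 - 2*phi_1. All function names (`part_tone`, `inst_freq`) and parameter values (20 Hz bandwidth, 120 Hz fundamental, 125 Hz initial guess) are hypothetical choices made for illustration.

```python
import numpy as np

def part_tone(signal, fs, center_hz, bandwidth_hz=20.0):
    """Complex part-tone around center_hz (constant center frequency assumed)."""
    t = np.arange(len(signal)) / fs
    # Shift the band of interest down to 0 Hz.
    baseband = signal * np.exp(-2j * np.pi * center_hz * t)
    # Symmetric Gaussian low-pass: real, positive response, so no phase shift.
    sigma = fs / (2 * np.pi * bandwidth_hz)  # window std in samples
    n = np.arange(-int(5 * sigma), int(5 * sigma) + 1)
    window = np.exp(-0.5 * (n / sigma) ** 2)
    window /= window.sum()
    smoothed = np.convolve(baseband, window, mode="same")
    # Shift back up; the factor 2 restores the amplitude of the real component.
    return 2 * smoothed * np.exp(2j * np.pi * center_hz * t)

def inst_freq(tone, fs, core):
    """Median instantaneous frequency (Hz) over the edge-free slice `core`."""
    dphi = np.angle(tone[1:] * np.conj(tone[:-1]))
    return np.median(dphi[core]) * fs / (2 * np.pi)

# Synthetic "voiced" signal: two harmonics with known phase offsets.
fs, f0_true = 16000, 120.0
theta1, theta2 = 0.3, 1.1
t = np.arange(int(0.5 * fs)) / fs
x = (np.cos(2 * np.pi * f0_true * t + theta1)
     + 0.5 * np.cos(2 * np.pi * 2 * f0_true * t + theta2))
core = slice(2000, 6000)  # skip samples affected by the filter edges

# Iteratively replace the center frequency by the part-tone's own frequency.
f_est = 125.0  # deliberately wrong initial guess
for _ in range(3):
    f_est = inst_freq(part_tone(x, fs, f_est), fs, core)

# Relative phase of the second part-tone with respect to the first.
p1 = part_tone(x, fs, f_est)
p2 = part_tone(x, fs, 2 * f_est)
rel_phase = np.angle(np.mean(p2[core] * np.conj(p1[core]) ** 2))
print(f_est, rel_phase)
```

In this toy setting the estimated center frequency should settle near the true fundamental, and `rel_phase` should come out close to `theta2 - 2 * theta1`, independent of time. That time-invariance is what would make such a phase difference usable as a feature; real speech would require the chirped, non-stationary filters the abstract describes.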
Keywords :
acoustic filters; band-pass filters; oscillators; speech recognition; amplitude based acoustic features; automatic speech recognition; center filter frequency; constant filter frequencies; filter-stable part-tones; frequency decomposition; glottal oscillator; nonstationary bandpass filters; part-tone decomposition; phase difference; phonetically rich TIMIT database; speaker; speech voiced sections; vowel classification experiments; Equations; Harmonic analysis; Mel frequency cepstral coefficient; Speech; Speech processing; Time frequency analysis; acoustic feature; filter stable part-tones; relative phase; time-frequency decomposition; voiced speech
Conference_Title :
Statistical Signal Processing Workshop (SSP), 2012 IEEE
Conference_Location :
Ann Arbor, MI
Print_ISBN :
978-1-4673-0182-4
Electronic_ISBN :
pending
DOI :
10.1109/SSP.2012.6319705