Title :
Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
Author :
Jackson, Philip J B ; Shadle, Christine H.
Author_Institution :
Sch. of Electron. & Electr. Eng., Birmingham Univ., UK
fDate :
10/1/2001 12:00:00 AM
Abstract :
Almost all speech contains simultaneous contributions from more than one acoustic source within the speaker´s vocal tract. In this paper, we propose a method-the pitch-scaled harmonic filter (PSHF)-which aims to separate the voiced and turbulence-noise components of the speech signal during phonation, based on a maximum likelihood approach. The PSHF outputs periodic and aperiodic components that are estimates of the respective contributions of the different types of acoustic source. It produces four reconstructed time series signals by decomposing the original speech signal, first, according to amplitude, and then according to power of the Fourier coefficients. Thus, one pair of periodic and aperiodic signals is optimized for subsequent time-series analysis, and another pair for spectral analysis. The performance of the PSHF algorithm is tested on synthetic signals, using three forms of disturbance (jitter, shimmer and additive noise), and the results were used to predict the performance on real speech. Processing recorded speech examples elicited latent features from the signals, demonstrating the PSHF´s potential for analysis of mixed-source speech
Keywords :
Fourier analysis; acoustic noise; acoustic signal processing; digital filters; frequency estimation; harmonic analysis; maximum likelihood estimation; spectral analysis; speech processing; time series; Fourier coefficients; acoustic source; additive noise; amplitude; aperiodic components; jitter; latent features; maximum likelihood approach; mixed-source speech; periodic components; phonation; pitch-scaled estimation; pitch-scaled harmonic filter; power; reconstructed time series signals; shimmer; spectral analysis; speech signal; synthetic signals; time-series analysis; turbulence-noise components; vocal tract; voiced components; Additive noise; Harmonic filters; Jitter; Maximum likelihood estimation; Signal analysis; Spectral analysis; Speech analysis; Speech processing; Testing; Time series analysis;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on