DocumentCode :
1330959
Title :
On decomposing speech into modulated components
Author :
Rao, Ashwin ; Kumaresan, Ramdas
Author_Institution :
Dragon Syst. Inc., Newton, MA, USA
Volume :
8
Issue :
3
fYear :
2000
fDate :
5/1/2000 12:00:00 AM
Firstpage :
240
Lastpage :
254
Abstract :
We model a segment of a filtered speech signal as a product of elementary signals, as opposed to a sum of sinusoidal signals. Using this model, one can better appreciate the basic relationships between envelopes and phases or instantaneous frequencies (IFs) of signals. These relationships reveal some interesting properties of the signal's modulations. For instance, if the contribution due to a signal's envelope, specifically the Hilbert transform of its log-envelope, is removed from the signal's phase, then the resulting signal's IF is strictly positive. In addition, a filtered speech signal having a bandwidth of B Hz can be essentially represented by a log-envelope and an IF that have the same B Hz bandwidth. We extend the above ideas to decompose speech into modulated components. Specifically, a bank of data-adaptive filters (in a cross-coupled configuration) is used to decompose speech into its components; each adaptive filter is a simple single-resonance bandpass filter (whose center frequency or pole location closely follows the desired formant frequency) supplemented by an adaptive all-zero filter (whose zero locations sufficiently reduce unwanted leakage from neighboring formants). The filtered components are then represented by their respective log-envelopes and positive IFs; this small number of modulations closely approximates the speech signal.
Keywords :
Hilbert transforms; adaptive filters; adaptive signal processing; band-pass filters; channel bank filters; circuit resonance; filtering theory; modulation; poles and zeros; prediction theory; speech processing; time-domain analysis; time-varying filters; Hilbert transform; adaptive all-zero filter; bandwidth; center-frequency; cross-coupled configuration; data-adaptive filter bank; elementary signals; filtered components; filtered speech signal; filtered speech signal segment; formant frequency; instantaneous frequencies; linear prediction; log-envelope; modulated components; pole-location; positive IF; resonance bandpass filter; signal envelope; signal modulation; signal phase; speech decomposition; speech signal approximation; time varying filter bank; time-domain analysis; zero-locations; Adaptive filters; Band pass filters; Bandwidth; Filter bank; Frequency; Signal analysis; Signal processing; Speech analysis; Speech processing; Speech synthesis;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.841207
Filename :
841207