DocumentCode :
3348299
Title :
Multiband audio modeling for single-channel acoustic source separation
Author :
Reyes-Gomez, Manuel J. ; Ellis, Daniel P W ; Jojic, Nebojsa
Author_Institution :
Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
Volume :
5
fYear :
2004
fDate :
17-21 May 2004
Abstract :
Detailed hidden Markov models (HMMs) that capture the constraints implicit in a particular sound can be used to estimate obscured or corrupted portions from partial observations, the situation encountered when trying to identify multiple, overlapping sounds. However, when the complexity and variability of the sounds are high, as in a particular speaker´s voice, a detailed model might require several thousand states to cover the full range of different short-term spectra with adequate resolution. To address the tractability problems of such large models, we break the source signals into multiple frequency bands, and build separate but coupled HMMs for each band, requiring many fewer states per model. To prevent non-natural full spectral states and to enforce consistency within and between bands, at any given frame, the state in a particular band is determined by the previous state in that band and the states in the adjacent bands. Coupling the bands in this manner results in a grid like model for the full spectrum. Since exact inference of such a model is intractable, we derive an efficient approximation based on variational methods. Results in source separation of combined signals modeled with this approach outperform the separation obtained by full-band models.
Keywords :
audio signal processing; hidden Markov models; source separation; variational techniques; combined signal source separation; coupled HMM; hidden Markov models; inter-band consistency; intra-band consistency; multiband audio modeling; multiband speech models; multiple frequency bands; multiple overlapping sounds; single-channel acoustic source separation; spectrum grid like model; variational approximation methods; Automatic speech recognition; Frequency synchronization; Graphical models; Hidden Markov models; Signal resolution; Source separation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1327192
Filename :
1327192
Link To Document :
بازگشت