DocumentCode :
20259
Title :
Mixtures of Local Dictionaries for Unsupervised Speech Enhancement
Author :
Minje Kim ; Smaragdis, Paris
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Volume :
22
Issue :
3
fYear :
2015
fDate :
Mar-15
Firstpage :
293
Lastpage :
297
Abstract :
We propose a novel extension of Nonnegative Matrix Factorization (NMF) that models a signal with multiple local dictionaries activated sparsely. This set of local dictionaries for a source, e.g., speech, disjointly constitute a superset that is more discriminative than an ordinary NMF dictionary, because its local structures represent the source´s manifold better. A block sparsity constraint is used to regularize the NMF solutions so that only one or a small number of blocks are active at a given time. Moreover, a concentrationz prior further regularizes each block of bases to be close to each other for better locality preservation. We test the proposed Mixture of Local Dictionaries (MLD) on single-channel speech enhancement tasks and show that it outperforms the state of the art technology by up to 2 dB in signal-to-distortion ratio, especially in the unsupervised environment where neither the speaker identity nor the type of noise is known in advance.
Keywords :
dictionaries; matrix decomposition; speech enhancement; unsupervised learning; MLD; NMF; mixture of local dictionaries; nonnegative matrix factorization; single-channel speech enhancement tasks; unsupervised speech enhancement; Dictionaries; Manifolds; Noise; Speech; Speech enhancement; Training; Vectors; Manifold learning; nonnegative matrix factorization; speech enhancement;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2014.2346506
Filename :
6874558
Link To Document :
بازگشت