Title :
Adaptive processing and learning for audio source separation
Author :
Jen-Tzung Chien ; Sawada, Hideyuki ; Makino, Shigeru
Author_Institution :
Dept. of Electr. & Comput. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fDate :
Oct. 29 2013-Nov. 1 2013
Abstract :
This paper overviews a series of recent advances in adaptive processing and learning for audio source separation. In real world, speech and audio signal mixtures are observed in reverberant environments. Sources are usually more than mixtures. The mixing condition is occasionally changed due to the moving sources or when the sources are changed or abruptly present or absent. In this survey article, we investigate different issues in audio source separation including overdetermined/underdetermined problems, permutation alignment, convolutive mixtures, contrast functions, nonstationary conditions and system robustness. We provide a systematic and comprehensive view for these issues and address new approaches to overdetermined/underdetermined convolutive separation, sparse learning, nonnegative matrix factorization, information-theoretic learning, online learning and Bayesian approaches.
Keywords :
learning (artificial intelligence); speech processing; Bayesian approaches; adaptive learning; adaptive processing; audio signal mixtures; audio source separation; contrast functions; convolutive mixtures; convolutive separation; information-theoretic learning; nonnegative matrix factorization; nonstationary conditions; online learning; overdetermined-underdetermined problems; permutation alignment; reverberant environments; sparse learning; speech signal mixtures; system robustness; Bayes methods; Frequency modulation; Source separation; Speech; Speech recognition; Time-frequency analysis; Vectors;
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
DOI :
10.1109/APSIPA.2013.6694302