On the Use of Auditory Representations for Sparsity-Based Sound Source Separation

Author

Burred, Juan José ; Sikora, Thomas

Author_Institution

Commun. Syst. Group, Tech. Univ. of Berlin

fYear

0

fDate

0-0 0

Firstpage

1466

Lastpage

1470

Abstract

Sparsity-based source separation algorithms often rely on a transformation into a sparse domain to improve mixture disjointness and therefore facilitate separation. To this end, the most commonly used time-frequency representation has been the short time Fourier transform (STFT). The purpose of this paper is to study the use of auditory-based representations instead of the STFT. We first evaluate the STFT disjointness properties for the case of speech and music signals, and show that auditory representations based on the equal rectangular bandwidth (ERB) and Bark frequency scales can improve the disjointness of the transformed mixtures

Keywords

acoustic signal processing; audio signal processing; music; source separation; time-frequency analysis; ERB; auditory representation; equal rectangular bandwidth; mixture disjointness; sparsity-based sound source separation; speech and music signal; time-frequency representation; Bandwidth; Blind source separation; Fourier transforms; Humans; Image analysis; Independent component analysis; Multiple signal classification; Source separation; Speech analysis; Time frequency analysis; auditory scales; mixture disjointness; source separation; sparse signals;

fLanguage

English

Publisher

ieee

Conference_Titel

Information, Communications and Signal Processing, 2005 Fifth International Conference on

Conference_Location

Bangkok

Print_ISBN

0-7803-9283-3

Type

conf

DOI

10.1109/ICICS.2005.1689302

Filename

1689302