DocumentCode
2479823
Title
On the Use of Auditory Representations for Sparsity-Based Sound Source Separation
Author
Burred, Juan José ; Sikora, Thomas
Author_Institution
Commun. Syst. Group, Tech. Univ. of Berlin
fYear
0
fDate
0-0 0
Firstpage
1466
Lastpage
1470
Abstract
Sparsity-based source separation algorithms often rely on a transformation into a sparse domain to improve mixture disjointness and therefore facilitate separation. To this end, the most commonly used time-frequency representation has been the short time Fourier transform (STFT). The purpose of this paper is to study the use of auditory-based representations instead of the STFT. We first evaluate the STFT disjointness properties for the case of speech and music signals, and show that auditory representations based on the equal rectangular bandwidth (ERB) and Bark frequency scales can improve the disjointness of the transformed mixtures
Keywords
acoustic signal processing; audio signal processing; music; source separation; time-frequency analysis; ERB; auditory representation; equal rectangular bandwidth; mixture disjointness; sparsity-based sound source separation; speech and music signal; time-frequency representation; Bandwidth; Blind source separation; Fourier transforms; Humans; Image analysis; Independent component analysis; Multiple signal classification; Source separation; Speech analysis; Time frequency analysis; auditory scales; mixture disjointness; source separation; sparse signals;
fLanguage
English
Publisher
ieee
Conference_Titel
Information, Communications and Signal Processing, 2005 Fifth International Conference on
Conference_Location
Bangkok
Print_ISBN
0-7803-9283-3
Type
conf
DOI
10.1109/ICICS.2005.1689302
Filename
1689302
Link To Document