• DocumentCode
    2479823
  • Title

    On the Use of Auditory Representations for Sparsity-Based Sound Source Separation

  • Author

    Burred, Juan José ; Sikora, Thomas

  • Author_Institution
    Commun. Syst. Group, Tech. Univ. of Berlin
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    1466
  • Lastpage
    1470
  • Abstract
    Sparsity-based source separation algorithms often rely on a transformation into a sparse domain to improve mixture disjointness and therefore facilitate separation. To this end, the most commonly used time-frequency representation has been the short time Fourier transform (STFT). The purpose of this paper is to study the use of auditory-based representations instead of the STFT. We first evaluate the STFT disjointness properties for the case of speech and music signals, and show that auditory representations based on the equal rectangular bandwidth (ERB) and Bark frequency scales can improve the disjointness of the transformed mixtures
  • Keywords
    acoustic signal processing; audio signal processing; music; source separation; time-frequency analysis; ERB; auditory representation; equal rectangular bandwidth; mixture disjointness; sparsity-based sound source separation; speech and music signal; time-frequency representation; Bandwidth; Blind source separation; Fourier transforms; Humans; Image analysis; Independent component analysis; Multiple signal classification; Source separation; Speech analysis; Time frequency analysis; auditory scales; mixture disjointness; source separation; sparse signals;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information, Communications and Signal Processing, 2005 Fifth International Conference on
  • Conference_Location
    Bangkok
  • Print_ISBN
    0-7803-9283-3
  • Type

    conf

  • DOI
    10.1109/ICICS.2005.1689302
  • Filename
    1689302