• DocumentCode
    3403691
  • Title

    Evaluation of several strategies for single sensor speech/music separation

  • Author

    Blouet, R. ; Rapaport, G. ; Fevotte, Cedric

  • Author_Institution
    MIST Technol. Res., Paris
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    37
  • Lastpage
    40
  • Abstract
    In this paper we address the application of single sensor source separation techniques to mixtures of speech and music. Three strategies for source modeling are presented, namely Gaussian scaled mixture models (GSMM), autoregressive (AR) models and amplitude factor (AF). The common ingredient to the methods is the use of a codebook containing elementary spectral shapes to represent non- stationary signals, and to handle separately spectral shape and amplitude information. We propose a new system that employs separate models for the speech and music signals. The speech signal proves to be best modeled with the AR-based codebook, while the music signal is best modeled with the AF-based codebook. Experimental results demonstrate the improved performance of the proposed approach for speech/music separation in some evaluation criteria.
  • Keywords
    Gaussian processes; autoregressive processes; music; source separation; spectral analysis; speech processing; Gaussian scaled mixture model; amplitude factor; autoregressive model; codebook; single sensor speech-music source separation; spectral estimation; Amplitude estimation; Cities and towns; Fourier transforms; Frequency; Independent component analysis; Linearity; Multiple signal classification; Source separation; Spectral shape; Speech analysis; Gaussian mixture models; Single sensor source separation; autoregressive model; spectral estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4517540
  • Filename
    4517540