• DocumentCode
    3581481
  • Title

    Compositional models for signal processing - Perspectives from audio processing

  • Author

    Virtanen, Tuomas

  • Author_Institution
    Dept. of Signal Process., Tampere Univ. of Technol. (TUT), Tampere, Finland
  • fYear
    2014
  • Firstpage
    11
  • Lastpage
    11
  • Abstract
    Many classes of data are composed as purely additive combinations of latent parts that do not result in subtraction or diminishment of the parts. Compositional models such as non-negative matrix factorization can effectively learn these latent structures of the data. Even though such models most naturally applies to non-signal data such as counts of populations, they can be employed to explain other forms of data as well. On signal processing, these models can be used to give more interpretable representations than what is obtained with many established signal processing methods. Therefore, during the last few years such models have provided new paradigms to solve old standing signal processing problems, e.g. source separation and robust pattern recognition. For example in the field of audio processing where we often deal with mixtures of sounds, the models have been used as parts of processing systems to advance the state of the art on many problems, for example on the analysis of polyphonic music and recognition of noisy speech. In this presentation we show how compositional models can be powerful tools for signal processing, providing highly interpretable representations, and enabling diverse applications such as signal analysis, recognition, manipulation, and enhancement. We will use several examples from the field of audio processing to demonstrate the effectiveness of the models.
  • Keywords
    audio signal processing; speech recognition; audio processing; compositional models; noisy speech recognition; non-negative matrix factorization; non-signal data; pattern recognition; polyphonic music; signal analysis; signal enhancement; signal manipulation; signal processing; signal recognition; source separation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA), 2014
  • ISSN
    2326-0262
  • Print_ISBN
    978-8-3620-6518-9
  • Type

    conf

  • Filename
    7067261