• DocumentCode
    180130
  • Title

    Exploiting long-term temporal dependencies in NMF using recurrent neural networks with application to source separation

  • Author

    Boulanger-Lewandowski, Nicolas ; Mysore, Gautham J. ; Hoffman, Matthias

  • Author_Institution
    Univ. de Montreal Montreal, Montreal, QC, Canada
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    6969
  • Lastpage
    6973
  • Abstract
    This paper seeks to exploit high-level temporal information during feature extraction from audio signals via non-negative matrix factorization. Contrary to existing approaches that impose local temporal constraints, we train powerful recurrent neural network models to capture long-term temporal dependencies and event co-occurrence in the data. This gives our method the ability to “fill in the blanks” in a smart way during feature extraction from complex audio mixtures, an ability very useful for a number of audio applications. We apply these ideas to source separation problems.
  • Keywords
    audio signal processing; matrix decomposition; recurrent neural nets; source separation; NMF; audio applications; audio signals; complex audio mixtures; feature extraction; high-level temporal information; local temporal constraints; long-term temporal dependencies; nonnegative matrix factorization; recurrent neural networks; source separation; Hidden Markov models; Recurrent neural networks; Smoothing methods; Source separation; Spectrogram; Speech; Vectors; Recurrent neural networks; audio source separation; long-term temporal dependencies; non-negative matrix factorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854951
  • Filename
    6854951