• DocumentCode
    730066
  • Title

    Singing voice detection with deep recurrent neural networks

  • Author

    Leglaive, Simon ; Hennequin, Romain ; Badeau, Roland

  • Author_Institution
    Audionamix, Paris, France
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    121
  • Lastpage
    125
  • Abstract
    In this paper, we propose a new method for singing voice detection based on a Bidirectional Long Short-Term Memory (BLSTM) Recurrent Neural Network (RNN). This classifier is able to take a past and future temporal context into account to decide on the presence/absence of singing voice, thus using the inherent sequential aspect of a short-term feature extraction in a piece of music. The BLSTM-RNN contains several hidden layers, so it is able to extract a simple representation fitted to our task from low-level features. The results we obtain significantly outperform state-of-the-art methods on a common database.
  • Keywords
    recurrent neural nets; speech; BLSTM; RNN; bidirectional long short-term memory; deep recurrent neural networks; recurrent neural network; short-term feature extraction; singing voice detection; state-of-the-art methods; Computer architecture; Context; Databases; Feature extraction; Harmonic analysis; Recurrent neural networks; Training; Deep Learning; Long Short-Term Memory; Recurrent Neural Networks; Singing Voice Detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7177944
  • Filename
    7177944