Title :
Singing voice detection with deep recurrent neural networks
Author :
Leglaive, Simon ; Hennequin, Romain ; Badeau, Roland
Author_Institution :
Audionamix, Paris, France
Abstract :
In this paper, we propose a new method for singing voice detection based on a Bidirectional Long Short-Term Memory (BLSTM) Recurrent Neural Network (RNN). This classifier is able to take a past and future temporal context into account to decide on the presence/absence of singing voice, thus using the inherent sequential aspect of a short-term feature extraction in a piece of music. The BLSTM-RNN contains several hidden layers, so it is able to extract a simple representation fitted to our task from low-level features. The results we obtain significantly outperform state-of-the-art methods on a common database.
Keywords :
recurrent neural nets; speech; BLSTM; RNN; bidirectional long short-term memory; deep recurrent neural networks; recurrent neural network; short-term feature extraction; singing voice detection; state-of-the-art methods; Computer architecture; Context; Databases; Feature extraction; Harmonic analysis; Recurrent neural networks; Training; Deep Learning; Long Short-Term Memory; Recurrent Neural Networks; Singing Voice Detection;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7177944