DocumentCode :
730066
Title :
Singing voice detection with deep recurrent neural networks
Author :
Leglaive, Simon ; Hennequin, Romain ; Badeau, Roland
Author_Institution :
Audionamix, Paris, France
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
121
Lastpage :
125
Abstract :
In this paper, we propose a new method for singing voice detection based on a Bidirectional Long Short-Term Memory (BLSTM) Recurrent Neural Network (RNN). This classifier is able to take a past and future temporal context into account to decide on the presence/absence of singing voice, thus using the inherent sequential aspect of a short-term feature extraction in a piece of music. The BLSTM-RNN contains several hidden layers, so it is able to extract a simple representation fitted to our task from low-level features. The results we obtain significantly outperform state-of-the-art methods on a common database.
Keywords :
recurrent neural nets; speech; BLSTM; RNN; bidirectional long short-term memory; deep recurrent neural networks; recurrent neural network; short-term feature extraction; singing voice detection; state-of-the-art methods; Computer architecture; Context; Databases; Feature extraction; Harmonic analysis; Recurrent neural networks; Training; Deep Learning; Long Short-Term Memory; Recurrent Neural Networks; Singing Voice Detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7177944
Filename :
7177944
Link To Document :
بازگشت