مرکز منطقه ای اطلاع رساني علوم و فناوري - Singing voice detection with deep recurrent neural networks

DocumentCode :

730066

Title :

Singing voice detection with deep recurrent neural networks

Author :

Leglaive, Simon ; Hennequin, Romain ; Badeau, Roland

Author_Institution :

Audionamix, Paris, France

fYear :

2015

fDate :

19-24 April 2015

Firstpage :

121

Lastpage :

125

Abstract :

In this paper, we propose a new method for singing voice detection based on a Bidirectional Long Short-Term Memory (BLSTM) Recurrent Neural Network (RNN). This classifier is able to take a past and future temporal context into account to decide on the presence/absence of singing voice, thus using the inherent sequential aspect of a short-term feature extraction in a piece of music. The BLSTM-RNN contains several hidden layers, so it is able to extract a simple representation fitted to our task from low-level features. The results we obtain significantly outperform state-of-the-art methods on a common database.

Keywords :

recurrent neural nets; speech; BLSTM; RNN; bidirectional long short-term memory; deep recurrent neural networks; recurrent neural network; short-term feature extraction; singing voice detection; state-of-the-art methods; Computer architecture; Context; Databases; Feature extraction; Harmonic analysis; Recurrent neural networks; Training; Deep Learning; Long Short-Term Memory; Recurrent Neural Networks; Singing Voice Detection;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location :

South Brisbane, QLD

Type :

conf

DOI :

10.1109/ICASSP.2015.7177944

Filename :

7177944

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=730066