مرکز منطقه ای اطلاع رساني علوم و فناوري - Improved automatic speech recognition system using sparse decomposition by basis pursuit with deep rectifier neural networks and compressed sensing recomposition of speech signals

DocumentCode :

1770604

Title :

Improved automatic speech recognition system using sparse decomposition by basis pursuit with deep rectifier neural networks and compressed sensing recomposition of speech signals

Author :

Gavrilescu, Mihai

Author_Institution :

Dept. of Telecommun., Univ. “Politeh.” Bucharest, Bucharest, Romania

fYear :

2014

fDate :

29-31 May 2014

Firstpage :

Lastpage :

Abstract :

Research on the common limitations of Automatic Speech Recognition (ASR) systems state problems ranging from environmental noise, and channel or speaker variability to the limitations imposed by the measurement device. In mobile applications for automatic speech recognition, the Nyquist criteria imposes more limitations on the sampling rate at which a device can acquire signal, often the lack of fidelity of the acquired signal causing bad speech recognition. This is a specific problem for mobile devices (which are also, nowadays, the prime beneficiaries of speech recognition applications) as in this case the sampling rate is limited. We envisage a way to get the best out of any acquired signal, by use of sparsity decomposition algorithms and compressed sensing recomposition. We focus on the fact that complex sounds can be viewed as an overlapping of a number of sounds coming from simple sparse sources. Therefore, we decompose the measured signal in a linear combination of simple sparse signals and we reconstruct each sparse signal by means of compressed sensing recomposition in order to gain a better signal fidelity. We make use of deep rectifier neural network designed to decompose a training set of signals and compute a specific dictionary with simple sparse signals. The resulted sparse signals are used for decomposing the acquired signal by means of sparse algorithms, and, consequently, the resulted combination of sparse signals will be used for signal reconstruction in a compressed sensing algorithm. We test the framework for different simulated speech signals, as well as its usability in automatic speech recognition, discussing the improvements this upgrade brings to an ASR. In this paper we will describe the framework and the algorithms used and present the experimental results.

Keywords :

Nyquist criterion; compressed sensing; neural nets; sampling methods; signal reconstruction; speech recognition; ASR; Nyquist criteria; basis pursuit; compressed sensing recomposition; deep rectifier neural networks; improved automatic speech recognition system; sampling rate; signal reconstruction; sparse decomposition; sparsity decomposition algorithms; speech signals; Acoustics; Adaptation models; Hidden Markov models; Neural networks; Rectifiers; Speech; Speech recognition; automatic speech recognition system; basis pursuit; compressed sensing; deep rectifier neural networks; sparse signals;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Communications (COMM), 2014 10th International Conference on

Conference_Location :

Bucharest

Type :

conf

DOI :

10.1109/ICComm.2014.6866711

Filename :

6866711

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1770604