مرکز منطقه ای اطلاع رساني علوم و فناوري - Synthesized stereo mapping via deep neural networks for noisy speech recognition

DocumentCode :

178077

Title :

Synthesized stereo mapping via deep neural networks for noisy speech recognition

Author :

Jun Du ; Li-Rong Dai ; Qiang Huo

Author_Institution :

Univ. of Sci. & Technol. of China, Hefei, China

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

1764

Lastpage :

1768

Abstract :

In our previous work, we extend the traditional stereo-based stochastic mapping by relaxing the constraint of stereo-data, which is not practical in real applications, via HMM-based speech synthesis to construct the “clean” channel data for noisy speech recognition. In this paper, we propose to use deep neural networks (DNNs) for stereo mapping compared with the joint Gaussian mixture model (GMM). The experimental results on Aurora3 databases show that our proposed DNN based synthesized stereo mapping can achieve consistently significant improvements of recognition performance over joint GMM based synthesized stereo mapping in the well-matched (WM) condition among four different European languages.

Keywords :

Gaussian processes; mixture models; neural nets; speech recognition; Aurora3 databases; European languages; HMM-based speech synthesis; deep neural networks; joint Gaussian mixture model; noisy speech recognition; stereo-based stochastic mapping; synthesized stereo mapping; well-matched condition; Databases; Hidden Markov models; Joints; Neural networks; Noise measurement; Speech; Speech recognition; HMM-based speech synthesis; deep neural network; joint Gaussian mixture model; noisy speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6853901

Filename :

6853901

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=178077