Regional Information Center for Science and Technology - Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition

DocumentCode :

3744885

Title :

Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition

Author :

Ning Ma;Ricard Marxer;Jon Barker;Guy J. Brown

Author_Institution :

Department of Computer Science, University of Sheffield, Sheffield S1 4DP, UK

Year :

2015

Firstpage :

490

Lastpage :

495

Abstract :

This paper presents a novel system that exploits synchrony spectra and deep neural networks (DNNs) for automatic speech recognition (ASR) in challenging noisy environments. Synchrony spectra measure the extent to which each frequency channel in an auditory model is entrained to a particular pitch period. They are used together with F0 estimates either in a DNN for time-frequency (T-F) mask estimation or to augment the input features for a DNN-based ASR system. The proposed approach was evaluated in the context of the CHiME-3 Challenge. Our experiments show that the synchrony spectra features work best when augmenting the input features to the DNN-based ASR system. Compared to the CHiME-3 baseline system, our best system reduces the word error rate (WER) by more than 14% absolute, achieving a WER of 18.56% on the evaluation test set.
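
As a rough illustration of the idea of a channel being "entrained to a particular pitch period", the sketch below scores each channel by its normalized autocorrelation at the lag of a candidate pitch period. This is an assumption made for illustration, not the authors' implementation: the record does not describe the auditory model or the exact synchrony measure, and the names `synchrony_at_lag`, `channels`, and `lag` are hypothetical.

```python
import numpy as np


def synchrony_at_lag(channels, lag):
    """Normalized autocorrelation of each channel at one lag (in samples).

    Illustrative assumption: a value near 1 means the channel repeats
    with the candidate pitch period; values near 0 or below mean it does not.
    `channels` has shape (n_channels, n_samples).
    """
    x = np.asarray(channels, dtype=float)
    if x.ndim != 2:
        raise ValueError("channels must be a 2-D array")
    if not 0 < lag < x.shape[1]:
        raise ValueError("lag must satisfy 0 < lag < n_samples")

    a = x[:, :-lag]  # signal
    b = x[:, lag:]   # same signal shifted by one candidate pitch period
    num = np.sum(a * b, axis=1)
    den = np.sqrt(np.sum(a * a, axis=1) * np.sum(b * b, axis=1))
    # Silent channels (zero energy) get a synchrony of 0 instead of NaN.
    return np.divide(num, den, out=np.zeros_like(num), where=den > 0)


# Toy usage: three sinusoidal "channels" tested against a 100 Hz pitch
# (lag of 160 samples at an assumed 16 kHz sampling rate).
fs = 16000
t = np.arange(1600) / fs
toy_channels = np.stack([
    np.sin(2 * np.pi * 200 * t),  # harmonic of 100 Hz
    np.sin(2 * np.pi * 250 * t),  # not a harmonic of 100 Hz
    np.sin(2 * np.pi * 225 * t),  # not a harmonic of 100 Hz
])
scores = synchrony_at_lag(toy_channels, lag=fs // 100)
```

In this toy setup only the first channel, a harmonic of the candidate F0, scores close to 1. A per-frame vector of such scores across channels is the kind of feature that could be appended to the ASR input features, in the spirit of the feature augmentation the abstract describes.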

Keywords :

"Speech","Noise measurement","Neural networks","Time-frequency analysis","Training","Correlation","Speech recognition"

Publisher :

IEEE

Conference_Title :

Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on

Type :

conf

DOI :

10.1109/ASRU.2015.7404835

Filename :

7404835

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3744885