مرکز منطقه ای اطلاع رساني علوم و فناوري - Structured discriminative models using deep neural-network features

DocumentCode :

3744839

Title :

Structured discriminative models using deep neural-network features

Author :

R. C. van Dalen;J. Yang;H. Wang;A. Ragni;C. Zhang;M. J. F. Gales

Author_Institution :

Department of Engineering, University of Cambridge, United Kingdom

fYear :

2015

Firstpage :

160

Lastpage :

166

Abstract :

State-of-the-art speech recognisers employ neural networks in various configurations. A standard (hybrid) speech recogniser computes the likelihood for one time frame and state, using only one out of thousands of possible neural-network outputs. However, the whole output vector carries information. In this paper, features from state-of-the-art speech recognisers are collected per phone given a particular context, and input to a discriminative log-linear model. The log-linear model is trained with conditional maximum likelihood or a large-margin criterion. A key element is the prior on the parameters of the log-linear model. The mean of the prior is set to the point where the performance of the original systems is attained. The log-linear model then provides an additional increase over the state-of-the-art performance of the individual systems.

Keywords :

"Hidden Markov models","Feature extraction","Speech recognition","Speech","Neural networks","Context modeling","Acoustics"

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on

Type :

conf

DOI :

10.1109/ASRU.2015.7404789

Filename :

7404789

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3744839