DocumentCode :
178401
Title :
Learning dynamic features with neural networks for phoneme recognition
Author :
Xin Zheng ; Zhiyong Wu ; Meng, Hsiang-Yun ; Lianhong Cai
Author_Institution :
Shenzhen Key Lab. of Inf. Sci. & Technol., Tsinghua Univ., Shenzhen, China
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
2524
Lastpage :
2528
Abstract :
Dynamic features such as delta and delta-delta of basic acoustic features have long been used in various speech applications and give satisfactory performance. The explicit physical meaning and simplicity of dynamic features clearly compound their prevalence. In this paper, we propose a new framework with neural network to learn the alternatives of traditional delta and higher order differences. Instead of embracing the interpretability and simplicity, our framework is able to learn a new transformation that simulates what differences do but is more relevant to a specific task such as phoneme recognition. We determine the best way to learn such a new transformation among several most probable alternatives. Our experiments indicate that dynamic features obtained with transformation learned this way are better than traditional differences in both frame classification and phoneme recognition. The improvement of performance is even clearer when higher-order of differences are applied.
Keywords :
feature extraction; higher order statistics; neural nets; signal classification; speech recognition; acoustic features; frame classification; higher order differences; neural networks; phoneme recognition; speech applications; Hidden Markov models; Mel frequency cepstral coefficient; Neural networks; Speech; Speech processing; Speech recognition; deep neural network (DNN); delta; difference; higher order; neural network; phoneme recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854055
Filename :
6854055
Link To Document :
بازگشت