DocumentCode :
3527138
Title :
Multiple time resolution analysis of speech signal using MCE training with application to speech recognition
Author :
Dimopoulos, Spiros ; Potamianos, Alexandros ; Lussier, Eric-Fosler ; Lee, Chin-Hui
Author_Institution :
Dept. of Electron. & Comput. Eng., Tech. Univ. of Crete, Chania
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
3801
Lastpage :
3804
Abstract :
In this paper, we propose two methods of multiple time-resolution analysis of speech and their application to automatic speech recognition (ASR). Constant frame-rate multi-scale analysis is proposed based on a box of multi-scale features. Then a variable rate analysis is proposed based on the selection of the optimal temporal resolution on the fly by a properly trained non-linear classifier unit. The classifier´s parameters are trained using the discriminative method of minimum classification error (MCE) training. We use the recently proposed conditional random fields (CRF) phonetic recognition system that effectively combines highly correlated features. Results are reported on a frame-wise classification task and also on TIMIT phone recognition task. Results show that (i) CRFs can effectively combine multi-scale features and (ii) MCE trained variable rate CRFs are competitive with the ldquoboxrdquo combination method.
Keywords :
speech processing; speech recognition; TIMIT phone recognition task; automatic speech recognition; conditional random fields; frame-wise classification task; minimum classification error; multiple time resolution analysis; multiscale features; phonetic recognition; speech signal; Application software; Automatic speech recognition; Computer science; Hidden Markov models; Signal analysis; Signal resolution; Spectral analysis; Speech analysis; Speech processing; Speech recognition; ASR; Conditional Random Fields; MCE; Multiple Frame Rates; Variable Frame Rate;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960455
Filename :
4960455
Link To Document :
بازگشت