DocumentCode :
2177227
Title :
MLP based phoneme detectors for Automatic Speech Recognition
Author :
Thomas, Samuel ; Nguyen, Patrick ; Zweig, Geoffrey ; Hermansky, Hynek
Author_Institution :
Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5024
Lastpage :
5027
Abstract :
Phoneme posterior probabilities estimated using Multi-Layer Perceptrons (MLPs) are extensively used both as acoustic scores and features for speech recognition. In this paper we explore a different application of these posteriors as phonetic event detectors for speech recognition. We show how these detectors can be built to reliably capture phonetic events in the acoustic signal by integrating both acoustic and phonetic information about sound classes. These event detectors are used along with Segmental Conditional Random Fields (SCRFs) to improve the performance of speech recognition systems on the Broadcast News task.
Keywords :
multilayer perceptrons; probability; speech processing; speech recognition; MLP based phoneme detector; SCRF; acoustic signal; automatic speech recognition; broadcast news task; multilayer perceptron based phoneme detector; phoneme posterior probability estimation; phonetic event detector; segmental conditional random field; Acoustics; Detectors; Feature extraction; Hidden Markov models; Viterbi algorithm; Multi-layer Perceptrons; Phoneme Posteriors; Segmental Conditional Random Fields;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947485
Filename :
5947485
Link To Document :
بازگشت