Title :
MLP based phoneme detectors for Automatic Speech Recognition
Author :
Thomas, Samuel ; Nguyen, Patrick ; Zweig, Geoffrey ; Hermansky, Hynek
Author_Institution :
Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
Phoneme posterior probabilities estimated using Multi-Layer Perceptrons (MLPs) are extensively used both as acoustic scores and features for speech recognition. In this paper we explore a different application of these posteriors as phonetic event detectors for speech recognition. We show how these detectors can be built to reliably capture phonetic events in the acoustic signal by integrating both acoustic and phonetic information about sound classes. These event detectors are used along with Segmental Conditional Random Fields (SCRFs) to improve the performance of speech recognition systems on the Broadcast News task.
Keywords :
multilayer perceptrons; probability; speech processing; speech recognition; MLP based phoneme detector; SCRF; acoustic signal; automatic speech recognition; broadcast news task; multilayer perceptron based phoneme detector; phoneme posterior probability estimation; phonetic event detector; segmental conditional random field; Acoustics; Detectors; Feature extraction; Hidden Markov models; Viterbi algorithm; Multi-layer Perceptrons; Phoneme Posteriors; Segmental Conditional Random Fields;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947485