Title :
Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop
Author :
Zweig, G. ; Nguyen, P. ; Van Compernolle, D. ; Demuynck, K. ; Atlas, L. ; Clark, P. ; Sell, G. ; Wang, M. ; Sha, F. ; Hermansky, H. ; Karakos, D. ; Jansen, A. ; Thomas, S. ; Sivaram, G.S.V.S. ; Bowman, S. ; Kao, J.
Author_Institution :
Microsoft Res., Mountain View, CA, USA
Abstract :
This paper summarizes the 2010 CLSP Summer Workshop on speech recognition at Johns Hopkins University. The key theme of the workshop was to improve on state-of-the-art speech recognition systems by using Segmental Conditional Random Fields (SCRFs) to integrate multiple types of information. This approach uses a state-of-the-art baseline as a springboard from which to add a suite of novel features including ones derived from acoustic templates, deep neural net phoneme detections, duration models, modulation features, and whole-word point-process models. The SCRF framework is able to appropriately weight these different information sources to produce significant gains on both the Broadcast News and Wall Street Journal tasks.
Keywords :
neural nets; speech recognition; JHU CLSP 2010 Summer Workshop; SCRF framework; neural net phoneme detections; point-process models; segmental conditional random fields; speech recognition; Acoustics; Detectors; Feature extraction; Hidden Markov models; Speech; Speech recognition; Training; CRF; Segmental Conditional Random Field; Speech Recognition;
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947490