Title :
Matrix parser and its application to HMM-based speech recognition
Author :
Singer, Harald ; Sagayama, Shigeki
Author_Institution :
ATR Interpreting Telephony Res. Lab., Soraku-gun, Kyoto, Japan
Abstract :
The authors describe a unified framework for continuous speech recognition (CSR) under grammatical constraints, where trellis calculations and parsing are performed by the same simple fundamental operations, namely multiplication and addition of likelihood matrices. The matrix parser is shown to be a generalization of the Cocke-Younger-Kasami (CYK) parser, which because of its simplicity lends itself to efficient hardware implementation. It also facilitates explicit suprasegmental duration control for all grammatical categories. Preliminary results showed that improved duration control on the mora level raised the recognition accuracy from 86.6% to 88.2%.<>
Keywords :
context-free grammars; hidden Markov models; matrix algebra; speech recognition; HMM-based speech recognition; continuous speech recognition; explicit suprasegmental duration control; grammatical constraints; hardware implementation; likelihood matrices; matrix parser; recognition accuracy; trellis calculations;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.1993.319295