Title :
Alternative hypothesis generation using a weighted kernel feature matrix for ASR substitution error correction
Author :
Chao-Hong Liu ; Chung-Hsien Wu ; Sarwono, D.
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Abstract :
Although automatic speech recognition (ASR) has been successfully used in several applications, it is still non-robust and imprecise especially in a harsh environment wherein the input speech is of low quality. Robust error correction for ASR outputs thus becomes important in addition to improving recognition performance. In recent approaches to error correction, linguistic or domain information is used to generate the alternative hypotheses for the ASR outputs followed by the selection of the most likely alternative. In this study, the distances between ASR outputs and the potentially correct alternatives are estimated based on a weighted context-dependent syllable cluster-based kernel feature matrix followed by multidimensional scaling (MDS)-based distance rescaling. These distances are then used to construct an alternative syllable lattice and the dynamic programming is used to obtain the most likely correct output with respect to the original ASR results. Experiments show that the proposed method achieved about 1.95% improvement on the word error rate compared to the correction pair approach using the MATBN Mandarin Chinese broadcast news corpus.
Keywords :
dynamic programming; error correction; linguistics; matrix algebra; speech recognition; ASR substitution error correction; MATBN Mandarin Chinese broadcast news corpus; alternative hypothesis generation; automatic speech recognition; distance rescaling; domain information; dynamic programming; linguistic; multidimensional scaling; robust error correction; syllable lattice; weighted context-dependent syllable cluster; weighted kernel feature matrix; Decision trees; Error analysis; Error correction; Kernel; Lattices; Speech; Speech recognition; ASR substitution error; MDS-based distance rescaling; context-dependent syllable cluster; error correction;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423475