Title :
Word/sub-word lattices decomposition and combination for speech recognition
Author :
Le, Viet-Bac ; Seng, Sopheap ; Besacier, Laurent ; Bigi, Brigitte
Author_Institution :
LIG Lab., Grenoble
fDate :
March 31 2008-April 4 2008
Abstract :
This paper presents the benefit of using multiple lexical units in the post-processing stage of an ASR system. Since the use of sub-word units can reduce the high out-of-vocabulary rate and improve the lack of text resources in statistical language modeling, we propose several methods to decompose, normalize and combine word and sub-word lattices generated from different ASR systems. By using a sub-word information table, every word in a lattice can be decomposed into sub-word units. These decomposed lattices can be combined into a common lattice in order to generate a confusion network. This lattices combination scheme results in an absolute syllable error rate reduction of about 1.4% over the sentence MAP baseline method for a Vietnamese ASR task. By comparing with the N-best lists combination and voting method, the proposed method works better.
Keywords :
speech recognition; statistical analysis; multiple lexical units; speech recognition; statistical language modeling; voting method; word/subword lattices decomposition; Automatic speech recognition; Error analysis; Laboratories; Lattices; Measurement; Morphology; Natural languages; Particle separators; Speech recognition; Voting; ASR; confusion network; lattice decomposition; lattices combination;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518611