Title :
Efficient embedded speech recognition for very large vocabulary Mandarin car-navigation systems
Author :
Qian, Yanmin ; Liu, Jia ; Johnson, Michael T.
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fDate :
8/1/2009 12:00:00 AM
Abstract :
Automatic speech recognition (ASR) for a very large vocabulary of isolated words is a difficult task on a resource-limited embedded device. This paper presents a novel fast decoding algorithm for a Mandarin speech recognition system which can simultaneously process hundreds of thousands of items and maintain high recognition accuracy. The proposed algorithm constructs a semi-tree search network based on Mandarin pronunciation rules, to avoid duplicate syllable matching and save redundant memory. Based on a two-stage fixed-width beam-search baseline system, the algorithm employs a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce recognition time. This algorithm is aimed at an in-car navigation system in China and simulated on a standard PC workstation. The experimental results show that the proposed method reduces recognition time by nearly 6-fold and memory size nearly 2- fold compared to the baseline system, and causes less than 1% accuracy degradation for a 200,000 word recognition task.
Keywords :
natural languages; road traffic; speech recognition; Mandarin speech recognition system; automatic speech recognition; fast decoding algorithm; frame-synchronous word-level pruning strategy; semi-tree search network; two-stage fixed-width beam-search baseline system; variable beam-width pruning strategy; very large vocabulary Mandarin car-navigation systems; Automatic speech recognition; Computer networks; Decoding; Degradation; Laboratories; Navigation; Robustness; Speech recognition; Vocabulary; Workstations; Beam-search; Search network; Speech recognition; Word-level pruning;
Journal_Title :
Consumer Electronics, IEEE Transactions on
DOI :
10.1109/TCE.2009.5278018