Title :
A decoder for large vocabulary continuous short message dictation on embedded devices
Author :
Olsen, Jesper ; Cao, Yang ; Ding, Guohong ; Yang, Xinxing
Author_Institution :
Nokia Res. Center, Beijing
fDate :
March 31 2008-April 4 2008
Abstract :
We present our recent progress towards implementing large vocabulary continuous SMS dictation in embedded devices. The dictation engine we describe here is based on the popular finite state transducer paradigm and is capable of handling large vocabularies and high order n-gram language models in a small memory footprint - even relative to what is available in current high end devices such as the Nokia N800 Internet tablet and the N95 Symbian phone. We illustrate the performance of the engine on a 20k vocabulary Chinese Mandarin dictation task which requires less than 10Mb RAM memory to run on the device. The accuracy of the continuous engine is similar to the accuracy of the isolated word dictation engine we have previously developed.
Keywords :
electronic messaging; finite state machines; natural language processing; speech synthesis; decoder; dictation engine; embedded devices; finite state transducer paradigm; high order n-gram language model; large vocabulary continuous short message dictation; Decoding; Hidden Markov models; Internet; Natural languages; Quantization; Random access memory; Read-write memory; Search engines; Speech recognition; Vocabulary; Speech recognition; finite automata; mobile communication; text communication;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518615