DocumentCode :
2228913
Title :
On designing pronunciation lexicons for large vocabulary continuous speech recognition
Author :
Lame, Lori ; Adda, Gilles
Author_Institution :
Lab. d´´Informatique pour la Mecanique et les Sci. de l´´Ingenieur, CNRS, Orsay, France
Volume :
1
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
6
Abstract :
Creation of pronunciation lexicons for speech recognition is widely acknowledged to be an important but labor-intensive, aspect of system development. Lexicons are often manually created and make use of knowledge and expertise that is difficult to codify. We describe our American English lexicon developed primarily for the ARPA WSJ/NAB tasks. The lexicon is phonemically represented, and contains alternate pronunciations for about 10% of the words. Tools have been developed to add new lexical items, as well as to help ensure consistency of the pronunciations. Our experience in large vocabulary, continuous speech recognition is that systematic lexical design can improve system performance. Some comparative results with commonly available lexicons are given
Keywords :
linguistics; natural language interfaces; software performance evaluation; speech recognition; vocabulary; ARPA WSJ/NAB tasks; American English lexicon; consistency; labor-intensive; large vocabulary continuous speech recognition; lexical items; pronunciation lexicons; system development; system performance; systematic lexical design; Error analysis; Industrial training; Natural languages; Performance evaluation; Speech recognition; System performance; Testing; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.606916
Filename :
606916
Link To Document :
بازگشت