DocumentCode :
294523
Title :
Large vocabulary continuous speech recognition using word graphs
Author :
Aubert, Xavier ; Ney, Hermann
Author_Institution :
Philips Res. Lab., Aachen, Germany
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
49
Abstract :
We address the problem of using word graphs (or lattices) for the integration of complex knowledge sources like long span language models or acoustic cross-word models, in large vocabulary continuous speech recognition. A method for efficiently constructing a word graph is reviewed and two ways of exploiting it are presented. By assuming the word pair approximation, a phrase level search is possible while in the other case a general graph decoder is set up. We show that the predecessor-word identity provided by a first bigram decoding might be used to constrain the word graph without impairing the next pass. This procedure has been applied to 64 k-word trigram decoding in conjunction with an incremental unsupervised speaker adaptation scheme. Experimental results are given for the North American Business corpus used in the November ´94 evaluation
Keywords :
decoding; graph theory; speech recognition; North American Business corpus; acoustic cross-word models; complex knowledge sources; first bigram decoding; general graph decoder; incremental unsupervised speaker adaptation scheme; large vocabulary continuous speech recognition; lattices; long span language models; phrase level search; predecessor-word identity; trigram decoding; word graphs; word pair approximation; Approximation algorithms; Capacitive sensors; Decoding; Laboratories; Lattices; Natural languages; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479270
Filename :
479270
Link To Document :
بازگشت