Speech recognition experiments using multi-span statistical language models

Author

Bellegarda, Jerome R.

Author_Institution

Spoken Language Group, Apple Comput. Inc., Cupertino, CA, USA

Volume

2

fYear

1999

fDate

15-19 Mar 1999

Firstpage

717

Abstract

A multi-span framework was proposed to integrate the various constraints, both local and global, that are present in the language. In this approach, local constraints are captured via n-gram language modeling, while global constraints are taken into account through the use of latent semantic analysis. The performance of the resulting multi-span language models, as measured by the perplexity, has been shown to compare favorably with the corresponding n-gram performance. This paper reports on actual speech recognition experiments, and shows that word error rate is also substantially reduced. On a subset of the Wall Street Journal speaker-independent, 20,000-word vocabulary, continuous speech task, the multi-span framework resulted in a reduction in average word error rate of up to 17%

Keywords

grammars; speech recognition; statistical analysis; Wall Street Journal speaker-independent vocabulary; average word error rate reduction; continuous speech task; global constraints; latent semantic analysis; local constraints; multi-span statistical language models; n-gram language modeling; n-gram performance; performance; perplexity; speech recognition experiments; word error rate; Data mining; Displays; Error analysis; Frequency; History; Natural languages; Robustness; Speech recognition; Training data; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on

Conference_Location

Phoenix, AZ

ISSN

1520-6149

Print_ISBN

0-7803-5041-3

Type

conf

DOI

10.1109/ICASSP.1999.759767

Filename

759767