DocumentCode :
294561
Title :
Phrase bigrams for continuous speech recognition
Author :
Giachin, Egidio P.
Author_Institution :
CSELT, Torino, Italy
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
225
Abstract :
In some speech recognition tasks, such as man-machine dialogue systems, the spoken sentences include several recurrent phrases. A bigram language model does not adequately represent these phrases because it underestimates their probability. A better approach consists of modeling phrases as if they were individual dictionary elements. They we inserted as additional entries into the word lexicon, on which bigrams are finally computed. This paper discusses two procedures for automatically determining frequent phrases (within the framework of a probabilistic language model) in an unlabeled training set of written sentences. One procedure is optimal since it minimises the set perplexity. The other, based on information theoretic criteria, insures that the resulting model has a high statistical robustness. The two procedures are tested on a 762-word spontaneous speech recognition task. They give similar results and provide a moderate improvement over standard bigrams
Keywords :
grammars; information theory; interactive systems; natural languages; optimisation; probability; speech processing; speech recognition; statistical analysis; bigram language model; continuous speech recognition; dictionary elements; frequent phrases; information theoretic criteria; man-machine dialogue systems; phrase bigrams; probabilistic language model; recurrent phrases; set perplexity minimisation; spoken sentences; spontaneous speech recognition; statistical robustness; unlabeled training set; word lexicon; written sentences; Dictionaries; Humans; Laboratories; Man machine systems; Robustness; Spatial databases; Speech recognition; Telecommunications; Telephony; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479405
Filename :
479405
Link To Document :
بازگشت