On the use of linguistic consistency in systems for human-computer dialogues

Author

Estève, Yannick ; Raymond, Christian ; de Mori, Renato ; Janiszek, David

Author_Institution

Lab. d´´Informatique d´´Avignon, Avignon, France

Volume

11

Issue

6

fYear

2003

Firstpage

746

Lastpage

756

Abstract

This paper introduces new recognition strategies based on reasoning about results obtained with different Language Models (LMs). Strategies are built following the conjecture that the consensus among the results obtained with different models gives rise to different situations in which hypothesized sentences have different word error rates (WER) and may be further processed with other LMs. New LMs are built by data augmentation using ideas from latent semantic analysis and trigram analogy. Situations are defined by expressing the consensus among the recognition results produced with different LMs and by the amount of unobserved trigrams in the hypothesized sentence. The diagnostic power of the use of observed trigrams or their corresponding class trigrams is compared with that of situations based on values of sentence posterior probabilities. In order to avoid or correct errors due to syntactic inconsistence of the recognized sentence, automata, obtained by explanation-based learning, are introduced and used in certain conditions. Semantic Classification Trees are introduced to provide sentence patterns expressing constraints of long distance syntactic coherence. Results on a dialogue corpus provided by France Telecom R&D have shown that starting with a WER of 21.87% on a test set of 1422 sentences, it is possible to subdivide the sentences into three sets characterized by automatically recognized situations. The first one has a coverage of 68% with a WER of 7.44%. The second one has various types of sentences with a WER around 20%. The third one contains 13% of the sentences that should be rejected with a WER around 49%. The second set characterizes sentences that should be processed with particular care by the dialogue interpreter with the possibility of asking a confirmation from the user.

Keywords

learning (artificial intelligence); linguistics; natural language interfaces; semantic networks; speech recognition; speech-based user interfaces; trees (mathematics); dialogue interpreter; explanation-based learning; human-computer dialogues; language models; latent semantic analysis; learning by analogy; linguistic consistency; long distance syntactic coherence; reasoning-based recognition strategies; recognized sentence syntactic inconsistency; semantic classification trees; sentence patterns; sentence posterior probabilities; trigram analogy; word error rates; Automatic speech recognition; Automatic testing; Character recognition; Classification tree analysis; Error analysis; Error correction; Learning automata; Research and development; Scheduling; Telecommunications;

fLanguage

English

Journal_Title

Speech and Audio Processing, IEEE Transactions on

Publisher

ieee

ISSN

1063-6676

Type

jour

DOI

10.1109/TSA.2003.818318

Filename

1255462