DocumentCode :
846281
Title :
On the use of linguistic consistency in systems for human-computer dialogues
Author :
Estève, Yannick ; Raymond, Christian ; de Mori, Renato ; Janiszek, David
Author_Institution :
Lab. d´´Informatique d´´Avignon, Avignon, France
Volume :
11
Issue :
6
fYear :
2003
Firstpage :
746
Lastpage :
756
Abstract :
This paper introduces new recognition strategies based on reasoning about results obtained with different Language Models (LMs). Strategies are built following the conjecture that the consensus among the results obtained with different models gives rise to different situations in which hypothesized sentences have different word error rates (WER) and may be further processed with other LMs. New LMs are built by data augmentation using ideas from latent semantic analysis and trigram analogy. Situations are defined by expressing the consensus among the recognition results produced with different LMs and by the amount of unobserved trigrams in the hypothesized sentence. The diagnostic power of the use of observed trigrams or their corresponding class trigrams is compared with that of situations based on values of sentence posterior probabilities. In order to avoid or correct errors due to syntactic inconsistence of the recognized sentence, automata, obtained by explanation-based learning, are introduced and used in certain conditions. Semantic Classification Trees are introduced to provide sentence patterns expressing constraints of long distance syntactic coherence. Results on a dialogue corpus provided by France Telecom R&D have shown that starting with a WER of 21.87% on a test set of 1422 sentences, it is possible to subdivide the sentences into three sets characterized by automatically recognized situations. The first one has a coverage of 68% with a WER of 7.44%. The second one has various types of sentences with a WER around 20%. The third one contains 13% of the sentences that should be rejected with a WER around 49%. The second set characterizes sentences that should be processed with particular care by the dialogue interpreter with the possibility of asking a confirmation from the user.
Keywords :
learning (artificial intelligence); linguistics; natural language interfaces; semantic networks; speech recognition; speech-based user interfaces; trees (mathematics); dialogue interpreter; explanation-based learning; human-computer dialogues; language models; latent semantic analysis; learning by analogy; linguistic consistency; long distance syntactic coherence; reasoning-based recognition strategies; recognized sentence syntactic inconsistency; semantic classification trees; sentence patterns; sentence posterior probabilities; trigram analogy; word error rates; Automatic speech recognition; Automatic testing; Character recognition; Classification tree analysis; Error analysis; Error correction; Learning automata; Research and development; Scheduling; Telecommunications;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2003.818318
Filename :
1255462
Link To Document :
بازگشت