DocumentCode
3521076
Title
Improving speech recognition accuracy with contextual phonemes and MMI training
Author
Derouault, A.-M.; Merialdo, Bernard
Author_Institution
IBM France Scientific Center, Paris, France
fYear
1989
fDate
23-26 May 1989
Firstpage
116
Abstract
The authors experiment with a combination of two methods previously proposed to improve the performance of their speech recognition system. One method is based on the definition of an improved system of phonetic units, which takes into account the most important coarticulation effects. This system has been defined using knowledge about coarticulation, and by studying the errors of a standard phonetic system. The second method is based on the use of maximum mutual information (MMI) as a criterion in the training phase of the speech recognition system. MMI is designed to maximize the probability of the correct text versus the other possible texts, and it is expected to provide better discrimination of the correct text than the standard maximum-likelihood criterion. These methods have been tested independently on phonetic recognition, and each of them improved the recognition accuracy of the system. Results of recognition experiments that combine the two methods are presented and discussed. They show that this combination improves the average recognition rate, both for phonetic and word recognition.
Keywords
speech recognition; MMI training; coarticulation effects; contextual phonemes; maximum mutual information; phonetic recognition; recognition accuracy; word recognition; Computer hacking; Context modeling; Dentistry; Loudspeakers; Mutual information; Natural languages; Prototypes; System testing; Text recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
1989 International Conference on Acoustics, Speech, and Signal Processing (ICASSP-89)
Conference_Location
Glasgow
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1989.266377
Filename
266377