مرکز منطقه ای اطلاع رساني علوم و فناوري - French large vocabulary recognition with cross-word phonology transducers

DocumentCode :

353709

Title :

French large vocabulary recognition with cross-word phonology transducers

Author :

Boulianne, G. ; Brousseau, J. ; Ouellet, P. ; Dumouchel, P.

Author_Institution :

Centre de Recherche Inf. de Montreal, Que., Canada

Volume :

fYear :

2000

fDate :

2000

Firstpage :

1675

Abstract :

Although finite-state transducers have been widely used in linguistics, their application to speech recognition has begun only recently (M. Mohri, 1997). We describe our implementation of French large vocabulary recognition based on transducers, and how we take advantage of this approach to integrate automatic pronunciation rules and cross-word phenomena such as French “liaison”. We also show that a simple, single-level Viterbi algorithm can efficiently decode speech recognition transducers and handle cross-word context models and cross-word phonological rules. In our experiments we compared network size, error rate and decoding speed of our transducer based recognizer against a baseline HTK recognizer, on a large vocabulary French dictation task. Transducers reduced search time by a factor of 25 compared to our HTK recognizer. We also examined the effect of automated pronunciation rules, and their combination with crossword phonological rules that control “liaison”. We obtained a 23% relative reduction in the word error rate on a 5000 word task

Keywords :

Viterbi decoding; finite state machines; natural languages; speech recognition; transducers; word processing; French dictation task; French large vocabulary recognition; automated pronunciation rules; automatic pronunciation rules; baseline HTK recognizer; cross-word context models; cross-word phenomena; cross-word phonological rules; cross-word phonology transducers; crossword phonological rules; decoding speed; error rate; finite-state transducers; linguistics; network size; search time; single-level Viterbi algorithm; speech recognition transducers; transducer based recognizer; word error rate; Acoustic transducers; Automata; Context modeling; Costs; Dictionaries; Error analysis; Hidden Markov models; Natural languages; Speech recognition; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on

Conference_Location :

Istanbul

ISSN :

1520-6149

Print_ISBN :

0-7803-6293-4

Type :

conf

DOI :

10.1109/ICASSP.2000.862072

Filename :

862072

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=353709