DocumentCode
353709
Title
French large vocabulary recognition with cross-word phonology transducers
Author
Boulianne, G. ; Brousseau, J. ; Ouellet, P. ; Dumouchel, P.
Author_Institution
Centre de Recherche Inf. de Montreal, Que., Canada
Volume
3
fYear
2000
fDate
2000
Firstpage
1675
Abstract
Although finite-state transducers have been widely used in linguistics, their application to speech recognition has begun only recently (M. Mohri, 1997). We describe our implementation of French large vocabulary recognition based on transducers, and how we take advantage of this approach to integrate automatic pronunciation rules and cross-word phenomena such as French “liaison”. We also show that a simple, single-level Viterbi algorithm can efficiently decode speech recognition transducers and handle cross-word context models and cross-word phonological rules. In our experiments we compared network size, error rate and decoding speed of our transducer based recognizer against a baseline HTK recognizer, on a large vocabulary French dictation task. Transducers reduced search time by a factor of 25 compared to our HTK recognizer. We also examined the effect of automated pronunciation rules, and their combination with crossword phonological rules that control “liaison”. We obtained a 23% relative reduction in the word error rate on a 5000 word task
Keywords
Viterbi decoding; finite state machines; natural languages; speech recognition; transducers; word processing; French dictation task; French large vocabulary recognition; automated pronunciation rules; automatic pronunciation rules; baseline HTK recognizer; cross-word context models; cross-word phenomena; cross-word phonological rules; cross-word phonology transducers; crossword phonological rules; decoding speed; error rate; finite-state transducers; linguistics; network size; search time; single-level Viterbi algorithm; speech recognition transducers; transducer based recognizer; word error rate; Acoustic transducers; Automata; Context modeling; Costs; Dictionaries; Error analysis; Hidden Markov models; Natural languages; Speech recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.862072
Filename
862072
Link To Document