DocumentCode :
353709
Title :
French large vocabulary recognition with cross-word phonology transducers
Author :
Boulianne, G. ; Brousseau, J. ; Ouellet, P. ; Dumouchel, P.
Author_Institution :
Centre de Recherche Inf. de Montreal, Que., Canada
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1675
Abstract :
Although finite-state transducers have been widely used in linguistics, their application to speech recognition has begun only recently (M. Mohri, 1997). We describe our implementation of French large vocabulary recognition based on transducers, and how we take advantage of this approach to integrate automatic pronunciation rules and cross-word phenomena such as French “liaison”. We also show that a simple, single-level Viterbi algorithm can efficiently decode speech recognition transducers and handle cross-word context models and cross-word phonological rules. In our experiments we compared network size, error rate and decoding speed of our transducer based recognizer against a baseline HTK recognizer, on a large vocabulary French dictation task. Transducers reduced search time by a factor of 25 compared to our HTK recognizer. We also examined the effect of automated pronunciation rules, and their combination with crossword phonological rules that control “liaison”. We obtained a 23% relative reduction in the word error rate on a 5000 word task
Keywords :
Viterbi decoding; finite state machines; natural languages; speech recognition; transducers; word processing; French dictation task; French large vocabulary recognition; automated pronunciation rules; automatic pronunciation rules; baseline HTK recognizer; cross-word context models; cross-word phenomena; cross-word phonological rules; cross-word phonology transducers; crossword phonological rules; decoding speed; error rate; finite-state transducers; linguistics; network size; search time; single-level Viterbi algorithm; speech recognition transducers; transducer based recognizer; word error rate; Acoustic transducers; Automata; Context modeling; Costs; Dictionaries; Error analysis; Hidden Markov models; Natural languages; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.862072
Filename :
862072
Link To Document :
بازگشت