• DocumentCode
    353709
  • Title

    French large vocabulary recognition with cross-word phonology transducers

  • Author

    Boulianne, G. ; Brousseau, J. ; Ouellet, P. ; Dumouchel, P.

  • Author_Institution
    Centre de Recherche Inf. de Montreal, Que., Canada
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1675
  • Abstract
    Although finite-state transducers have been widely used in linguistics, their application to speech recognition has begun only recently (M. Mohri, 1997). We describe our implementation of French large vocabulary recognition based on transducers, and how we take advantage of this approach to integrate automatic pronunciation rules and cross-word phenomena such as French “liaison”. We also show that a simple, single-level Viterbi algorithm can efficiently decode speech recognition transducers and handle cross-word context models and cross-word phonological rules. In our experiments we compared network size, error rate and decoding speed of our transducer based recognizer against a baseline HTK recognizer, on a large vocabulary French dictation task. Transducers reduced search time by a factor of 25 compared to our HTK recognizer. We also examined the effect of automated pronunciation rules, and their combination with crossword phonological rules that control “liaison”. We obtained a 23% relative reduction in the word error rate on a 5000 word task
  • Keywords
    Viterbi decoding; finite state machines; natural languages; speech recognition; transducers; word processing; French dictation task; French large vocabulary recognition; automated pronunciation rules; automatic pronunciation rules; baseline HTK recognizer; cross-word context models; cross-word phenomena; cross-word phonological rules; cross-word phonology transducers; crossword phonological rules; decoding speed; error rate; finite-state transducers; linguistics; network size; search time; single-level Viterbi algorithm; speech recognition transducers; transducer based recognizer; word error rate; Acoustic transducers; Automata; Context modeling; Costs; Dictionaries; Error analysis; Hidden Markov models; Natural languages; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.862072
  • Filename
    862072