• DocumentCode
    353620
  • Title

    Fast decoding in large vocabulary name dialing

  • Author

    Suontausta, Janne ; Hakkinen, Juha ; Viikki, Olli

  • Author_Institution
    Speech & Audio Syst. Lab., Nokia Res. Center, Tampere, Finland
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1535
  • Abstract
    Fast decoding is a key challenge in virtually all practical real-time speech recognition systems, since model decoding is still by far the most time-consuming operation in automatic speech recognition (ASR) systems. In current speech recognizers, there is typically a trade-off between the desired vocabulary size, the processing power available for speech recognition, and the recognition accuracy. Fast decoding methods are often needed in order to meet the real-time requirements set for a system. The use of these methods should, of course, not degrade the recognition accuracy. In this paper, we investigate the performance of efficient decoding methods in large vocabulary name dialing. A tree-structured lexicon, fast observation probability evaluation, and adaptive Viterbi beam search are developed and integrated in a name dialing system. The system is tested with lexicons ranging from 100 to 3000 entries. With a lexicon of 1000 words, the fast decoding methods speed up the system by 282%. The speed-up degrades the recognition accuracy by as little as 0.95%.
  • Keywords
    Viterbi decoding; hidden Markov models; search problems; speech coding; speech recognition; telephony; ASR; adaptive Viterbi beam search; automatic speech recognition; fast decoding; fast observation probability evaluation; large vocabulary name dialing; processing power; real-time requirements; real-time speech recognition systems; recognition accuracy; speech recognition; tree-structured lexicon; vocabulary size; Automatic speech recognition; Decoding; Degradation; Power system modeling; Real time systems; Speech processing; Speech recognition; System testing; Viterbi algorithm; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.861951
  • Filename
    861951