• DocumentCode
    2937703
  • Title

    Rethinking of computation for future-generation, knowledge-rich speech recognition and understanding

  • Author

    Deng, Li

  • Author_Institution
    Microsoft Res., Redmond, WA, USA
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    1801
  • Lastpage
    1804
  • Abstract
    A new trend is emerging in the semiconductor industry that future computation speedups will likely come more from parallelism than from having faster individual computing elements. Most algorithm designers for the current, HMM-based speech recognition systems, which have the recognition performance significantly lower than that of human, have not embraced this trend. This is partly attributed to the state-of-the-art sequential algorithms that have involved extremely clever schemes to speed up single-processor performance developed and matured over many years. This invited presentation advances two arguments. First, much more powerful speech systems in the future generations will likely approach human performance with new architectures that integrate rich knowledge sources and overcome the reasonably well understood limitations of the current HMM-based systems. Second, the success of the above endeavor will require complete rethinking of computation issues, likely disposing of the traditional thinking of HMM-centric sequential processing and embracing parallel computing in the new architectures mimicking key aspects of the human speech processing system. Four case studies are provided in this paper extracted from some recent influential work that may shape the foundation of this potentially active research area.
  • Keywords
    hidden Markov models; parallel algorithms; speech processing; speech recognition; HMM; computation rethinking; human speech processing system; knowledge-rich speech recognition system; knowledge-rich speech understanding; parallel computing; semiconductor industry; state-of-the-art sequential algorithm; Automatic speech recognition; Computer architecture; Concurrent computing; Decoding; Electronics industry; Hidden Markov models; Humans; Parallel processing; Speech processing; Speech recognition; computation; decoding; knowledge integration.; parallelism; speech recognition; speech understanding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2009.5202872
  • Filename
    5202872