• DocumentCode
    2046959
  • Title

    Speech silicon AM: an FPGA-based acoustic modeling pipeline for hidden Markov model based speech recognition

  • Author

    Schuster, Jeffrey W. ; Gupta, Kshitij ; Hoare, Raymond

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Pittsburgh Univ., PA, USA
  • fYear
    2006
  • fDate
    25-29 April 2006
  • Abstract
    This paper presents the design of a FPGA-based hardware co-processor, based on the SPHINX 3 speech recognition engine from CMU; capable of performing acoustic modeling (AM) for medium sized vocabularies in real-time. By creating an input-driven pipeline for performing the calculations, we were able to maximize the throughput of the system while simultaneously minimizing the number of pipeline stalls. Use advanced placement techniques enabled post place-and-route speeds even greater than those necessary for real-time operation while operating at maximum workload. Further, by using input control vectors all FSMs were removed from the design, greatly increasing the flexibility of the design. These results combined with the ability to reprogram the system for different recognition tasks serve to create a system capable of in a vast array of environments. Synthesis to both Xilinx Virtex 4 and Spartan 3 FPGAs helps to further characterize the flexibility of the architecture.
  • Keywords
    acoustic signal processing; field programmable gate arrays; hidden Markov models; speech recognition; FPGA-based acoustic modeling pipeline; FPGA-based hardware coprocessor; SPHINX 3 speech recognition engine; Spartan 3 FPGA; Xilinx Virtex 4; hidden Markov model; input control vector; input-driven pipeline; post place-and-route speed; speech silicon acoustic modeling; Control system synthesis; Coprocessors; Engines; Hardware; Hidden Markov models; Pipelines; Silicon; Speech recognition; Throughput; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
  • Print_ISBN
    1-4244-0054-6
  • Type

    conf

  • DOI
    10.1109/IPDPS.2006.1639473
  • Filename
    1639473