• DocumentCode
    13174
  • Title

    Large Vocabulary Speech Recognition on Parallel Architectures

  • Author

    Cardinal, Patrick ; Dumouchel, P. ; Boulianne, Gilles

  • Author_Institution
    Dept. of Speech Recognition, Centre de Rech. Inf. de Montreal (CRIM), Montréal, QC, Canada
  • Volume
    21
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    2290
  • Lastpage
    2300
  • Abstract
    The speed of modern processors has remained constant over the last few years but the integration capacity continues to follow Moore´s law and thus, to be scalable, applications must be parallelized. The parallelization of the classical Viterbi beam search has been shown to be very difficult on multi-core processor architectures or massively threaded architectures such as Graphics Processing Unit (GPU). The problem with this approach is that active states are scattered in memory and thus, they cannot be efficiently transferred to the processor memory. This problem can be circumvented by using the A* search which uses a heuristic to significantly reduce the number of explored hypotheses. The main advantage of this algorithm is that the processing time is moved from the search in the recognition network to the computation of heuristic costs, which can be designed to take advantage of parallel architectures. Our parallel implementation of the A* decoder on a 4-core processor with a GPU led to a speed-up factor of 6.13 compared to the Viterbi beam search at its maximum capacity and an improvement of 4% absolute in accuracy at real-time.
  • Keywords
    multiprocessing systems; parallel architectures; speech recognition; vocabulary; A* decoder; A* search; GPU; classical Viterbi beam search; graphics processing unit; integration capacity; large vocabulary speech recognition; multicore processor architectures; parallel architectures; Large scale systems; Optimization; Parallel processing; Pattern recognition; Vocabularies; GPU; multicore; parallel; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2271591
  • Filename
    6547992