• DocumentCode
    2298321
  • Title

    Reducing computational complexity and response latency through the detection of contentless frames

  • Author

    Sukkar, Rafid A. ; Herman, Shawn M. ; Setlur, Anand R. ; Mitchell, Carl D.

  • Author_Institution
    Lucent Technol., Naperville, IL, USA
  • Volume
    6
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    3751
  • Abstract
    In this paper, we present a method that manipulates the decoding network to reduce both computational complexity and response latency while maintaining high ASR accuracy. The method employs a tree-structured vector quantization (TSVQ) classifier that reliably discriminates between silence and non-silence frames. Reductions in computational complexity and response latency are achieved through three techniques: 1) silence skipping, 2) silence-based pruning of the dynamic programming network, and 3) early decision. Experimental results on a connected digit task and a large-vocabulary company name task show that the proposed method can reduce ASR response latency by more than 82%. Furthermore, the computational complexity, measured in CPU seconds, was reduced by 13.6% on the connected digit task and 6.7% on the company name task while maintaining the recognition accuracy of the baseline system.
  • Keywords
    computational complexity; decision theory; decoding; dynamic programming; speech coding; speech recognition; vector quantisation; ASR accuracy; TSVQ classifier; automatic speech recognition; connected digit task; contentless frames; decoding network; dynamic programming network; early decision; large vocabulary company name task; nonsilence frames; recognition accuracy; response latency; silence frames; silence skipping; silence-based pruning; tree structured vector quantization; Computer networks; Delay; Hidden Markov models; Maintenance; Real time systems
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.860218
  • Filename
    860218
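
  • Illustrative sketch (not part of the original record)

    The abstract names a tree-structured VQ classifier and "silence skipping" but gives no implementation detail. The following is a minimal sketch of that general idea, assuming a binary tree whose leaves carry a silence/speech label and assuming that skipped frames are simply withheld from the decoder. The names `Node`, `classify`, `skip_silence`, and `TOY_TREE`, the one-dimensional features, and all centroid values are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Node:
    """One node of a hypothetical binary TSVQ tree."""
    centroid: Sequence[float]
    label: Optional[str] = None      # set only on leaf nodes
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def classify(root: Node, frame: Sequence[float]) -> str:
    """Descend the tree toward the closer child centroid until a leaf is reached."""
    node = root
    while node.label is None:
        if sq_dist(frame, node.left.centroid) <= sq_dist(frame, node.right.centroid):
            node = node.left
        else:
            node = node.right
    return node.label


def skip_silence(root: Node, frames: List[Sequence[float]]) -> List[Sequence[float]]:
    """Keep only frames not labeled as silence (assumed form of silence skipping)."""
    return [f for f in frames if classify(root, f) != "silence"]


# Toy tree over a made-up 1-D "energy" feature; values are illustrative only.
TOY_TREE = Node(
    centroid=[0.5],
    left=Node(centroid=[0.1], label="silence"),
    right=Node(centroid=[0.9], label="speech"),
)

frames = [[0.05], [0.8], [0.95], [0.1], [0.7]]
kept = skip_silence(TOY_TREE, frames)
print(f"decoder receives {len(kept)} of {len(frames)} frames")
```

    A tree search costs only a few distance computations per frame, which is one plausible reason such a classifier can run ahead of the decoder. The silence-based pruning and early-decision techniques listed in the abstract are not sketched here, since the record does not describe how they work.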