• DocumentCode
    2279564
  • Title

    Distributed speech recognition with codec parameters

  • Author

    Raj, Bhiksha ; Migdal, Joshua ; Singh, Rita

  • Author_Institution
    Mitsubishi Electr. Res. Labs., Cambridge, MA, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    127
  • Lastpage
    130
  • Abstract
    Communication devices which perform distributed speech recognition (DSR) tasks currently transmit standardized coded parameters of speech signals. Recognition features are extracted from signals reconstructed using these on a remote server. Since reconstruction losses degrade recognition performance, proposals are being considered to standardize DSR-codecs which derive recognition features, to be transmitted and used directly for recognition. However, such a codec must be embedded on the transmitting device, along with its current standard codec. Performing recognition using codec bitstreams avoids these complications: no additional feature-extraction mechanism is required on the device, and there are no reconstruction losses on the server. We propose an LDA-based method for extracting optimal feature sets from codec bitstreams and demonstrate that features so derived result in improved recognition performance for the LPC, GSM and CELP codecs. For GSM and CELP, we show that the performance is comparable to that with uncoded speech and standard DSR-codec features.
  • Keywords
    cellular radio; distributed processing; feature extraction; linear predictive coding; speech codecs; speech coding; speech recognition; CELP; GSM; LPC; codec parameters; coded parameters; distributed speech recognition; feature extraction; linear discriminant analysis; signal reconstruction; Code standards; Codecs; Degradation; Feature extraction; GSM; Linear predictive coding; Performance loss; Propagation losses; Proposals; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Print_ISBN
    0-7803-7343-X
  • Type

    conf

  • DOI
    10.1109/ASRU.2001.1034604
  • Filename
    1034604