• DocumentCode
    1468891
  • Title

    Quantization of cepstral parameters for speech recognition over the World Wide Web

  • Author

    Digalakis, Vassilios V. ; Neumeyer, Leonardo G. ; Perakakis, Manolis

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Tech. Univ., Heraklion, Greece
  • Volume
    17
  • Issue
    1
  • fYear
    1999
  • fDate
    1/1/1999 12:00:00 AM
  • Firstpage
    82
  • Lastpage
    90
  • Abstract
    We examine alternative architectures for a client-server model of speech-enabled applications over the World Wide Web (WWW). We compare a server-only processing model where the client encodes and transmits the speech signal to the server, to a model where the recognition front end runs locally at the client and encodes and transmits the cepstral coefficients to the recognition server over the Internet. We follow a novel encoding paradigm, trying to maximize recognition performance instead of perceptual reproduction, and we find that by transmitting the cepstral coefficients we can achieve significantly higher recognition performance at a fraction of the bit rate required when encoding the speech signal directly. We find that the required bit rate to achieve the recognition performance of high-quality unquantized speech is just 2000 bits per second
  • Keywords
    Internet; cepstral analysis; client-server systems; information resources; speech coding; speech recognition; 2000 bit/s; Internet; WWW; World Wide Web; bit rate; cepstral coefficients; cepstral parameters quantization; client-server model; encoding paradigm; high-quality unquantized speech; recognition front end; recognition performance; server-only processing model; speech coding; speech recognition; speech signal transmission; speech-enabled applications; Bit rate; Cepstral analysis; Encoding; Quantization; Service oriented architecture; Speech processing; Speech recognition; Web server; Web sites; World Wide Web;
  • fLanguage
    English
  • Journal_Title
    Selected Areas in Communications, IEEE Journal on
  • Publisher
    ieee
  • ISSN
    0733-8716
  • Type

    jour

  • DOI
    10.1109/49.743698
  • Filename
    743698