DocumentCode
1468891
Title
Quantization of cepstral parameters for speech recognition over the World Wide Web
Author
Digalakis, Vassilios V. ; Neumeyer, Leonardo G. ; Perakakis, Manolis
Author_Institution
Dept. of Electron. & Comput. Eng., Tech. Univ., Heraklion, Greece
Volume
17
Issue
1
fYear
1999
fDate
1/1/1999 12:00:00 AM
Firstpage
82
Lastpage
90
Abstract
We examine alternative architectures for a client-server model of speech-enabled applications over the World Wide Web (WWW). We compare a server-only processing model where the client encodes and transmits the speech signal to the server, to a model where the recognition front end runs locally at the client and encodes and transmits the cepstral coefficients to the recognition server over the Internet. We follow a novel encoding paradigm, trying to maximize recognition performance instead of perceptual reproduction, and we find that by transmitting the cepstral coefficients we can achieve significantly higher recognition performance at a fraction of the bit rate required when encoding the speech signal directly. We find that the required bit rate to achieve the recognition performance of high-quality unquantized speech is just 2000 bits per second
Keywords
Internet; cepstral analysis; client-server systems; information resources; speech coding; speech recognition; 2000 bit/s; Internet; WWW; World Wide Web; bit rate; cepstral coefficients; cepstral parameters quantization; client-server model; encoding paradigm; high-quality unquantized speech; recognition front end; recognition performance; server-only processing model; speech coding; speech recognition; speech signal transmission; speech-enabled applications; Bit rate; Cepstral analysis; Encoding; Quantization; Service oriented architecture; Speech processing; Speech recognition; Web server; Web sites; World Wide Web;
fLanguage
English
Journal_Title
Selected Areas in Communications, IEEE Journal on
Publisher
ieee
ISSN
0733-8716
Type
jour
DOI
10.1109/49.743698
Filename
743698
Link To Document