DocumentCode :
1468891
Title :
Quantization of cepstral parameters for speech recognition over the World Wide Web
Author :
Digalakis, Vassilios V. ; Neumeyer, Leonardo G. ; Perakakis, Manolis
Author_Institution :
Dept. of Electron. & Comput. Eng., Tech. Univ., Heraklion, Greece
Volume :
17
Issue :
1
fYear :
1999
fDate :
1/1/1999 12:00:00 AM
Firstpage :
82
Lastpage :
90
Abstract :
We examine alternative architectures for a client-server model of speech-enabled applications over the World Wide Web (WWW). We compare a server-only processing model where the client encodes and transmits the speech signal to the server, to a model where the recognition front end runs locally at the client and encodes and transmits the cepstral coefficients to the recognition server over the Internet. We follow a novel encoding paradigm, trying to maximize recognition performance instead of perceptual reproduction, and we find that by transmitting the cepstral coefficients we can achieve significantly higher recognition performance at a fraction of the bit rate required when encoding the speech signal directly. We find that the required bit rate to achieve the recognition performance of high-quality unquantized speech is just 2000 bits per second
Keywords :
Internet; cepstral analysis; client-server systems; information resources; speech coding; speech recognition; 2000 bit/s; Internet; WWW; World Wide Web; bit rate; cepstral coefficients; cepstral parameters quantization; client-server model; encoding paradigm; high-quality unquantized speech; recognition front end; recognition performance; server-only processing model; speech coding; speech recognition; speech signal transmission; speech-enabled applications; Bit rate; Cepstral analysis; Encoding; Quantization; Service oriented architecture; Speech processing; Speech recognition; Web server; Web sites; World Wide Web;
fLanguage :
English
Journal_Title :
Selected Areas in Communications, IEEE Journal on
Publisher :
ieee
ISSN :
0733-8716
Type :
jour
DOI :
10.1109/49.743698
Filename :
743698
Link To Document :
بازگشت