DocumentCode :
336804
Title :
Automatic speech recognition: a communication perspective
Author :
Atal, Bishnu S.
Author_Institution :
AT&T Labs., Florham Park, NJ, USA
Volume :
1
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
457
Abstract :
Speech recognition is usually regarded as a problem in the field of pattern recognition, where one first estimates the probability density function of each pattern to be recognized and then uses Bayes theorem to identify the pattern which provides the highest likelihood for the observed speech data. In this paper, we take a different approach to this problem. In speech recognition, the goal is communication of information by voice and we discuss the basics of speech recognition from a communication perspective. The speech signal at the acoustic level has a bit rate of 64 kb/s but the underlying sound patterns have an information rate of less than 100 b/s. What is the role of this high bit rate at the acoustic level? We discuss the principles of decoding patterns that are submerged in an ocean of seemingly irrelevant information
Keywords :
acoustic signal processing; channel capacity; pattern recognition; probability; speech recognition; 64 kbit/s; Bayes theorem; acoustic level; automatic speech recognition; bit rate; channel capacity; information communication; information rate; observed speech data; probability density function; sound patterns decoding; speech signal; statistical pattern recognition; Application software; Automatic speech recognition; Bit rate; Pattern recognition; Performance analysis; Probability density function; Spectral analysis; Speech analysis; Speech recognition; Underwater acoustics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.758161
Filename :
758161
Link To Document :
بازگشت