DocumentCode :
388583
Title :
On the use of transient information in speech recognition
Author :
Lienard, Jean-Sylvain ; Soong, Frank K.
Author_Institution :
Bell Laboratories, Murray Hill, New Jersey
Volume :
9
fYear :
1984
fDate :
30742
Firstpage :
9
Lastpage :
12
Abstract :
In this paper we investigate the effects of signal processing on the performance of isolated-word recognition by changing various time-resolution related parameters. The vocabulary used, { , is a highly confusable subset of the 39-word alpha-digit database. We showed that the recognition performance is significantly improved by trace segmentation which compresses the steady-state parts of speech signals and refines the endpoints. By changing the cutoff frequency of the low-pass filter in the filterbank analysis, we observed the existence of an optimal region of cutoff frequencies ranging from 50 to 100 Hz (at -6 dB). Outside this region, the performance does not deteriorate completely even at a very low cutoff frequency where the transients are severely distorted. This phenomenon was explained by the fact of spectral modification of the steady-state vowels following the initial transients.
Keywords :
Band pass filters; Cutoff frequency; Databases; Filter bank; Linear predictive coding; Sampling methods; Shape control; Speech analysis; Speech recognition; Steady-state;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
Type :
conf
DOI :
10.1109/ICASSP.1984.1172563
Filename :
1172563
Link To Document :
بازگشت