DocumentCode
2997458
Title
Spectral movement function and its application to speech recognition
Author
Aikawa, Kiyoaki ; Furui, Sadaoki
Author_Institution
NTT Human Interface Lab., Tokyo, Japan
fYear
1988
fDate
11-14 Apr 1988
Firstpage
223
Abstract
The authors propose a new mathematical method for extracting spectral movement in a time-sequence of speech spectrum. The spectral movement is characterized by the time and frequency derivative of a time-sequence of log spectrum envelopes. Spectral movement direction, the movement toward a higher or lower frequency region, can be identified by the sign of the proposed function. A parameter which can be used for speech segmentation is derived form this function A distance measure for speech recognition is also derived as the Euclidean distance between two spectral movement patterns extracted by the proposed function. This distance is easily calculated using cepstrum coefficients. Speech recognition results using dynamic time warping template matching with this new distance measure indicate that recognition error rate can be reduced to less than half compared with the conventional Euclidean cepstrum distance measure
Keywords
speech recognition; Euclidean distance; cepstrum coefficients; distance measure; dynamic time warping template matching; error rate; log spectrum envelopes; phoneme segmentation; spectral movement function; speech recognition; speech segmentation; time-sequence; Cepstrum; Data mining; Euclidean distance; Frequency; Humans; Laboratories; Nonlinear filters; Speech recognition; Time measurement; Tracking;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location
New York, NY
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1988.196554
Filename
196554
Link To Document