DocumentCode :
1090425
Title :
Dynamic programming algorithm optimization for spoken word recognition
Author :
Sakoe, Hiroaki ; Chiba, Seibi
Author_Institution :
Nippon Electric Company, Limited, Kawasaki, Japan
Volume :
26
Issue :
1
fYear :
1978
fDate :
2/1/1978 12:00:00 AM
Firstpage :
43
Lastpage :
49
Abstract :
This paper reports on an optimum dynamic progxamming (DP) based time-normalization algorithm for spoken word recognition. First, a general principle of time-normalization is given using time-warping function. Then, two time-normalized distance definitions, called symmetric and asymmetric forms, are derived from the principle. These two forms are compared with each other through theoretical discussions and experimental studies. The symmetric form algorithm superiority is established. A new technique, called slope constraint, is successfully introduced, in which the warping function slope is restricted so as to improve discrimination between words in different categories. The effective slope constraint characteristic is qualitatively analyzed, and the optimum slope constraint condition is determined through experiments. The optimized algorithm is then extensively subjected to experimental comparison with various DP-algorithms, previously applied to spoken word recognition by different research groups. The experiment shows that the present algorithm gives no more than about two-thirds errors, even compared to the best conventional algorithm.
Keywords :
Acoustics; Constraint optimization; Dynamic programming; Feature extraction; Fluctuations; Heuristic algorithms; Pattern matching; Signal processing algorithms; Speech processing; Timing;
fLanguage :
English
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
0096-3518
Type :
jour
DOI :
10.1109/TASSP.1978.1163055
Filename :
1163055
Link To Document :
بازگشت