DocumentCode :
976327
Title :
Structural methods in automatic speech recognition
Author :
Levinson, Stephen E.
Author_Institution :
AT&T Bell Laboratories, Murray Hill, NJ, USA
Volume :
73
Issue :
11
Year :
1985
Firstpage :
1625
Lastpage :
1650
Abstract :
The past decade has witnessed substantial progress toward the goal of constructing a machine capable of understanding colloquial discourse. Central to this progress has been the development and application of mathematical methods that permit modeling the speech signal as a complex code with several coexisting levels of structure. The most successful of these are "template matching," stochastic modeling, and probabilistic parsing. The manifestation of common themes such as dynamic programming and finite-state descriptions accentuates a superficial likeness amongst the methods which is often mistaken for the deeper similarity arising from their shared Bayesian foundation. In this paper, we outline the mathematical bases of these methods, invariant metrics, hidden Markov chains, and formal grammars, respectively. We then recount and briefly interpret the results of experiments in speech recognition to which the various methods were applied. Since these mathematical principles seem to bear little resemblance to traditional linguistic characterizations of speech, the success of the experiments is occasionally attributed, even by their authors, merely to excellent engineering. We conclude by speculating that, quite to the contrary, these methods actually constitute a powerful theory of speech that can be reconciled with and elucidate conventional linguistic theories while being used to build truly competent mechanical speech recognizers.
Keywords :
Automatic speech recognition; Bayesian methods; Dynamic programming; Hidden Markov models; Mathematical model; Natural languages; Power engineering and energy; Speech coding; Speech recognition; Stochastic processes;
Language :
English
Journal_Title :
Proceedings of the IEEE
Publisher :
IEEE
ISSN :
0018-9219
Type :
jour
DOI :
10.1109/PROC.1985.13344
Filename :
1457612