DocumentCode
3044384
Title
A composite source model for speaker and isolated word recognition
Author
Fontana, Robert J. ; Fox, Michael S.
Author_Institution
Carnegie-Mellon University, Pittsburgh, Pennsylvania
Volume
6
fYear
1981
fDate
29677
Firstpage
741
Lastpage
745
Abstract
A composite source is an indexed family of random processes (subsources) together with a switch which chooses from among these processes in a stochastic fashion. Such a source has often been proposed as a model for speech and other processes having piece-wise, or quasi, stationary behavior. Until recently, however, very little has been known about such models from either a theoretical or a practical perspective. In this paper, we consider a speaker/isolated word recognition system derived from a composite source model for speech production. In particular, estimates of the underlying subsources are obtained using a modified data compression algorithm. Switch sequences are then derived from these estimates for each utterance. Finally, switch sequences are compared in the time domain (using Levenshtein´s metric) and from a statistical point of view (via variation distance). Both modes of comparison are seen to be highly correlated and produce a recognition procedure with very encouraging results.
Keywords
Data compression; Linear predictive coding; Performance analysis; Production systems; Random processes; Speech analysis; Speech processing; Speech recognition; Stochastic processes; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '81.
Type
conf
DOI
10.1109/ICASSP.1981.1171128
Filename
1171128
Link To Document