مرکز منطقه ای اطلاع رساني علوم و فناوري - A composite source model for speaker and isolated word recognition

DocumentCode :

3044384

Title :

A composite source model for speaker and isolated word recognition

Author :

Fontana, Robert J. ; Fox, Michael S.

Author_Institution :

Carnegie-Mellon University, Pittsburgh, Pennsylvania

Volume :

fYear :

1981

fDate :

29677

Firstpage :

741

Lastpage :

745

Abstract :

A composite source is an indexed family of random processes (subsources) together with a switch which chooses from among these processes in a stochastic fashion. Such a source has often been proposed as a model for speech and other processes having piece-wise, or quasi, stationary behavior. Until recently, however, very little has been known about such models from either a theoretical or a practical perspective. In this paper, we consider a speaker/isolated word recognition system derived from a composite source model for speech production. In particular, estimates of the underlying subsources are obtained using a modified data compression algorithm. Switch sequences are then derived from these estimates for each utterance. Finally, switch sequences are compared in the time domain (using Levenshtein´s metric) and from a statistical point of view (via variation distance). Both modes of comparison are seen to be highly correlated and produce a recognition procedure with very encouraging results.

Keywords :

Data compression; Linear predictive coding; Performance analysis; Production systems; Random processes; Speech analysis; Speech processing; Speech recognition; Stochastic processes; Switches;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '81.

Type :

conf

DOI :

10.1109/ICASSP.1981.1171128

Filename :

1171128

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3044384