• DocumentCode
    3044384
  • Title

    A composite source model for speaker and isolated word recognition

  • Author

    Fontana, Robert J. ; Fox, Michael S.

  • Author_Institution
    Carnegie-Mellon University, Pittsburgh, Pennsylvania
  • Volume
    6
  • fYear
    1981
  • fDate
    29677
  • Firstpage
    741
  • Lastpage
    745
  • Abstract
    A composite source is an indexed family of random processes (subsources) together with a switch which chooses from among these processes in a stochastic fashion. Such a source has often been proposed as a model for speech and other processes having piece-wise, or quasi, stationary behavior. Until recently, however, very little has been known about such models from either a theoretical or a practical perspective. In this paper, we consider a speaker/isolated word recognition system derived from a composite source model for speech production. In particular, estimates of the underlying subsources are obtained using a modified data compression algorithm. Switch sequences are then derived from these estimates for each utterance. Finally, switch sequences are compared in the time domain (using Levenshtein´s metric) and from a statistical point of view (via variation distance). Both modes of comparison are seen to be highly correlated and produce a recognition procedure with very encouraging results.
  • Keywords
    Data compression; Linear predictive coding; Performance analysis; Production systems; Random processes; Speech analysis; Speech processing; Speech recognition; Stochastic processes; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '81.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1981.1171128
  • Filename
    1171128