• DocumentCode
    3007292
  • Title

    A computational model for separating two simultaneous talkers

  • Author

    Weintraub, Mitchel

  • Author_Institution
    SRI International, Menlo Park, CA
  • Volume
    11
  • fYear
    1986
  • fDate
    31503
  • Firstpage
    81
  • Lastpage
    84
  • Abstract
    This paper describes a computational model that attempts to separate two simultaneous talkers. The goal of this model is to improve a speech recognition system´s ability to recognize what each of the two talkers say. The model consists of the following stages: (1) an iterative dynamic programming algorithm to track the pitch period for each of the two talkers, (2) a Markov model to determine the characteristics (e.g. voiced-unvoiced) of each speaker´s voice, (3) a recursive algorithm that uses both local periodicity information and local spectral continuity constraints to compute a spectral estimate of each talker, (4) a resynthesis algorithm to convert the spectral estimate of each talker into a speech waveform, and (5) a speaker-independent continuous-digit-recognition system that attempts to recognize what each of two talkers is saying. The system was trained and tested on a database of simultaneous digit strings spoken by a male and female talker. An evaluation of the different stages of this model is presented.
  • Keywords
    Computational modeling; Databases; Dynamic programming; Frequency; Heuristic algorithms; Iterative algorithms; Psychoacoustic models; Recursive estimation; Speech recognition; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1986.1169115
  • Filename
    1169115