A computational model for separating two simultaneous talkers

Author

Weintraub, Mitchel

Author_Institution

SRI International, Menlo Park, CA

Volume

11

fYear

1986

fDate

Apr 1986

Firstpage

81

Lastpage

84

Abstract

This paper describes a computational model that attempts to separate two simultaneous talkers. The goal of this model is to improve a speech recognition system's ability to recognize what each of the two talkers says. The model consists of the following stages: (1) an iterative dynamic programming algorithm to track the pitch period for each of the two talkers, (2) a Markov model to determine the characteristics (e.g., voiced-unvoiced) of each speaker's voice, (3) a recursive algorithm that uses both local periodicity information and local spectral continuity constraints to compute a spectral estimate of each talker, (4) a resynthesis algorithm to convert the spectral estimate of each talker into a speech waveform, and (5) a speaker-independent continuous-digit-recognition system that attempts to recognize what each of the two talkers is saying. The system was trained and tested on a database of simultaneous digit strings spoken by a male and a female talker. An evaluation of the different stages of this model is presented.
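
As a rough illustration of stage (1), the sketch below tracks a single pitch contour by dynamic programming over per-frame candidate costs. It is not the paper's algorithm: the abstract does not give the cost function, the iteration scheme, or how the two talkers are handled jointly. The function name `track_pitch`, the `cost` matrix layout, and the `jump_penalty` parameter are all assumptions made for illustration.

```python
def track_pitch(cost, jump_penalty=1.0):
    """Return the lowest-cost path of candidate indices, one per frame.

    cost[t][k] is a hypothetical local cost of pitch candidate k at
    frame t (lower is better). A transition from candidate j to
    candidate k is charged jump_penalty * |k - j|, which favours
    smooth pitch contours. Both choices are illustrative assumptions.
    """
    n_cands = len(cost[0])
    total = list(cost[0])   # best accumulated cost ending in each candidate
    back = []               # back-pointers, one list per frame after the first

    for frame in cost[1:]:
        new_total, ptr = [], []
        for k in range(n_cands):
            # Best predecessor for candidate k at this frame.
            best_j = min(
                range(n_cands),
                key=lambda j: total[j] + jump_penalty * abs(k - j),
            )
            new_total.append(
                frame[k] + total[best_j] + jump_penalty * abs(k - best_j)
            )
            ptr.append(best_j)
        total = new_total
        back.append(ptr)

    # Trace back from the cheapest final candidate.
    k = min(range(n_cands), key=lambda j: total[j])
    path = [k]
    for ptr in reversed(back):
        k = ptr[k]
        path.append(k)
    path.reverse()
    return path
```

With a nonzero `jump_penalty`, a single frame whose local cost favours a different candidate is smoothed over; with `jump_penalty=0` the path simply follows the per-frame minimum. A two-talker version would need a second track and some way to assign candidates to talkers, which this sketch does not attempt.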

Keywords

Computational modeling; Databases; Dynamic programming; Frequency; Heuristic algorithms; Iterative algorithms; Psychoacoustic models; Recursive estimation; Speech recognition; System testing

fLanguage

English

Publisher

IEEE

Conference_Titel

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.

Type

conf

DOI

10.1109/ICASSP.1986.1169115

Filename

1169115