Single-Channel Multitalker Speech Recognition

Author

Rennie, Steven J. ; Hershey, John R. ; Olsen, Peder A.

Volume

Issue

fYear

2010

Firstpage

Lastpage

Abstract

We have described some of the problems with modeling mixed acoustic signals in the log spectral domain using graphical models, as well as some current approaches to handling these problems for multitalker speech separation and recognition. We have also reviewed methods for inference on FHMMs (factorial hidden Markov model) and methods for handling the nonlinear interaction function in the log spectral domain. These methods are capable of separating and recognizing speech better than human listeners on the SSC task.

Keywords

acoustic signal processing; hidden Markov models; spectral analysis; speech recognition; speech synthesis; SSC task; factorial hidden Markov model; graphical model; log spectral domain; mixed acoustic signal; multitalker speech separation; nonlinear interaction function; single-channel multitalker speech recognition; Acoustics; Complexity theory; Computational modeling; Data models; Hidden Markov models; Speech recognition;

fLanguage

English

Journal_Title

Signal Processing Magazine, IEEE

Publisher

ieee

ISSN

1053-5888

Type

jour

DOI

10.1109/MSP.2010.938081

Filename

5563101

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=1312840