Title :
Single-Channel Multitalker Speech Recognition
Author :
Rennie, Steven J. ; Hershey, John R. ; Olsen, Peder A.
Abstract :
We have described some of the problems with modeling mixed acoustic signals in the log spectral domain using graphical models, as well as some current approaches to handling these problems for multitalker speech separation and recognition. We have also reviewed methods for inference on FHMMs (factorial hidden Markov model) and methods for handling the nonlinear interaction function in the log spectral domain. These methods are capable of separating and recognizing speech better than human listeners on the SSC task.
Keywords :
acoustic signal processing; hidden Markov models; spectral analysis; speech recognition; speech synthesis; SSC task; factorial hidden Markov model; graphical model; log spectral domain; mixed acoustic signal; multitalker speech separation; nonlinear interaction function; single-channel multitalker speech recognition; Acoustics; Complexity theory; Computational modeling; Data models; Hidden Markov models; Speech recognition;
Journal_Title :
Signal Processing Magazine, IEEE
DOI :
10.1109/MSP.2010.938081