DocumentCode
1312840
Title
Single-Channel Multitalker Speech Recognition
Author
Rennie, Steven J. ; Hershey, John R. ; Olsen, Peder A.
Volume
27
Issue
6
fYear
2010
Firstpage
66
Lastpage
80
Abstract
We have described some of the problems with modeling mixed acoustic signals in the log spectral domain using graphical models, as well as some current approaches to handling these problems for multitalker speech separation and recognition. We have also reviewed methods for inference on FHMMs (factorial hidden Markov model) and methods for handling the nonlinear interaction function in the log spectral domain. These methods are capable of separating and recognizing speech better than human listeners on the SSC task.
Keywords
acoustic signal processing; hidden Markov models; spectral analysis; speech recognition; speech synthesis; SSC task; factorial hidden Markov model; graphical model; log spectral domain; mixed acoustic signal; multitalker speech separation; nonlinear interaction function; single-channel multitalker speech recognition; Acoustics; Complexity theory; Computational modeling; Data models; Hidden Markov models; Speech recognition;
fLanguage
English
Journal_Title
Signal Processing Magazine, IEEE
Publisher
ieee
ISSN
1053-5888
Type
jour
DOI
10.1109/MSP.2010.938081
Filename
5563101
Link To Document