DocumentCode
744154
Title
Cochannel Speaker Identification in Anechoic and Reverberant Conditions
Author
Xiaojia Zhao ; Yuxuan Wang ; DeLiang Wang
Author_Institution
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Volume
23
Issue
11
fYear
2015
Firstpage
1727
Lastpage
1736
Abstract
Speaker identification (SID) in cochannel speech, where two speakers are talking simultaneously over a single recording channel, is a challenging problem. Previous studies address this problem in the anechoic environment under the Gaussian mixture model (GMM) framework. On the other hand, cochannel SID in reverberant conditions has not been addressed. This paper studies cochannel SID in both anechoic and reverberant conditions. We first investigate GMM-based approaches and propose a combined system that integrates two cochannel SID methods. Second, we explore deep neural networks (DNNs) for cochannel SID and propose a DNN-based recognition system. Evaluation results demonstrate that our proposed systems significantly improve SID performance over recent approaches in both anechoic and reverberant conditions and various target-to-interferer ratios.
Keywords
Gaussian processes; anechoic chambers (acoustic); mixture models; neural nets; reverberation chambers; speaker recognition; DNN; GMM framework; Gaussian mixture model; SID; anechoic conditions; anechoic environment; cochannel SID; cochannel SID methods; cochannel speaker identification; cochannel speech; deep neural networks; reverberant conditions; single recording channel; Gaussian mixture model; Hidden Markov models; IEEE transactions; Speech; Speech processing; Speech recognition; Cochannel speaker identification; Gaussian mixture model (GMM); deep neural network (DNN); reverberation; target-to-interferer ratio;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2015.2447284
Filename
7128346
Link To Document