مرکز منطقه ای اطلاع رساني علوم و فناوري - Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition

DocumentCode :

1118334

Title :

Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition

Author :

Zhao, Xianyu ; Ou, Zhijian

Author_Institution :

Dept. of Electron. Eng., Tsinghua Univ., Beijing

Volume :

Issue :

fYear :

2007

fDate :

3/1/2007 12:00:00 AM

Firstpage :

1114

Lastpage :

1122

Abstract :

In conventional microphone array speech recognition, the array processor and the speech recognizer are loosely coupled. The only connection between the two modules is the enhanced target signal output from the array processor, which then gets treated as a single input to the recognizer. In this approach, useful environmental information, which can be provided by the array processor and also needs to be exploited by the recognizer, is ignored. Inherently, the array processor can generate multiple outputs of spatially filtered signals, as a multi-input-multi-output (MIMO) module. In this paper, a closely coupled approach is proposed, in which a recognizer with model-based noise compensation exploits the reference noise outputs from a MIMO array processor. Specifically, a multichannel model-based noise compensation is presented, including the compensation procedure using the vector Taylor series (VTS) expansion and parameter estimation using the expectation-maximization (EM) algorithm. It is also shown how to construct MIMO array processors from conventional beamformers. A number of practical implementations of the conventional loosely coupled approach and the proposed closely coupled approach were tested on a publicly available database, the Multichannel Overlapping Number Corpus (MONC). Experimental results showed that the proposed closely coupled approach significantly improved the speech recognition performance in the overlapping speech situations

Keywords :

array signal processing; expectation-maximisation algorithm; filtering theory; microphone arrays; signal denoising; speech recognition; MIMO array processor; closely coupled array processing; enhanced target signal output; expectation-maximization algorithm; microphone array speech recognition; multi-input-multi-output module; multichannel model-based noise compensation; multichannel overlapping number corpus; parameter estimation; spatially filtered signals; vector Taylor series expansion; Array signal processing; MIMO; Microphone arrays; Parameter estimation; Signal generators; Signal processing; Speech processing; Speech recognition; Target recognition; Taylor series; Array signal processing; microphone array; model-based compensation; robust speech recognition;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.881673

Filename :

4100701

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1118334