مرکز منطقه ای اطلاع رساني علوم و فناوري - Distributed microphone array processing for speech source separation with classifier fusion

DocumentCode :

2173227

Title :

Distributed microphone array processing for speech source separation with classifier fusion

Author :

Souden, Mehrez ; Kinoshita, Keisuke ; Delcroix, Marc ; Nakatani, Tomohiro

Author_Institution :

NTT Commun. Sci. Labs., Kyoto, Japan

fYear :

2012

fDate :

23-26 Sept. 2012

Firstpage :

Lastpage :

Abstract :

We propose a new approach for clustering and separating competing speech signals using a distributed microphone array (DMA). This approach can be viewed as an extension of expectation-maximization (EM)-based source separation to DMAs. To achieve distributed processing, we assume the conditional independence (with respect to sources´ activities) of the normalized recordings of different nodes. By doing so, only the posterior probabilities of sources´ activities need to be shared between nodes. Consequently, the EM algorithm is formulated such that at the expectation step, local posterior probabilities are estimated locally and shared between nodes. In the maximization step, every node fuses the received probabilities via either product or sum rules and estimates its local parameters. We show that, even if we make binary decisions (presence/ absence of speech) during EM iterations instead of transmitting continuous posterior probability values, we can achieve separation without causing significant speech distortion. Our preliminary investigations demonstrate that the proposed processing technique approaches the centralized solution and can outperform Oracle best node-wise clustering in terms of objective source separation metrics.

Keywords :

blind source separation; distortion; expectation-maximisation algorithm; microphone arrays; optimisation; pattern clustering; probability; sensor fusion; signal classification; speech processing; binary decision; centralized solution; classifier fusion; competing speech signal clustering; competing speech signal separation; continuous posterior probability value; distributed microphone array processing; expectation-maximization-based source separation; local posterior probability; maximization step; normalized recording; objective source separation metrics; speech distortion; speech source separation; Arrays; Estimation; Microphones; Signal to noise ratio; Speech; Speech processing; Vectors; Distributed microphone array processing; blind source separation; classifier combination; speech clustering;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning for Signal Processing (MLSP), 2012 IEEE International Workshop on

Conference_Location :

Santander

ISSN :

1551-2541

Print_ISBN :

978-1-4673-1024-6

Electronic_ISBN :

1551-2541

Type :

conf

DOI :

10.1109/MLSP.2012.6349782

Filename :

6349782

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2173227