مرکز منطقه ای اطلاع رساني علوم و فناوري - Feature enhancement by speaker-normalized splice for robust speech recognition

DocumentCode :

3426385

Title :

Feature enhancement by speaker-normalized splice for robust speech recognition

Author :

Shinohara, Yusuke ; Masuko, Takashi ; Akamine, Masami

Author_Institution :

Toshiba Corp. R&D Center 1, Kawasaki

fYear :

2008

fDate :

March 31 2008-April 4 2008

Firstpage :

4881

Lastpage :

4884

Abstract :

The SPLICE method of feature enhancement is known for its powerful performance. It learns a mapping from noisy to clean feature vectors given a set of stereo training data. However, feature vector variation caused by speaker changes conceals noise-induced variation, which is what we want to find in the SPLICE training. In this paper, an improvement of SPLICE by means of speaker-normalization is proposed. The training data is first normalized with respect to speaker variation, and a mapping is learned afterward. CMLLR with a GMM as its target is utilized for the speaker-normalization, where the GMM representing a standard speaker is learned via a novel variant of the speaker adaptive training. The proposed method was evaluated on Aurora2, and achieved a relative word error rate reduction of 38% over the conventional SPLICE.

Keywords :

signal denoising; speech recognition; SPLICE method; feature vector variation; robust speech recognition; speaker-normalized splice; Degradation; Error analysis; Noise robustness; Phase noise; Piecewise linear techniques; Research and development; Speech recognition; Training data; Vectors; Working environment noise; Feature enhancement; SPLICE; robust speech recognition; speaker adaptive training; speaker normalization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location :

Las Vegas, NV

ISSN :

1520-6149

Print_ISBN :

978-1-4244-1483-3

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2008.4518751

Filename :

4518751

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3426385