Exploring the Use of Speech Features and Their Corresponding Distribution Characteristics for Robust Speech Recognition

Author

Lin, Shih-Hsiang ; Chen, Berlin ; Yeh, Yao-Ming

Author_Institution

Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Normal Univ., Taipei

Volume

17

Issue

1

fYear

2009

Firstpage

84

Lastpage

94

Abstract

The performance of current automatic speech recognition (ASR) systems often deteriorates radically when the input speech is corrupted by various kinds of noise sources. Several methods have been proposed to improve ASR robustness over the last few decades. The related literature can be generally classified into two categories according to whether the methods are directly based on the feature domain or consider some specific statistical feature characteristics. In this paper, we present a polynomial regression approach that has the merit of directly characterizing the relationship between speech features and their corresponding distribution characteristics to compensate for noise interference. The proposed approach and a variant were thoroughly investigated and compared with a few existing noise robustness approaches. All experiments were conducted using the Aurora-2 database and task. The results show that our approaches achieve considerable word error rate reductions over the baseline system and are comparable to most of the conventional robustness approaches discussed in this paper.

Keywords

polynomials; regression analysis; speech recognition; Aurora-2 database; automatic speech recognition; noise interference; polynomial regression approach; robust speech recognition; speech features; statistical feature characteristics; interference; linear discriminant analysis; noise level; noise robustness; spatial databases; speech enhancement; vectors; clustering; histogram equalization; polynomial regression; robustness

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher

IEEE

ISSN

1558-7916

Type

jour

DOI

10.1109/TASL.2008.2007612

Filename

4740142