Acoustic feature conversion using a polynomial based feature transferring algorithm

Author

Syu-Siang Wang ; Lin, Peng ; Dau-Cheng Lyu ; Yu Tsao ; Hsin-Te Hwang ; Borching Su

Author_Institution

Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan

fYear

2014

fDate

12-14 Sept. 2014

Firstpage

454

Lastpage

458

Abstract

This study proposes a polynomial based feature transferring (PFT) algorithm for acoustic feature conversion. The PFT process consists of an estimation phase and a conversion phase. The estimation phase computes a polynomial based transfer function using only a small set of parallel source and target features. With the estimated transfer function, the conversion phase converts large sets of source features to target ones. This study evaluates the proposed PFT algorithm on a robust automatic speech recognition (ASR) task using the Aurora-2 database. The source features were MFCCs with cepstral mean and variance normalization (CMVN), and the target features were advanced front end (AFE) features. Compared to CMVN, AFE provides better robust speech recognition performance but requires more complicated and computationally expensive feature extraction. With PFT, we aim to use a simple transfer function to obtain AFE-like acoustic features from the source CMVN features. Experimental results on Aurora-2 demonstrate that PFT generates AFE-like features that notably improve on CMVN performance and approach the results achieved by AFE. Furthermore, the recognition accuracy of PFT was better than that of histogram equalization (HEQ) and polynomial based histogram equalization (PHEQ). The results confirm the effectiveness of PFT with just a few sets of parallel features.
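
The two phases described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of the transfer function, so everything here is an assumption made for illustration: an independent least-squares polynomial per feature dimension, the polynomial order of 3, and the hypothetical function names `estimate_transfer` and `convert`. It should not be read as the authors' implementation.

```python
import numpy as np


def estimate_transfer(src, tgt, order=3):
    """Estimation phase (assumed form): fit one polynomial per feature dimension.

    src, tgt: parallel feature matrices of shape (frames, dims), e.g. a small
    set of CMVN-normalized MFCC frames and the matching AFE frames.
    Returns coefficients of shape (dims, order + 1), highest power first.
    """
    src = np.asarray(src, dtype=float)
    tgt = np.asarray(tgt, dtype=float)
    if src.shape != tgt.shape:
        raise ValueError("source and target features must be parallel")
    # Least-squares polynomial fit, done separately for each dimension.
    coeffs = [np.polyfit(src[:, d], tgt[:, d], order) for d in range(src.shape[1])]
    return np.stack(coeffs)


def convert(src, coeffs):
    """Conversion phase: apply the estimated polynomials to new source features."""
    src = np.asarray(src, dtype=float)
    cols = [np.polyval(coeffs[d], src[:, d]) for d in range(src.shape[1])]
    return np.stack(cols, axis=1)
```

In use, `estimate_transfer` would be called once on the small parallel set, and `convert` would then be applied to every remaining source utterance, so the costly target feature extraction is needed only for the parallel set.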

Keywords

acoustic signal processing; feature extraction; polynomials; speech recognition; statistical analysis; AFE; ASR task; Aurora-2 database; CMVN; PFT algorithm; PHEQ; acoustic feature conversion; advanced front end feature; cepstral mean and variance normalization; conversion phase; estimation phase; polynomial based feature transferring algorithm; polynomial based histogram equalization; polynomial based transfer function; robust automatic speech recognition task; Acoustics; Feature extraction; Polynomials; Robustness; Speech; Speech recognition; Transfer functions; acoustic feature conversion; feature transformation; robust feature extraction; robust speech recognition;

fLanguage

English

Publisher

IEEE

Conference_Titel

Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on

Conference_Location

Singapore

Type

conf

DOI

10.1109/ISCSLP.2014.6936632

Filename

6936632