DocumentCode
134240
Title
Acoustic feature conversion using a polynomial based feature transferring algorithm
Author
Syu-Siang Wang ; Lin, Peng ; Dau-Cheng Lyu ; Yu Tsao ; Hsin-Te Hwang ; Borching Su
Author_Institution
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
454
Lastpage
458
Abstract
This study proposes a polynomial based feature transferring (PFT) algorithm for acoustic feature conversion. The PFT process consists of an estimation phase and a conversion phase. The estimation phase computes a polynomial based transfer function using only a small set of parallel source and target features. The conversion phase then applies the estimated transfer function to convert large sets of source features into target features. This study evaluates the proposed PFT algorithm on a robust automatic speech recognition (ASR) task using the Aurora-2 database. The source features were MFCCs with cepstral mean and variance normalization (CMVN), and the target features were advanced front-end (AFE) features. Compared to CMVN, AFE provides better robust speech recognition performance, but its feature extraction is more complex and computationally expensive. With PFT, we aim to use a simple transfer function to obtain AFE-like acoustic features from the source CMVN features. Experimental results on Aurora-2 demonstrate that PFT generates AFE-like features that notably improve on CMVN performance and approach the results achieved by AFE. Furthermore, the recognition accuracy of PFT was better than that of histogram equalization (HEQ) and polynomial based histogram equalization (PHEQ). The results confirm the effectiveness of PFT with just a few sets of parallel features.
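The two phases described in the abstract can be illustrated with a minimal sketch. The abstract does not give the polynomial order, the exact form of the transfer function, or the estimation criterion, so everything here is an assumption made for illustration: one independent polynomial per feature dimension, fitted by least squares with NumPy, and the hypothetical names `estimate_transfer`, `convert`, and `order`. It should not be read as the authors' implementation.

```python
import numpy as np


def estimate_transfer(source, target, order=3):
    """Estimation phase (assumed form): fit one polynomial per feature dimension.

    source, target: parallel feature arrays of shape (n_frames, n_dims).
    Returns an array of shape (n_dims, order + 1), highest power first.
    """
    source = np.asarray(source, dtype=float)
    target = np.asarray(target, dtype=float)
    if source.shape != target.shape:
        raise ValueError("source and target must be parallel (same shape)")
    # Least-squares polynomial fit, done independently for each dimension.
    return np.stack(
        [np.polyfit(source[:, d], target[:, d], order)
         for d in range(source.shape[1])]
    )


def convert(source, coeffs):
    """Conversion phase: apply the estimated polynomials to new source features."""
    source = np.asarray(source, dtype=float)
    # Evaluate each dimension's polynomial on that dimension's values.
    return np.stack(
        [np.polyval(coeffs[d], source[:, d]) for d in range(source.shape[1])],
        axis=1,
    )
```

In this sketch, `estimate_transfer` would be called once on the small parallel set (for example, CMVN and AFE features for the same frames), and `convert` would then be applied to the larger set of source features for which no target features were extracted.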
Keywords
acoustic signal processing; feature extraction; polynomials; speech recognition; statistical analysis; AFE; ASR task; Aurora-2 database; CMVN; PFT algorithm; PHEQ; acoustic feature conversion; advanced front end feature; cepstral mean and variance normalization; conversion phase; estimation phase; polynomial based feature transferring algorithm; polynomial based histogram equalization; polynomial based transfer function; robust automatic speech recognition task; Acoustics; Feature extraction; Polynomials; Robustness; Speech; Speech recognition; Transfer functions; feature transformation; robust feature extraction; robust speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936632
Filename
6936632