DocumentCode :
3245048
Title :
Improved robust features for speech recognition by integrating time-frequency principal components (TFPC) and histogram equalization (HEQ)
Author :
Tsai, Shang-nien ; Lee, Lin-shun
Author_Institution :
Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
2003
fDate :
30 Nov.-3 Dec. 2003
Firstpage :
297
Lastpage :
302
Abstract :
Robustness for speech recognition technologies with respect to adverse environments has been a key issue for real applications. Time-frequency principal components (TFPC) features have been shown to be a set of powerful data-driven features under matched circumstances, while histogram equalization (HEQ) has been proposed as an efficient feature transformation approach to reduce the mismatch between training and testing conditions. It is proposed that TFPC features can be well integrated with HEQ. HEQ generates a well-matched environment, in which TFPC features can be properly utilized. Extensive experiments with respect to the AURORA2 database verified that improved performance in adverse circumstances can be achieved.
Keywords :
learning (artificial intelligence); principal component analysis; speaker recognition; PCA; data-driven features; feature transformation; histogram equalization; principal component analysis; speech recognition; time-frequency principal component features; Acoustic applications; Automatic speech recognition; Histograms; Linear discriminant analysis; Mel frequency cepstral coefficient; Principal component analysis; Robustness; Spatial databases; Speech recognition; Time frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
Type :
conf
DOI :
10.1109/ASRU.2003.1318457
Filename :
1318457
Link To Document :
بازگشت