DocumentCode
3245048
Title
Improved robust features for speech recognition by integrating time-frequency principal components (TFPC) and histogram equalization (HEQ)
Author
Tsai, Shang-nien ; Lee, Lin-shun
Author_Institution
Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2003
fDate
30 Nov.-3 Dec. 2003
Firstpage
297
Lastpage
302
Abstract
Robustness for speech recognition technologies with respect to adverse environments has been a key issue for real applications. Time-frequency principal components (TFPC) features have been shown to be a set of powerful data-driven features under matched circumstances, while histogram equalization (HEQ) has been proposed as an efficient feature transformation approach to reduce the mismatch between training and testing conditions. It is proposed that TFPC features can be well integrated with HEQ. HEQ generates a well-matched environment, in which TFPC features can be properly utilized. Extensive experiments with respect to the AURORA2 database verified that improved performance in adverse circumstances can be achieved.
Keywords
learning (artificial intelligence); principal component analysis; speaker recognition; PCA; data-driven features; feature transformation; histogram equalization; principal component analysis; speech recognition; time-frequency principal component features; Acoustic applications; Automatic speech recognition; Histograms; Linear discriminant analysis; Mel frequency cepstral coefficient; Principal component analysis; Robustness; Spatial databases; Speech recognition; Time frequency analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN
0-7803-7980-2
Type
conf
DOI
10.1109/ASRU.2003.1318457
Filename
1318457
Link To Document