DocumentCode
3760660
Title
On noise robust feature for speech recognition based on power function family
Author
Hilman F. Pardede
Author_Institution
Research Center for Informatics, Indonesian Institute of Sciences, Jl. Cisitu No 21/154D Bandung, Indonesia 40135
fYear
2015
Firstpage
386
Lastpage
390
Abstract
In this paper, a new feature robust against environmental noise is proposed for automatic speech recognition (ASR). This feature has similar extraction process with Power-Normalized Cepstral Coeffients (PNCC) except on two aspects. First, a generalization of the log function called the q-logarithmic function is applied to replace the power function and secondly, the mean normalization process is implemented before discrete cosine transform (DCT) instead of after it as in many traditional feature extraction algorithms. The proposed feature, called Q-Log Normalized Cepstral Coeffients (QLNCC), is shown more robust compared to two traditional features: MFCC and PLP. It is also better than PNCC without adding much complexity.
Keywords
"Speech","Feature extraction","Mel frequency cepstral coefficient","Speech recognition","Discrete cosine transforms","Robustness","Noise measurement"
Publisher
ieee
Conference_Titel
Intelligent Signal Processing and Communication Systems (ISPACS), 2015 International Symposium on
Type
conf
DOI
10.1109/ISPACS.2015.7432801
Filename
7432801
Link To Document