Title of article
Robust Speech Recognition Based on Mixed Histogram Transform and Asymmetric Noise Suppression
Author/Authors
Farsi، Hassan نويسنده Department of Electronics and Communications Engineering, University of Birjand, Birjand, Iran , , Koohi Moghadam ، Samane نويسنده Department of Engineering, University of payam noor, Mashhaad, Iran ,
Issue Information
فصلنامه با شماره پیاپی 0 سال 2013
Pages
11
From page
1
To page
11
Abstract
This paper proposes a new feature extraction algorithm which is robust against noise using histogram compensation and asymmetric filter. Temporal masking is used to improve Automatic Speech Recognition (ASR) systems specifically in matched and multi-style training conditions. Nonlinear filtering and temporal masking are used in the proposed algorithm. By matching the power histograms of the input in each frequency band to those obtained over clean training data, and then mixing the processed and unprocessed spectrums together, speech recognition accuracy can be appropriately increased. The obtained results show that recognition accuracy in comparison with Mel Frequency Cepstral Coefficients (MFCC), Perceptual Linear Prediction (PLP) and Power Normalized Cepstral Coefficients (PNCC), improves in various training conditions and different SNRs.
Journal title
Majlesi Journal of Electrical Engineering
Serial Year
2013
Journal title
Majlesi Journal of Electrical Engineering
Record number
950962
Link To Document