DocumentCode
74573
Title
Robust speech recognition using harmonic features
Author
Yeh Huann Goh ; Raveendran, Paramesaran ; Jamuar, Sudhanshu S.
Author_Institution
Dept. of Electr. Eng., Univ. of Malaya, Kuala Lumpur, Malaysia
Volume
8
Issue
2
fYear
2014
fDate
Apr-14
Firstpage
167
Lastpage
175
Abstract
In this study, the authors propose a speech recognition system using harmonic structure related information to detect harmonic features in noisy environment. The proposed algorithm first extracts the harmonic components contained inside the speech signals using sine function convolution. By setting the frequency of the sine function as equal to the fundamental frequency of speech signals, harmonic components can be extracted out. The reconstructed signal obtained by summing up the extracted harmonic components is found to have a high degree of correlation with the original signal. The extracted frame energy measure of the harmonic components has been further processed to become dynamic harmonic features and then used together with the European Telecommunications Standards Institute (ETSI) front-end processed mel-frequency cepstral coefficients (MFCC) feature or the perceptual linear prediction (PLP) feature in the speech recognition system. The proposed enhanced speech recognition system shows a better recognition rate over the ETSI front-end processed MFCC (or PLP)-based speech recognition system.
Keywords
signal reconstruction; speech recognition; ETSI front-end processed MFCC; European Telecommunications Standards Institute; PLP-based speech recognition system; dynamic harmonic features; extracted frame energy; extracted harmonic components; front-end processed mel-frequency cepstral coefflcients; harmonic components; harmonic features; harmonic structure related information; high degree of correlation; noisy environment; perceptual linear prediction; reconstructed signal; robust speech recognition; sine function convolution; speech recognition system; speech signal frequency; speech signals;
fLanguage
English
Journal_Title
Signal Processing, IET
Publisher
iet
ISSN
1751-9675
Type
jour
DOI
10.1049/iet-spr.2013.0094
Filename
6786928
Link To Document