DocumentCode
336793
Title
Two-dimensional multi-resolution analysis of speech signals and its application to speech recognition
Author
Chan, C.P. ; Wong, Y.W. ; Lee, Tan ; Ching, P.C.
Author_Institution
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Volume
1
fYear
1999
fDate
15-19 Mar 1999
Firstpage
405
Abstract
This paper describes a novel approach of using multi-resolution analysis (MRA) for automatic speech recognition. Two-dimensional MRA is applied to the short-time log spectrum of speech signal to extract the slowly varying spectral envelope that contains the most important articulatory and phonetic information. After passing through a standard cepstral analysis process, the MRA features are used for speech recognition in the same way as conventional short-time features like MFCCs, PLPs, etc. Preliminary experiments on both clean connected speech and noisy telephone conversation speech show that the use of MRA cepstra results in a significant reduction in insertion error when compared with MFCCs
Keywords
cepstral analysis; feature extraction; signal resolution; speech processing; speech recognition; 2D multi-resolution analysis; MFCC; PLP; articulatory information; automatic speech recognition; cepstral analysis; clean connected speech; experiments; feature extraction; insertion error reduction; noisy telephone conversation speech; phonetic information; short-time log spectrum; slowly varying spectral envelope; speech signal; Band pass filters; Cepstral analysis; Data mining; Frequency; Low pass filters; Signal analysis; Signal resolution; Speech analysis; Speech recognition; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.758148
Filename
758148
Link To Document