DocumentCode :
337439
Title :
Discriminative spectral-temporal multiresolution features for speech recognition
Author :
McMahon, P. ; Harte, N. ; Vaseghi, S. ; McCourt, P.
Author_Institution :
Sch. of Electr. Eng., Queen´´s Univ., Belfast, UK
Volume :
2
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
581
Abstract :
Multi-resolution features, which are based on the premise that there may be more cues for phonetic discrimination in a given sub-band than in another, have been shown to outperform the standard MFCC feature set for both classification and recognition tasks on the TIMIT database. This paper presents an investigation into possible strategies to extend these ideas from the spectral domain into both the spectral and temporal domains. Experimental work on the integration of segmental models, which are better at capturing the longer term phonetic correlation of a phonetic unit, into the discriminative multi-resolution framework is presented. Results are presented which show that including this supplementary temporal information offers an improvement performance for the phoneme classification task over the standard multi-resolution MFCC feature set with time derivatives appended. Possible strategies for the extension of theses techniques into the area of continuous speech recognition are discussed
Keywords :
correlation methods; feature extraction; hidden Markov models; signal classification; signal resolution; spectral analysis; speech recognition; MFCC feature set; TIMIT database; continuous speech recognition; discriminative spectral-temporal multiresolution features; experiment; performance; phoneme classification; phonetic correlation; phonetic discrimination; phonetic unit; segmental HMM; spectral domain; speech classification; sub-band; temporal domain; temporal information; time derivatives; Auditory system; Cepstral analysis; Electronic mail; Hidden Markov models; Humans; Information filtering; Mel frequency cepstral coefficient; Spatial databases; Speech recognition; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.759733
Filename :
759733
Link To Document :
بازگشت