Title :
Joint feature and model training for minimum detection errors applied to speech subword detection
Author :
Johnsen, Magne H. ; Canterla, Alfonso M.
Author_Institution :
Dept. of Electron. & Telecommun., NTNU, Trondheim, Norway
Abstract :
This paper presents methods and results for joint optimization of the feature extraction and the model parameters of a detector. We further define a discriminative training criterion called Minimum Detection Error (MDE). The criterion can optimize the F-score or any other detection performance metric. The methods are used to design detectors of subwords in continuous speech, i.e. to spot phones and articulatory features. For each subword detector the MFCC filterbank matrix and the Gaussian means in the HMM models are jointly optimized. For experiments on TIMIT, the optimized detectors clearly outperform the baseline detectors and also our previous MCE based detectors. The results indicate that the same performance metric should be used for training and test and that accuracy outperforms F-score with respect to relative improvement. Furter, the optimized filterbanks usually reflect typical acoustic properties of the corresponding detection classes.
Keywords :
Gaussian processes; cepstral analysis; channel bank filters; error detection; hidden Markov models; optimisation; performance evaluation; speech processing; F-score; Gaussian means; HMM models; MCE based detectors; MDE; MFCC filterbank matrix; TIMIT; acoustic property; baseline detectors; continuous speech; detection classes; detection performance metric; discriminative training criterion; feature extraction; feature training; joint optimization; minimum detection errors; model parameters; model training; optimized detectors; optimized filterbanks; speech subword detection; subword detectors design; Detectors; Feature extraction; Hidden Markov models; Measurement; Mel frequency cepstral coefficient; Speech; Training; Discriminative training; Joint training of features and models; Minimum Detection Error criterion; Speech subword detection;
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2012 IEEE International Workshop on
Conference_Location :
Santander
Print_ISBN :
978-1-4673-1024-6
Electronic_ISBN :
1551-2541
DOI :
10.1109/MLSP.2012.6349729