DocumentCode :
2176296
Title :
Phoneme recognition using Boosted Binary Features
Author :
Roy, Anindya ; Magimai-Doss, Mathew ; Marcel, Sébastien
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4868
Lastpage :
4871
Abstract :
In this paper, we propose a novel parts-based binary-valued feature for ASR. This feature is extracted using boosted ensembles of simple threshold-based classifiers. Each such classifier looks at a specific pair of time-frequency bins located on the spectro-temporal plane. These features termed as Boosted Binary Features (BBF) are integrated into standard HMM-based system by using multilayer perceptron (MLP) and single layer perceptron (SLP). Preliminary studies on TIMIT phoneme recognition task show that BBF yields similar or better performance compared to MFCC (67.8% accuracy for BBF vs. 66.3% accuracy for MFCC) using MLP, while it yields significantly better performance than MFCC (62.8% accuracy for BBF vs. 45.9% for MFCC) using SLP. This demonstrates the potential of the proposed feature for speech recognition.
Keywords :
feature extraction; hidden Markov models; multilayer perceptrons; pattern classification; speech recognition; time-frequency analysis; ASR; BBF; HMM-based system; MLP; SLP; TIMIT phoneme recognition task; boosted binary feature; multilayer perceptron; parts-based binary-valued feature; simple threshold-based classifier; single layer perceptron; spectrotemporal plane; speech recognition; time-frequency bin; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Time frequency analysis; Training; Phoneme recognition; automatic speech recognition; binary features; boosting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947446
Filename :
5947446
Link To Document :
بازگشت