DocumentCode :
3516198
Title :
Context-independent phoneme recognition using a K-Nearest Neighbour classification approach
Author :
Golipour, Ladan ; Shaughnessy, Douglas O.
Author_Institution :
INRS-EMT, Quebec Univ., Montreal, QC
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
1341
Lastpage :
1344
Abstract :
In this paper we investigate a non-parametric classification of English phonemes in speaker-independent continuous speech. We employ the ldquovotingrdquo k-nearest neighbour (k-NN) classifier, a powerful technique in pattern recognition problems, along with a new representation of phonemes for the speech recognition task. We also exploit the idea behind ldquoapproximaterdquo k-NN that results in a very fast way of computing the k approximate closest neighbours of each data point. Comparing the recognition performance of the proposed method with the HMM-based recognizer of HTK toolkit reveals that the k-NN-based recognizer outperforms its counterpart. In addition, incorporating the ldquoapproximaterdquo nearest neighbour search instead of the ldquoexactrdquo one results in completing the training step much faster than the HMM-based system, and the testing step with a comparable computational time. We also reduced the amount of the training data by applying a pattern recognition technique, called ldquothinningrdquo algorithm. The outcome was a considerable reduction in the k-NN search space and hence the execution time, and also a slight increase in the recognition performance.
Keywords :
feature extraction; hidden Markov models; natural languages; pattern classification; speech recognition; HMM-based system; context-independent English phoneme recognition; feature extraction; hidden Markov model; k-nearest neighbour classification approach; pattern recognition technique; speech recognition; thinning algorithm; Automatic speech recognition; Error analysis; Hidden Markov models; Humans; Loudspeakers; Pattern classification; Pattern recognition; Speech recognition; Training data; Voting; approximate index search; pattern classification; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4959840
Filename :
4959840
Link To Document :
بازگشت